Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Machine Learning

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 511 - 540 of 826

Full-Text Articles in Physical Sciences and Mathematics

Exploring Composite Dataset Biases For Heart Sound Classification, Davoud Shariat Panah, Andrew Hines, Susan Mckeever Jan 2020

Exploring Composite Dataset Biases For Heart Sound Classification, Davoud Shariat Panah, Andrew Hines, Susan Mckeever

Conference papers

In the last few years, the automatic classification of heart sounds has been widely studied as a screening method for heart disease. Some of these studies have achieved high accuracies in heart abnormality prediction. However, for such models to assist clinicians in the detection of heart abnormalities, it is of critical importance that they are generalisable, working on unseen real-world data. Despite the importance of generalisability, the presence of bias in the leading heart sound datasets used in these studies has remained unexplored. In this paper, we explore the presence of potential bias in heart sound datasets. Using a small …


An Automated Method For Detecting Water Levels Using Computer Vision And Artificial Intelligence, Priyanjani Chowdary Chandra Jan 2020

An Automated Method For Detecting Water Levels Using Computer Vision And Artificial Intelligence, Priyanjani Chowdary Chandra

Graduate Research Theses & Dissertations

Flooding is one of the most dangerous weather events today. Between 2015-2019, on average, it has caused more than 130 deaths every year in the USA alone. World Health Organization has reported that, between 1998-2017, floods have affected more than 2 billion people worldwide. The devastating nature of flood necessitates the continuous monitoring of water level in the rivers and streams in flood-prone areas to detect the incoming flood. In this thesis, we have designed and implemented a computer vision and AI-based system that continuously detect the water level in the creek. Our solution employs an effective template matching algorithm …


Exploring The Employment Landscape For Individuals With Autism Spectrum Disorders Using Supervised And Unsupervised Machine Learning, Kayleigh Hyde Jan 2020

Exploring The Employment Landscape For Individuals With Autism Spectrum Disorders Using Supervised And Unsupervised Machine Learning, Kayleigh Hyde

Computational and Data Sciences (PhD) Dissertations

Autism Spectrum Disorders (ASD) are a class of neurodevelopmental disorders which usually present with difficulties in social interactions, verbal and nonverbal forms of communication, repetitive behaviors, and restricted interests. Employment rates of young adults with ASD is a national concern, and research suggests that young adults with “high functioning” ASD experience significant difficulty in transitioning to work. One of the goals of this study was to identify the barriers associated with these individuals’ transition into the world of work. A classification tree analysis was used with a sample of 236 caregivers of individuals with ASD or the individuals themselves, who …


Fast Optimization Algorithms For Auc Maximization, Michael Natole, Jr. Jan 2020

Fast Optimization Algorithms For Auc Maximization, Michael Natole, Jr.

Legacy Theses & Dissertations (2009 - 2024)

Stochastic optimizations algorithms like stochastic gradient descent (SGD) are favorable for large-scale data analysis because they update the model sequentially and with low per-iteration costs. Much of the existing work focuses on optimizing accuracy, however, it is known that accuracy is not an appropriate measure for class imbalanced data. Area under the ROC curve (AUC) is a standard metric that is used to measure classification performance for such a situation. Therefore, developing stochastic learning algorithms that maximize AUC in lieu of accuracy is of both theoretical and practical interest. However, AUC maximization presents a challenge since the learning objective function …


Utilizing Raman Spectroscopy And Chemometrics For Novel Medical Diagnostic Purposes, Nicole M. Ralbovsky Jan 2020

Utilizing Raman Spectroscopy And Chemometrics For Novel Medical Diagnostic Purposes, Nicole M. Ralbovsky

Legacy Theses & Dissertations (2009 - 2024)

Many problems exist within the myriad of currently employed diagnostic techniques. Further, an incredibly wide variety of procedures are used to identify an even greater number of diseases which exist in the world. There is a definite unmet clinical need to improve the diagnostic capabilities of these methods, including improving test sensitivity and specificity, objectivity and definitiveness, and reducing cost and how invasive a test is, with an interest in replacing multiple diagnostic methods with one powerful tool. That is, the development of a singular technique which can accurately and objectively diagnose a wide variety of diseases in a cost-effective …


Process Data Analytics Using Deep Learning Techniques, Majid Moradi Aliabadi Jan 2020

Process Data Analytics Using Deep Learning Techniques, Majid Moradi Aliabadi

Wayne State University Theses

In chemical manufacturing plants, numerous types of data are accessible, which could be process operational data (historical or real-time), process design and product quality data, economic and environmental (including process safety, waste emission and health impact) data. Effective knowledge extraction from raw data has always been a very challenging task, especially the data needed for a type of study is huge. Other characteristics of process data such as noise, dynamics, and highly correlated process parameters make this more challenging.

In this study, we introduce an attention-based RNN for multi-step-ahead prediction that can have applications in model predictive control, fault diagnosis, …


Cheat Detection Using Machine Learning Within Counter-Strike: Global Offensive, Harry Dunham Jan 2020

Cheat Detection Using Machine Learning Within Counter-Strike: Global Offensive, Harry Dunham

Senior Independent Study Theses

Deep learning is becoming a steadfast means of solving complex problems that do not have a single concrete or simple solution. One complex problem that fits this description and that has also begun to appear at the forefront of society is cheating, specifically within video games. Therefore, this paper presents a means of developing a deep learning framework that successfully identifies cheaters within the video game CounterStrike: Global Offensive. This approach yields predictive accuracy metrics that range between 80-90% depending on the exact neural network architecture that is employed. This approach is easily scalable and applicable to all types of …


Toward Closing The Urban Surface Energy Balance Using Satellite Remote Sensing, Joshua Hrisko Jan 2020

Toward Closing The Urban Surface Energy Balance Using Satellite Remote Sensing, Joshua Hrisko

Dissertations and Theses

The energy exchanges at the Earth’s surface are responsible for many of the processes that govern weather, climate, human health, and energy use. This exchange, commonly known as the surface energy balance (SEB), determines the near-surface thermodynamic state by partitioning the available energy into surface fluxes. The net all-wave radiation is often the primary energy source, while the heat storage and sensible and latent heat fluxes account for the majority of energy distributed elsewhere. While the SEB of various natural environments(trees, crops, soils) has been well-observed and modeled, the urban surface energy balance remains elusive. This is due to the …


Exploration And Implementation Of Neural Ordinary Differential Equations, Long Huu Nguyen, Andy Malinsky Jan 2020

Exploration And Implementation Of Neural Ordinary Differential Equations, Long Huu Nguyen, Andy Malinsky

Capstone Showcase

Neural ordinary differential equations (ODEs) have recently emerged as a novel ap- proach to deep learning, leveraging the knowledge of two previously separate domains, neural networks and differential equations. In this paper, we first examine the back- ground and lay the foundation for traditional artificial neural networks. We then present neural ODEs from a rigorous mathematical perspective, and explore their advantages and trade-offs compared to traditional neural nets.


Autonomous Trading Strategies For Dynamic Energy Markets, Moinul Morshed Porag Chowdhury Jan 2020

Autonomous Trading Strategies For Dynamic Energy Markets, Moinul Morshed Porag Chowdhury

Open Access Theses & Dissertations

With increasing energy demand and an intermittent supply of renewable energy sources, our current energy grid needs a transformation towards a more robust, reliable energy trading architecture. The smart grid promises this architecture as the future of the present energy market, where traders will use digital technologies to automate the management of power delivery. It will improve many issues of the current energy grid such as sustainable, clean, renewable, reliable and secure energy supply, customer participation in markets, distributed generation, and transparency in energy trading. Using autonomous trading agents, we can bridge several dynamic energy markets and ensure an efficient …


An Examination Of The Smote And Other Smote-Based Techniques That Use Synthetic Data To Oversample The Minority Class In The Context Of Credit-Card Fraud Classification, Eduardo Parkinson De Castro Jan 2020

An Examination Of The Smote And Other Smote-Based Techniques That Use Synthetic Data To Oversample The Minority Class In The Context Of Credit-Card Fraud Classification, Eduardo Parkinson De Castro

Dissertations

This research project seeks to investigate some of the different sampling techniques that generate and use synthetic data to oversample the minority class as a means of handling the imbalanced distribution between non-fraudulent (majority class) and fraudulent (minority class) classes in a credit-card fraud dataset. The purpose of the research project is to assess the effectiveness of these techniques in the context of fraud detection which is a highly imbalanced and cost-sensitive dataset. Machine learning tasks that require learning from datasets that are highly unbalanced have difficulty learning since many of the traditional learning algorithms are not designed to cope …


Machine Learning Assisted Gait Analysis For The Determination Of Handedness In Able-Bodied People, Hugh Gallagher Jan 2020

Machine Learning Assisted Gait Analysis For The Determination Of Handedness In Able-Bodied People, Hugh Gallagher

Dissertations

This study has investigated the potential application of machine learning for video analysis, with a view to creating a system which can determine a person’s hand laterality (handedness) from the way that they walk (their gait). To this end, the convolutional neural network model VGG16 underwent transfer learning in order to classify videos under two ‘activities’: “walking left-handed” and “walking right-handed”. This saw varying degrees of success across five transfer learning trained models: Everything – the entire dataset; FiftyFifty – the dataset with enough right-handed samples removed to produce a set with parity between activities; Female – only the female …


Automatic Gaze Classification For Aviators: Using Multi-Task Convolutional Networks As A Proxy For Flight Instructor Observation, Justin Wilson, Sandro Scielzo, Sukumaran Nair, Eric C. Larson Jan 2020

Automatic Gaze Classification For Aviators: Using Multi-Task Convolutional Networks As A Proxy For Flight Instructor Observation, Justin Wilson, Sandro Scielzo, Sukumaran Nair, Eric C. Larson

International Journal of Aviation, Aeronautics, and Aerospace

In this work, we investigate how flight instructors observe aviator scan patterns and assign quality to an aviator's gaze. We first establish the reliability of instructors to assign similar quality to an aviator's scan patterns, and then investigate methods to automate this quality using machine learning. In particular, we focus on the classification of gaze for aviators in a mixed-reality flight simulation. We create and evaluate two machine learning models for classifying gaze quality of aviators: a task-agnostic model and a multi-task model. Both models use deep convolutional neural networks to classify the quality of pilot gaze patterns for 40 …


Orthogonal Recurrent Neural Networks And Batch Normalization In Deep Neural Networks, Kyle Eric Helfrich Jan 2020

Orthogonal Recurrent Neural Networks And Batch Normalization In Deep Neural Networks, Kyle Eric Helfrich

Theses and Dissertations--Mathematics

Despite the recent success of various machine learning techniques, there are still numerous obstacles that must be overcome. One obstacle is known as the vanishing/exploding gradient problem. This problem refers to gradients that either become zero or unbounded. This is a well known problem that commonly occurs in Recurrent Neural Networks (RNNs). In this work we describe how this problem can be mitigated, establish three different architectures that are designed to avoid this issue, and derive update schemes for each architecture. Another portion of this work focuses on the often used technique of batch normalization. Although found to be successful …


Cnn-Based Speed Detection Algorithm For Walking And Running Using Wrist-Worn Wearable Sensors, Venkata Devesh Reddy Seethi Jan 2020

Cnn-Based Speed Detection Algorithm For Walking And Running Using Wrist-Worn Wearable Sensors, Venkata Devesh Reddy Seethi

Graduate Research Theses & Dissertations

In recent years, there have been a surge in ubiquitous technologies such as smartwatches and fitness trackers that can track human physical activities effortlessly. These devices have enabled common citizens to track their physical fitness and encourage them to lead a healthy lifestyle. Among various exercises, walking and running are the most common activities people do in everyday life, either through commute, exercise, or by doing household chores. While performing these activities, the speed at which a person walks and runs is an essential factor to determine the intensity of activity. Therefore, it is important to measure walking/running speed to …


How Machine Learning And Probability Concepts Can Improve Nba Player Evaluation, Harrison Miller Jan 2020

How Machine Learning And Probability Concepts Can Improve Nba Player Evaluation, Harrison Miller

CMC Senior Theses

In this paper I will be breaking down a scholarly article, written by Sameer K. Deshpande and Shane T. Jensen, that proposed a new method to evaluate NBA players. The NBA is the highest level professional basketball league in America and stands for the National Basketball Association. They proposed to build a model that would result in how NBA players impact their teams chances of winning a game, using machine learning and probability concepts. I preface that by diving into these concepts and their mathematical backgrounds. These concepts include building a linear model using ordinary least squares method, the bias …


Studying The Effects Of Various Process Parameters On Early Age Hydration Of Single- And Multi-Phase Cementitious Systems, Rachel Cook Jan 2020

Studying The Effects Of Various Process Parameters On Early Age Hydration Of Single- And Multi-Phase Cementitious Systems, Rachel Cook

Doctoral Dissertations

”The hydration of multi-phase ordinary Portland cement (OPC) and its pure phase derivatives, such as tricalcium silicate (C3S) and belite (ß-C2S), are studied in the context varying process parameters -- for instance, variable water content, water activity, superplasticizer structure and dose, and mineral additive type and particle size. These parameters are studied by means of physical experiments and numerical/computational techniques, such as: thermodynamic estimations; numerical kinetic-based modelling; and artificial intelligence techniques like machine learning (ML) models. In the past decade, numerical kinetic modeling has greatly improved in terms of fitting experimental, isothermal calorimetry to kinetic-based modelling …


Exploring Virtual Worlds With Cultural Algorithms: Ancient Alpena-Amberley Land Bridge, Thomas Joseph Palazzolo Jan 2020

Exploring Virtual Worlds With Cultural Algorithms: Ancient Alpena-Amberley Land Bridge, Thomas Joseph Palazzolo

Wayne State University Dissertations

In this thesis the Land Bridge system (DEEPDIVE) is described. The goal of the project is to use Artificial Intelligence technology to aid Archaeologists in the discovery of ancient prehistoric sites, now underwater. The example used here is the Alpena-Amberley Land Bridge that stretched across Lake Huron from Alpena in Michigan to Amberley in Ontario. During the Ice Age (around 10,000 years ago) it was above water for several thousand years. It was postulated that during that time it was used as a migration pathway for caribou, a major food source then. AI techniques were used to create a virtual …


Multimodal Fusion Strategies For Outcome Prediction In Stroke, Esra Zihni, John D. Kelleher, Vince I. Madai, Ahmed Khalil, Ivana Galinovic, Jochen Fiebach, Michelle Livne, Dietmar Frey Jan 2020

Multimodal Fusion Strategies For Outcome Prediction In Stroke, Esra Zihni, John D. Kelleher, Vince I. Madai, Ahmed Khalil, Ivana Galinovic, Jochen Fiebach, Michelle Livne, Dietmar Frey

Conference papers

Data driven methods are increasingly being adopted in the medical domain for clinical predictive modeling. Prediction of stroke outcome using machine learning could provide a decision support system for physicians to assist them in patient-oriented diagnosis and treatment. While patient-specific clinical parameters play an important role in outcome prediction, a multimodal fusion approach that integrates neuroimaging with clinical data has the potential to improve accuracy. This paper addresses two research questions: (a) does multimodal fusion aid in the prediction of stroke outcome, and (b) what fusion strategy is more suitable for the task at hand. The baselines for our experimental …


The Application Of Machine Learning Models In The Concussion Diagnosis Process, Sujit Subhash Jan 2020

The Application Of Machine Learning Models In The Concussion Diagnosis Process, Sujit Subhash

Masters Theses

“Concussions represent a growing health concern and are challenging to diagnose and manage. Roughly four million concussions are diagnosed every year in the United States. Although research into the application of advanced metrics such as neuroimages and blood biomarkers has shown promise, they are yet to be implemented at a clinical level due to cost and reliability concerns. Therefore, concussion diagnosis is still reliant on clinical evaluations of symptoms, balance, and neurocognitive status and function. The lack of a universal threshold on these assessments makes the diagnosis process entirely reliant on a physician’s interpretation of these assessment scores. This study …


Image Features For Tuberculosis Classification In Digital Chest Radiographs, Brian Hooper Jan 2020

Image Features For Tuberculosis Classification In Digital Chest Radiographs, Brian Hooper

All Master's Theses

Tuberculosis (TB) is a respiratory disease which affects millions of people each year, accounting for the tenth leading cause of death worldwide, and is especially prevalent in underdeveloped regions where access to adequate medical care may be limited. Analysis of digital chest radiographs (CXRs) is a common and inexpensive method for the diagnosis of TB; however, a trained radiologist is required to interpret the results, and is subject to human error. Computer-Aided Detection (CAD) systems are a promising machine-learning based solution to automate the diagnosis of TB from CXR images. As the dimensionality of a high-resolution CXR image is very …


Deriving Statistical Inference From The Application Of Artificial Neural Networks To Clinical Metabolomics Data, Kevin M. Mendez Jan 2020

Deriving Statistical Inference From The Application Of Artificial Neural Networks To Clinical Metabolomics Data, Kevin M. Mendez

Theses: Doctorates and Masters

Metabolomics data are complex with a high degree of multicollinearity. As such, multivariate linear projection methods, such as partial least squares discriminant analysis (PLS-DA) have become standard. Non-linear projections methods, typified by Artificial Neural Networks (ANNs) may be more appropriate to model potential nonlinear latent covariance; however, they are not widely used due to difficulty in deriving statistical inference, and thus biological interpretation. Herein, we illustrate the utility of ANNs for clinical metabolomics using publicly available data sets and develop an open framework for deriving and visualising statistical inference from ANNs equivalent to standard PLS-DA methods.


Process Based Analysis Of Fluvial Stratigraphic Record: Middle Pennsylvanian Allegheny Formation, North-Central Wv, Oluwasegun O. Abatan Jan 2020

Process Based Analysis Of Fluvial Stratigraphic Record: Middle Pennsylvanian Allegheny Formation, North-Central Wv, Oluwasegun O. Abatan

Graduate Theses, Dissertations, and Problem Reports

Fluvial deposits represent some of the best hydrocarbon reservoirs, but the quality of fluvial reservoirs varies depending on the reservoir architecture, which is controlled by allogenic and autogenic processes. Allogenic controls, including paleoclimate, tectonics, and glacio-eustasy, have long been debated as dominant controls in the deposition of fluvial strata. However, recent research has questioned the validity of this cyclicity and may indicate major influence from autogenic controls. To further investigate allogenic controls on stratal order, I analyzed the facies architecture, geomorphology, paleohydrology, and the stratigraphic framework of the Middle Pennsylvanian Allegheny Formation (MPAF), a fluvial depositional system in the Appalachian …


Estimating Refactoring Efforts For Architecture Technical Debt, Samir Deeb Jan 2020

Estimating Refactoring Efforts For Architecture Technical Debt, Samir Deeb

Graduate Theses, Dissertations, and Problem Reports

Paying-off the Architectural Technical Debt by refactoring the flawed code is important to control the debt and to keep it as low as possible. Project Managers tend to delay paying off this debt because they face difficulties in comparing the cost of the refactoring against the benefits they gain. For these managers to decide whether to refactor or to postpone, they need to estimate the cost and the efforts required to conduct these refactoring activities as well as to decide which flaws have higher priority to be refactored among others.

Our research is based on a dataset used by other …


Three Essays On Health Economics And Policy Evaluation, Shishir Shakya Jan 2020

Three Essays On Health Economics And Policy Evaluation, Shishir Shakya

Graduate Theses, Dissertations, and Problem Reports

This dissertation consists of three essays on the U.S. Health care policy. Each paragraph below refers to the three abstracts for the three chapters in this dissertation, respectively. I provide quantitative evidence on how much Prescription Drug Monitoring Programs (PDMPs) affects the retail opioid prescribing behaviors. Using the American Community Survey (ACS), I retrieve county-level high dimensional panel data set from 2010 to 2017. I employ three separate identification strategies: difference-in-difference, double selection post-LASSO, and spatial difference-in-difference. I compare how the retail opioid prescribing behaviors of counties, that are mandatory for prescribers to check the PDMP before prescribing controlled substances …


Representation Learning With Adversarial Latent Autoencoders, Stanislav Pidhorskyi M.S. Jan 2020

Representation Learning With Adversarial Latent Autoencoders, Stanislav Pidhorskyi M.S.

Graduate Theses, Dissertations, and Problem Reports

A large number of deep learning methods applied to computer vision problems require encoder-decoder maps. These methods include, but are not limited to, self-representation learning, generalization, few-shot learning, and novelty detection. Encoder-decoder maps are also useful for photo manipulation, photo editing, superresolution, etc. Encoder-decoder maps are typically learned using autoencoder networks.
Traditionally, autoencoder reciprocity is achieved in the image-space using pixel-wise
similarity loss, which has a widely known flaw of producing non-realistic reconstructions. This flaw is typical for the Variational Autoencoder (VAE) family and is not only limited to pixel-wise similarity losses, but is common to all methods relying upon …


Searches For Fast Radio Bursts Using Machine Learning, Devansh Agarwal Jan 2020

Searches For Fast Radio Bursts Using Machine Learning, Devansh Agarwal

Graduate Theses, Dissertations, and Problem Reports

Fast Radio bursts (FRBs) are enigmatic astrophysical events with millisecond durations and flux densities in the range 0.1-100 Jy, with the prototype source discovered by Lorimer et al. (2007). Like pulsars, FRBs show the characteristic inverse square sweep in observing frequency due to propagation through an ionized medium. This effect is quantified by the dispersion measure (DM). Unlike pulsars, FRBs have anomalously high DMs, which are consistent with an extragalactic origin. Over 100 FRBs have been published at the time of writing, and 13 have been conclusively identified with host galaxies with spectroscopically determined redshifts in the range 0.003 ≤ …


A Machine Learning Approach To Estimate The Annihilation Photon Interactions Inside The Scintillator Of A Pet Scanner, Sai Akhil Bharthavarapu Jan 2020

A Machine Learning Approach To Estimate The Annihilation Photon Interactions Inside The Scintillator Of A Pet Scanner, Sai Akhil Bharthavarapu

Graduate Theses, Dissertations, and Problem Reports

Biochemical processes are chemical processes that occur in living organisms. They can be studied with nuclear medicine through the help of radioactive tracers. Based on the radioisotope used, the photons that are emitted from the body tissue are either detected by single-photon emission computed tomography (SPECT) or by positron emission tomography (PET) scanners. SPECT uses gamma rays as tracer but gives a weaker contrast and spatial resolution compared to a PET scanner which uses positrons as tracer. PET scans show the metabolic changes occurring at the cellular level in an organ or a tissue. This detection is important because diseases …


Deep Neural Architectures For End-To-End Relation Extraction, Tung Tran Jan 2020

Deep Neural Architectures For End-To-End Relation Extraction, Tung Tran

Theses and Dissertations--Computer Science

The rapid pace of scientific and technological advancements has led to a meteoric growth in knowledge, as evidenced by a sharp increase in the number of scholarly publications in recent years. PubMed, for example, archives more than 30 million biomedical articles across various domains and covers a wide range of topics including medicine, pharmacy, biology, and healthcare. Social media and digital journalism have similarly experienced their own accelerated growth in the age of big data. Hence, there is a compelling need for ways to organize and distill the vast, fragmented body of information (often unstructured in the form of natural …


Ordinal Hyperplane Loss, Bob Vanderheyden Dec 2019

Ordinal Hyperplane Loss, Bob Vanderheyden

Doctor of Data Science and Analytics Dissertations

This research presents the development of a new framework for analyzing ordered class data, commonly called “ordinal class” data. The focus of the work is the development of classifiers (predictive models) that predict classes from available data. Ratings scales, medical classification scales, socio-economic scales, meaningful groupings of continuous data, facial emotional intensity and facial age estimation are examples of ordinal data for which data scientists may be asked to develop predictive classifiers. It is possible to treat ordinal classification like any other classification problem that has more than two classes. Specifying a model with this strategy does not fully utilize …