Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Machine Learning

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 571 - 600 of 826

Full-Text Articles in Physical Sciences and Mathematics

Machine Learning Pipeline For Exoplanet Classification, George Clayton Sturrock, Brychan Manry, Sohail Rafiqi May 2019

Machine Learning Pipeline For Exoplanet Classification, George Clayton Sturrock, Brychan Manry, Sohail Rafiqi

SMU Data Science Review

Planet identification has typically been a tasked performed exclusively by teams of astronomers and astrophysicists using methods and tools accessible only to those with years of academic education and training. NASA’s Exoplanet Exploration program has introduced modern satellites capable of capturing a vast array of data regarding celestial objects of interest to assist with researching these objects. The availability of satellite data has opened up the task of planet identification to individuals capable of writing and interpreting machine learning models. In this study, several classification models and datasets are utilized to assign a probability of an observation being an exoplanet. …


Model-Independent Estimation Of Optimal Hedging Strategies With Deep Neural Networks, Tobias Michael Furtwaengler May 2019

Model-Independent Estimation Of Optimal Hedging Strategies With Deep Neural Networks, Tobias Michael Furtwaengler

Theses and Dissertations

Inspired by the recent paper Buehler et al. (2018), this thesis aims to investigate the optimal hedging and pricing of financial derivatives with neural networks. We utilize the concept of convex risk measures to define optimal hedging strategies without strong assumptions on the underlying market dynamics. Furthermore, the setting allows the incorporation of market frictions and thus the determination of optimal hedging strategies and prices even in incomplete markets. We then use the approximation capabilities of neural networks to find close-to optimal estimates for these strategies.

We will elaborate on the theoretical foundations of this approach and carry out implementations …


Model-Independent Estimation Of Optimal Hedging Strategies With Deep Neural Networks, Tobias Michael Furtwaengler May 2019

Model-Independent Estimation Of Optimal Hedging Strategies With Deep Neural Networks, Tobias Michael Furtwaengler

Theses and Dissertations

Inspired by the recent paper Buehler et al. (2018), this thesis aims to investigate the optimal hedging and pricing of financial derivatives with neural networks. We utilize the concept of convex risk measures to define optimal hedging strategies without strong assumptions on the underlying market dynamics. Furthermore, the setting allows the incorporation of market frictions and thus the determination of optimal hedging strategies and prices even in incomplete markets. We then use the approximation capabilities of neural networks to find close-to optimal estimates for these strategies.

We will elaborate on the theoretical foundations of this approach and carry out implementations …


Teaching Computers To Teach Themselves: Synthesizing Training Data Based On Human-Perceived Elements, James Little May 2019

Teaching Computers To Teach Themselves: Synthesizing Training Data Based On Human-Perceived Elements, James Little

Honors Projects

Isolation-Based Scene Generation (IBSG) is a process for creating synthetic datasets made to train machine learning detectors and classifiers. In this project, we formalize the IBSG process and describe the scenarios—object detection and object classification given audio or image input—in which it can be useful. We then look at the Stanford Street View House Number (SVHN) dataset and build several different IBSG training datasets based on existing SVHN data. We try to improve the compositing algorithm used to build the IBSG dataset so that models trained with synthetic data perform as well as models trained with the original SVHN training …


Supervised Machine Learning Models For Fake News Detection, Gofaas Group, Andrea Lopez, Adelo Vieira, Zafar Ahsan, Farooq Saqib, Shirley Marinho May 2019

Supervised Machine Learning Models For Fake News Detection, Gofaas Group, Andrea Lopez, Adelo Vieira, Zafar Ahsan, Farooq Saqib, Shirley Marinho

ICT

Fake news or the distribution of disinformation has become one of the most challenging issues in society. News and information are churned out across online websites and platforms in real-time, with little or no way for the viewing public to determine what is real or manufactured. But an awareness of what we are consuming online is becoming apparent and efforts are underway to explore how we separate fake content from genuine and truthful information.

The most challenging part of fake news is determining how to spot it. In technology, there are ways to help us do this. Supervised machine learning …


Commonsense Knowledge In Sentiment Analysis Of Ordinance Reactions For Smart Governance, Manish Puri May 2019

Commonsense Knowledge In Sentiment Analysis Of Ordinance Reactions For Smart Governance, Manish Puri

Theses, Dissertations and Culminating Projects

Smart Governance is an emerging research area which has attracted scientific as well as policy interests, and aims to improve collaboration between government and citizens, as well as other stakeholders. Our project aims to enable lawmakers to incorporate data driven decision making in enacting ordinances. Our first objective is to create a mechanism for mapping ordinances (local laws) and tweets to Smart City Characteristics (SCC). The use of SCC has allowed us to create a mapping between a huge number of ordinances and tweets, and the use of Commonsense Knowledge (CSK) has allowed us to utilize human judgment in mapping. …


Deep Embedding Kernel, Linh Le Apr 2019

Deep Embedding Kernel, Linh Le

Doctor of Data Science and Analytics Dissertations

Kernel methods and deep learning are two major branches of machine learning that have achieved numerous successes in both analytics and artificial intelligence. While having their own unique characteristics, both branches work through mapping data to a feature space that is supposedly more favorable towards the given task. This dissertation addresses the strengths and weaknesses of each mapping method through combining them and forming a family of novel deep architectures that center around the Deep Embedding Kernel (DEK). In short, DEK is a realization of a kernel function through a newly deep architecture. The mapping in DEK is both implicit …


Predicting Win Rates In Competitive Overwatchtm, Andrea Sibley Apr 2019

Predicting Win Rates In Competitive Overwatchtm, Andrea Sibley

Mathematics Senior Capstone Papers

OverwatchTM is a video game published by Blizzard Entertainment R where two teams comprised of six people each compete against one another to accomplish a specific goal. The goal of each game is dependent on which map is being played. The maps are divided into four categories: Assault, Escort, Control, and Hybrid. A data set comprised of 3000 games of competitive OverwatchTM is used to determine how likely a team is to win their match. The factors used to determine the likelihood of winning are the map type and the skill ranking for each team. The data set is pre-processed …


Detection And Prevention Of Abuse In Online Social Networks, Sajedul Karim Talukder Mar 2019

Detection And Prevention Of Abuse In Online Social Networks, Sajedul Karim Talukder

FIU Electronic Theses and Dissertations

Adversaries leverage social networks to collect sensitive data about regular users and target them with abuse that includes fake news, cyberbullying, malware distribution, and propaganda. Such behavior is more effective when performed by the social network friends of victims. In two preliminary user studies we found that 71 out of 80 participants have at least 1 Facebook friend with whom (1) they never interact, either in Facebook or in real life, or whom they believe is (2) likely to abuse their posted photos or status updates, or (3) post offensive, false or malicious content. Such friend abuse is often considered …


Dissertation_Davis.Pdf, Brian Davis Mar 2019

Dissertation_Davis.Pdf, Brian Davis

brian davis

Simplices are the ``simplest" examples of polytopes, and yet they exhibit much of the rich and subtle combinatorics and commutative algebra of their more general cousins. In this way they are sufficiently complicated --- insights gained from their study can inform broader research in Ehrhart theory and associated fields.

In this dissertation we consider two previously unstudied properties of lattice simplices; one algebraic and one combinatorial. The first is the Poincare series of the associated semigroup algebra, which is substantially more complicated than the Hilbert series of that same algebra. The second is the partial ordering of the elements of …


Neural Machine Translation, Quinn M. Lanners, Thomas Laurent Mar 2019

Neural Machine Translation, Quinn M. Lanners, Thomas Laurent

Honors Thesis

Neural Machine Translation is the primary algorithm used in industry to perform machine translation. This state-of-the-art algorithm is an application of deep learning in which massive datasets of translated sentences are used to train a model capable of translating between any two languages. The architecture behind neural machine translation is composed of two recurrent neural networks used together in tandem to create an Encoder Decoder structure. Attention mechanisms have recently been developed to further increase the accuracy of these models. In this senior thesis, the various parts of Neural Machine Translation are explored towards the eventual creation of a tutorial …


A Study Of Face Embedding In Face Recognition, Khanh Duc Le Mar 2019

A Study Of Face Embedding In Face Recognition, Khanh Duc Le

Master's Theses

Face Recognition has been a long-standing topic in computer vision and pattern recognition field because of its wide and important applications in our daily lives such as surveillance system, access control, and so on. The current modern face recognition model, which keeps only a couple of images per person in the database, can now recognize a face with high accuracy. Moreover, the model does not need to be retrained every time a new person is added to the database.

By using the face dataset from Digital Democracy, the thesis will explore the capability of this model by comparing it with …


Cs04all: Machine Learning Module, Hunter R. Johnson Feb 2019

Cs04all: Machine Learning Module, Hunter R. Johnson

Open Educational Resources

These are materials that may be used in a CS0 course as a light introduction to machine learning.

The materials are mostly Jupyter notebooks which contain a combination of labwork and lecture notes. There are notebooks on Classification, An Introduction to Numpy, and An Introduction to Pandas.

There are also two assessments that could be assigned to students. One is an essay assignment in which students are asked to read and respond to an article on machine bias. The other is a lab-like exercise in which students use pandas and numpy to extract useful information about subway ridership in NYC. …


Dish: Democracy In State Houses, Nicholas A. Russo Feb 2019

Dish: Democracy In State Houses, Nicholas A. Russo

Master's Theses

In our current political climate, state level legislators have become increasingly impor- tant. Due to cuts in funding and growing focus at the national level, public oversight for these legislators has drastically decreased. This makes it difficult for citizens and activists to understand the relationships and commonalities between legislators. This thesis provides three contributions to address this issue. First, we created a data set containing over 1200 features focused on a legislator’s activity on bills. Second, we created embeddings that represented a legislator’s level of activity and engagement for a given bill using a custom model called Democracy2Vec. Third, we …


An Evaluation Of Training Size Impact On Validation Accuracy For Optimized Convolutional Neural Networks, Jostein Barry-Straume, Adam Tschannen, Daniel W. Engels, Edward Fine Jan 2019

An Evaluation Of Training Size Impact On Validation Accuracy For Optimized Convolutional Neural Networks, Jostein Barry-Straume, Adam Tschannen, Daniel W. Engels, Edward Fine

SMU Data Science Review

In this paper, we present an evaluation of training size impact on validation accuracy for an optimized Convolutional Neural Network (CNN). CNNs are currently the state-of-the-art architecture for object classification tasks. We used Amazon’s machine learning ecosystem to train and test 648 models to find the optimal hyperparameters with which to apply a CNN towards the Fashion-MNIST (Mixed National Institute of Standards and Technology) dataset. We were able to realize a validation accuracy of 90% by using only 40% of the original data. We found that hidden layers appear to have had zero impact on validation accuracy, whereas the neural …


Comparisons Of Performance Between Quantum And Classical Machine Learning, Christopher Havenstein, Damarcus Thomas, Swami Chandrasekaran Jan 2019

Comparisons Of Performance Between Quantum And Classical Machine Learning, Christopher Havenstein, Damarcus Thomas, Swami Chandrasekaran

SMU Data Science Review

In this paper, we present a performance comparison of machine learning algorithms executed on traditional and quantum computers. Quantum computing has potential of achieving incredible results for certain types of problems, and we explore if it can be applied to machine learning. First, we identified quantum machine learning algorithms with reproducible code and had classical machine learning counterparts. Then, we found relevant data sets with which we tested the comparable quantum and classical machine learning algorithm's performance. We evaluated performance with algorithm execution time and accuracy. We found that quantum variational support vector machines in some cases had higher accuracy …


Comparative Study Of Sentiment Analysis With Product Reviews Using Machine Learning And Lexicon-Based Approaches, Heidi Nguyen, Aravind Veluchamy, Mamadou Diop, Rashed Iqbal Jan 2019

Comparative Study Of Sentiment Analysis With Product Reviews Using Machine Learning And Lexicon-Based Approaches, Heidi Nguyen, Aravind Veluchamy, Mamadou Diop, Rashed Iqbal

SMU Data Science Review

In this paper, we present a comparative study of text sentiment classification models using term frequency inverse document frequency vectorization in both supervised machine learning and lexicon-based techniques. There have been multiple promising machine learning and lexicon-based techniques, but the relative goodness of each approach on specific types of problems is not well understood. In order to offer researchers comprehensive insights, we compare a total of six algorithms to each other. The three machine learning algorithms are: Logistic Regression (LR), Support Vector Machine (SVM), and Gradient Boosting. The three lexicon-based algorithms are: Valence Aware Dictionary and Sentiment Reasoner (VADER), Pattern, …


Improving Vix Futures Forecasts Using Machine Learning Methods, James Hosker, Slobodan Djurdjevic, Hieu Nguyen, Robert Slater Jan 2019

Improving Vix Futures Forecasts Using Machine Learning Methods, James Hosker, Slobodan Djurdjevic, Hieu Nguyen, Robert Slater

SMU Data Science Review

The problem of forecasting market volatility is a difficult task for most fund managers. Volatility forecasts are used for risk management, alpha (risk) trading, and the reduction of trading friction. Improving the forecasts of future market volatility assists fund managers in adding or reducing risk in their portfolios as well as in increasing hedges to protect their portfolios in anticipation of a market sell-off event. Our analysis compares three existing financial models that forecast future market volatility using the Chicago Board Options Exchange Volatility Index (VIX) to six machine/deep learning supervised regression methods. This analysis determines which models provide best …


Data Driven Approach To Characterize And Forecast The Impact Of Freeway Work Zones On Mobility Using Probe Vehicle Data, Mohsen Kamyab Jan 2019

Data Driven Approach To Characterize And Forecast The Impact Of Freeway Work Zones On Mobility Using Probe Vehicle Data, Mohsen Kamyab

Wayne State University Dissertations

The presence of work zones on freeways causes traffic congestion and creates hazardous conditions for commuters and construction workers. Traffic congestion resulting from work zones causes negative impacts on traffic mobility (delay), the environment (vehicle emissions), and safety when stopped or slowed vehicles become vulnerable to rear-end collisions. Addressing these concerns, a data-driven approach was utilized to develop methodologies to measure, predict, and characterize the impact work zones have on Michigan interstates. This study used probe vehicle data, collected from GPS devices in vehicles, as the primary source for mobility data. This data was used to fulfill three objectives: develop …


A Machine Learning Recommender Model For Ride Sharing Based On Rider Characteristics And User Threshold Time, Govind Pramod Yatnalkar Jan 2019

A Machine Learning Recommender Model For Ride Sharing Based On Rider Characteristics And User Threshold Time, Govind Pramod Yatnalkar

Theses, Dissertations and Capstones

In the present age, human life is prospering incredibly due to the 4th Industrial Revolution or The Age of Digitization and Computing. The ubiquitous availability of the Internet and advanced computing systems have resulted in the rapid development of smart cities. From connected devices to live vehicle tracking, technology is taking the field of transportation to a new level. An essential part of the transportation domain in smart cities is Ride Sharing. It is an excellent solution to issues like pollution, traffic, and the rapid consumption of fuel. Even though Ride Sharing has several benefits, the current usage is …


Study Of Higgs Production From H -> Zz -> 4l Channel Using Machine Learning Methods, Daniel Arthur Faia Jr. Jan 2019

Study Of Higgs Production From H -> Zz -> 4l Channel Using Machine Learning Methods, Daniel Arthur Faia Jr.

Graduate Research Theses & Dissertations

In this thesis I will show how machine learning methods can improve on physics analysis in the H -> ZZ -> 4l channel. In particular we will explore how these methods can be used to classify Vector Boson Fusion (VBF) processes in the presence of more dominant Higgs production processes. The aim is to improve the ability to discriminate VBF Higgs boson production relative to other Higgs boson production modes. Since VBF has two quark jets in the final state, it is useful to discriminate between quark and gluon jets. We compare the effectiveness of quark gluon discrimination with machine …


Assessing The Performance And Merit Of The Random Survival Forest And Cox Models On A Pancreatic Cancer Data Set, Carl Edward Mueller Jan 2019

Assessing The Performance And Merit Of The Random Survival Forest And Cox Models On A Pancreatic Cancer Data Set, Carl Edward Mueller

Graduate Research Theses & Dissertations

Random Survival Forest (RSF) is one of the most powerful and easily applied machine learning models for survival data. RSF sacrifices some of the interpretability of the decision trees used to grow the forest in order to significantly reduce the bias and variance of the basic classification and regression tree (CART) paradigm. The lessened interpretability and higher computational intensity of RSF means that it may not always be the preferred method, even in settings where black-box methods are readily used. By contrast, the Cox Proportional Hazards (PH) model is incredibly flexible, resistant to overfitting, and transparently estimable. The tradeoff for …


A Novel Set Of Weight Initialization Techniques For Deep Learning Architectures, Diego Aguirre Jan 2019

A Novel Set Of Weight Initialization Techniques For Deep Learning Architectures, Diego Aguirre

Open Access Theses & Dissertations

The importance of weight initialization when building a deep learning model is often underappreciated. Even though it is usually seen as a minor detail in the model creation cycle, this process has shown to have a strong impact on the training time of a network and the quality of the resulting model. In fact, the implications of choosing a poor initialization scheme range from leading to the creation of a poorly performing model to preventing optimization techniques (like stochastic gradient descent) from converging.

In this work, we introduce and evaluate a set of novel weight initialization techniques for deep learning …


The Use Of Cultural Algorithms To Learn The Impact Of Climate On Local Fishing Behavior In Cerro Azul, Peru, Khalid Kattan Jan 2019

The Use Of Cultural Algorithms To Learn The Impact Of Climate On Local Fishing Behavior In Cerro Azul, Peru, Khalid Kattan

Wayne State University Dissertations

Recently it has been found that the earth’s oceans are warming at a pace that is 40% faster than predicted by a United Nations panel a few years ago. As a result, 2019 has become the warmest year on record for the earth’s oceans. That is because the oceans have acted as a buffer by absorbing 93% of the heat produced by the greenhouse gases [40].

The impact of the oceanic warming has already been felt in terms of the periodic warming of the Pacific Ocean as an effect of the ENSO process. The ENSO process is a cycle of …


Capso: A Multi-Objective Cultural Algorithm System To Predict Locations Of Ancient Sites, Samuel Dustin Stanley Jan 2019

Capso: A Multi-Objective Cultural Algorithm System To Predict Locations Of Ancient Sites, Samuel Dustin Stanley

Wayne State University Dissertations

ABSTRACT

CAPSO: A MULTI-OBJECTIVE CULTURAL ALGORITHM SYSTEM TO PREDICT LOCATIONS OF ANCIENT SITES

by

SAMUEL DUSTIN STANLEY

August 2019

Advisor: Dr. Robert Reynolds

Major: Computer Science

Degree: Doctor of Philosophy

The recent archaeological discovery by Dr. John O’Shea at University of Michigan of prehistoric caribou remains and Paleo-Indian structures underneath the Great Lakes has opened up an opportunity for Computer Scientists to develop dynamic systems modelling these ancient caribou routes and hunter-gatherer settlement systems as well as the prehistoric environments that they existed in. The Wayne State University Cultural Algorithm team has been interested assisting Dr. O’Shea’s archaeological team by …


Performance Comparison Of Hybrid Cnn-Svm And Cnn-Xgboost Models In Concrete Crack Detection, Sahana Thiyagarajan Jan 2019

Performance Comparison Of Hybrid Cnn-Svm And Cnn-Xgboost Models In Concrete Crack Detection, Sahana Thiyagarajan

Dissertations

Detection of cracks mainly has been a sort of essential step in visual inspection involved in construction engineering as it is the commonly used building material and cracks in them is an early sign of de-basement. It is hard to find cracks by a visual check for the massive structures. So, the development of crack detecting systems generally has been a critical issue. The utilization of contextual image processing in crack detection is constrained, as image data usually taken under real-world situations vary widely and also includes the complex modelling of cracks and the extraction of handcrafted features. Therefore the …


Abso2luteu-Net: Tissue Oxygenation Calculation Using Photoacoustic Imaging And Convolutional Neural Networks, Kevin Hoffer-Hawlik, Geoffrey P. Luke Jan 2019

Abso2luteu-Net: Tissue Oxygenation Calculation Using Photoacoustic Imaging And Convolutional Neural Networks, Kevin Hoffer-Hawlik, Geoffrey P. Luke

ENGS 88 Honors Thesis (AB Students)

Photoacoustic (PA) imaging uses incident light to generate ultrasound signals within tissues. Using PA imaging to accurately measure hemoglobin concentration and calculate oxygenation (sO2) requires prior tissue knowledge and costly computational methods. However, this thesis shows that machine learning algorithms can accurately and quickly estimate sO2. absO2luteU-Net, a convolutional neural network, was trained on Monte Carlo simulated multispectral PA data and predicted sO2 with higher accuracy compared to simple linear unmixing, suggesting machine learning can solve the fluence estimation problem. This project was funded by the Kaminsky Family Fund and the Neukom Institute.


Exploring Cyber-Physical Systems, Misbah Uddin Mohammed Jan 2019

Exploring Cyber-Physical Systems, Misbah Uddin Mohammed

Graduate Research Theses & Dissertations

The advances in IOT, Computer Vision, AI and Machine Learning have made these technologies ubiquitous to our daily lives. From Smart Phones to Connected Vehicles, Cyber Physical systems have been interspersed into everything we interact in today’s world. The aim or this thesis was to explore these advances in Cyber Physical Systems and analyze the different sectors they were affecting. We then hand-picked certain domains and explored further by carrying out practical projects using some of the latest software and hardware resources available. Technologies like Amazon Alexa services, NVIDIA Jetson boards, TensorFlow, OpenCV, NodeJS were heavily employed in our various …


Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas Jan 2019

Towards Improving Accuracy And Interpretability Of Deep Learning Based On Satellite Image Classification, Yamile Patino Vargas

Dissertations and Theses

ABSTRACT

The study of satellite images provides a way to monitor changes in the surface of the Earth and the atmosphere. Convolutional Neural Networks (CNN) have shown accurate results in solving practical problems in multiple fields. Some of the more recognized fields using CNNs are satellite imagery processing, medicine, communication, transportation, and computer vision. Despite the success of CNNs, there remains a need to explain the network predictions further and understand what the network is determining as valuable information.

There are several frameworks and methodologies developed to explain how CNNs predict outputs and what their internal representations are [1, 4, …


Explainable Neural Networks Based Anomaly Detection For Cyber-Physical Systems, Kasun Amarasinghe Jan 2019

Explainable Neural Networks Based Anomaly Detection For Cyber-Physical Systems, Kasun Amarasinghe

Theses and Dissertations

Cyber-Physical Systems (CPSs) are the core of modern critical infrastructure (e.g. power-grids) and securing them is of paramount importance. Anomaly detection in data is crucial for CPS security. While Artificial Neural Networks (ANNs) are strong candidates for the task, they are seldom deployed in safety-critical domains due to the perception that ANNs are black-boxes. Therefore, to leverage ANNs in CPSs, cracking open the black box through explanation is essential.

The main objective of this dissertation is developing explainable ANN-based Anomaly Detection Systems for Cyber-Physical Systems (CP-ADS). The main objective was broken down into three sub-objectives: 1) Identifying key-requirements that an …