Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Machine Learning

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 31 - 60 of 826

Full-Text Articles in Physical Sciences and Mathematics

Ghost Connect-Net: A Connectivity-Based Companion Network To Enhance Pruning Methods, Mary Isabelle Wisell May 2024

Ghost Connect-Net: A Connectivity-Based Companion Network To Enhance Pruning Methods, Mary Isabelle Wisell

Honors College

Deep neural network (DNN) approaches excel in various real-world applications like robotics and computer vision, yet their computational demands and memory requirements hinder usability on advanced devices. Also, larger models heighten overparameterization risks, making networks more vulnerable to input disturbances. Recent studies aim to boost DNN efficiency by trimming redundant neurons or filters based on task relevance. Instead of introducing a new pruning method, this project aims to enhance existing techniques by introducing a companion network, Ghost Connect-Net (GC-Net), to monitor the connections in the original network. The initial weights of GC- Net are equal to the connectivity measurements of …


Gender Detection In Facial Images: A Comprehensive Cnn Analysis, Jose N T Ambrosio, Anas Hourani, Magdalene Moy Apr 2024

Gender Detection In Facial Images: A Comprehensive Cnn Analysis, Jose N T Ambrosio, Anas Hourani, Magdalene Moy

SACAD: John Heinrichs Scholarly and Creative Activity Days

This research investigates the construction of a robust gender detection system using facial features and Convolutional Neural Networks (CNNs), exploring the impact of different layer configurations on accuracy and computational efficiency. With a validation accuracy of 91%, findings illuminate the nuanced relationship between precision and computational resources, enriching discussions on facial recognition technologies.


Artificial General Intelligence And The Mind-Body Problem: Exploring The Computability Of Simulated Human Intelligence In Light Of The Immaterial Mind, Caleb Parks Apr 2024

Artificial General Intelligence And The Mind-Body Problem: Exploring The Computability Of Simulated Human Intelligence In Light Of The Immaterial Mind, Caleb Parks

Senior Honors Theses

In this thesis I explore whether achieving artificial general intelligence (AGI) through simulating the human brain is theoretically possible. Because of the scientific community’s predominantly physicalist outlook on the mind-body problem, AGI research may be limited by erroneous foundational presuppositions. Arguments from linguistics and mathematics demonstrate that the human intellect is partially immaterial, opening the door for novel analysis of the mind’s simulability. I categorize mind-body problem philosophies in a manner relevant to computer science based upon state transitions, and determine their ramifications on mind-simulation. Finally, I demonstrate how classical architectures cannot resolve so-called Gödel statements, discuss why this inability …


Methods, Analyses, And Applications Of Multilayer Temporal Link Prediction In Networks, Xie He Apr 2024

Methods, Analyses, And Applications Of Multilayer Temporal Link Prediction In Networks, Xie He

Dartmouth College Ph.D Dissertations

Many applications stem from the possibility of accurately predicting links in various types of networks. In this thesis, we present methods, analyses, and applications for static, temporal, and multilayer networks. The first part of this thesis demonstrates how static network features serve as efficient and accurate predictors for link prediction in temporal networks. It includes an ensemble learning method we developed and presents experimental results on 90 synthetic stochastic block models and 19 real-world datasets. The second part closely follows, showcasing 20 different sampling methods and their effects on nine different link prediction algorithms for 250 real-world networks across 6 …


Exploring Tokenization Techniques To Optimize Patch-Based Time-Series Transformers, Gabriel L. Asher Apr 2024

Exploring Tokenization Techniques To Optimize Patch-Based Time-Series Transformers, Gabriel L. Asher

Computer Science Senior Theses

Transformer architectures have revolutionized deep learning, impacting natural language processing and computer vision. Recently, PatchTST has advanced long-term time-series forecasting by embedding patches of time-steps to use as tokens for transformers. This study examines and seeks to enhance PatchTST's embedding techniques. Using eight benchmark datasets, we explore explore novel token embedding techniques. To this end, we introduce several PatchTST variants, which alter the embedding methods of the original paper. These variants consist of the following architectural changes: using CNNs to embed inputs to tokens, embedding an aggregate measure like the mean, max, or sum of a patch, adding the exponential …


Assessing Gait Metrics For Early Parkinson's Disease Prediction: A Preliminary Analysis Of Underfit Models, Daniel Salinas, Gerardo Medellin, Katherine Bolado, Tomas Gomez, Kelsey Potter-Baker, Nawaz Khan Abdul Hack, Ramu Vadukapuram Mar 2024

Assessing Gait Metrics For Early Parkinson's Disease Prediction: A Preliminary Analysis Of Underfit Models, Daniel Salinas, Gerardo Medellin, Katherine Bolado, Tomas Gomez, Kelsey Potter-Baker, Nawaz Khan Abdul Hack, Ramu Vadukapuram

Research Symposium

Background: Parkinson's Disease (PD) is characterized by both motor and non-motor symptoms, and its diagnosis primarily relies on clinical presentation. There is a growing need for diagnostic tools to identify the early signs of PD, particularly the initial motor impairments often manifested as gait abnormalities. Here we seek to present preliminary findings to address this need. Our study focuses on using Machine Learning techniques (ML) to predict the PD clinical stage most efficiently and accurately. Specifically, we have sought to evaluate how spatiotemporal characteristics and other locomotor performance variables obtained on a walkway system can be utilized to identify the …


Xfuzz: Machine Learning Guided Cross-Contract Fuzzing, Yinxing Xue, Jiaming Ye, Wei Zhang, Jun Sun, Lei Ma, Haijun Wang, Jianjun Zhao Mar 2024

Xfuzz: Machine Learning Guided Cross-Contract Fuzzing, Yinxing Xue, Jiaming Ye, Wei Zhang, Jun Sun, Lei Ma, Haijun Wang, Jianjun Zhao

Research Collection School Of Computing and Information Systems

Smart contract transactions are increasingly interleaved by cross-contract calls. While many tools have been developed to identify a common set of vulnerabilities, the cross-contract vulnerability is overlooked by existing tools. Cross-contract vulnerabilities are exploitable bugs that manifest in the presence of more than two interacting contracts. Existing methods are however limited to analyze a maximum of two contracts at the same time. Detecting cross-contract vulnerabilities is highly non-trivial. With multiple interacting contracts, the search space is much larger than that of a single contract. To address this problem, we present xFuzz , a machine learning guided smart contract fuzzing framework. …


Using Natural Language Processing To Identify Mental Health Indicators In Aviation Voluntary Safety Reports, Michael Sawyer, Katherine Berry, Amelia Kinsella, R Jordan Hinson, Edward Bynum Feb 2024

Using Natural Language Processing To Identify Mental Health Indicators In Aviation Voluntary Safety Reports, Michael Sawyer, Katherine Berry, Amelia Kinsella, R Jordan Hinson, Edward Bynum

National Training Aircraft Symposium (NTAS)

Voluntary Safety Reporting Programs (VSRPs) are a critical tool in the aviation industry for monitoring safety issues observed by the frontline workforce. While VSRPs primarily focus on operational safety, report narratives often describe factors such as fatigue, workload, culture, staffing, and health, directly or indirectly impacting mental health. These reports can provide individual and organizational insights into aviation personnel's physical and psychological well-being. This poster introduces the AVIation Analytic Neural network for Safety events (AVIAN-S) model as a potential tool to extract and monitor these insights. AVIAN-S is a novel machine-learning model that leverages natural language processing (NLP) to analyze …


Mri Image Regression Cnn For Bone Marrow Lesion Volume Prediction, Kevin Yanagisawa Feb 2024

Mri Image Regression Cnn For Bone Marrow Lesion Volume Prediction, Kevin Yanagisawa

Theses and Dissertations

Bone marrow lesions (BMLs), occurs from fluid build up in the soft tissues inside your bone. This can be seen on magnetic resonance imaging (MRI) scans and is characterized by excess water signals in the bone marrow space. This disease is commonly caused by osteoarthritis (OA), a degenerative join disease where tissues within the joint breakdown over time [1]. These BMLs are an emerging target for OA, as they are commonly related to pain and worsening of the diseased area until surgical intervention is required [2]–[4]. In order to assess the BMLs, MRIs were utilized as input into a regression …


Machine Learning For Wireless Network Throughput Prediction, Gustavo A. Fernandez Jan 2024

Machine Learning For Wireless Network Throughput Prediction, Gustavo A. Fernandez

School of Mathematical and Statistical Sciences Faculty Publications and Presentations

This paper analyzes a dataset containing radio frequency (RF) measurements and Key Performance Indicators (KPIs) captured at 1876.6MHz with a bandwidth of 10MHz from an operational 4G LTE network in Nigeria. The dataset includes metrics such as RSRP (Reference Signal Received Power), which measures the power level of reference signals; RSRQ (Reference Signal Received Quality), an indicator of signal quality that provides insight into the number of users sharing the same resources; RSSI (Received Signal Strength Indicator), which gauges the total received power in a bandwidth; SINR (Signal to Interference plus Noise Ratio), a measure of signal quality considering both …


Mitigating Safety Issues In Pre-Trained Language Models: A Model-Centric Approach Leveraging Interpretation Methods, Weicheng Ma Jan 2024

Mitigating Safety Issues In Pre-Trained Language Models: A Model-Centric Approach Leveraging Interpretation Methods, Weicheng Ma

Dartmouth College Ph.D Dissertations

Pre-trained language models (PLMs), like GPT-4, which powers ChatGPT, face various safety issues, including biased responses and a lack of alignment with users' backgrounds and expectations. These problems threaten their sociability and public application. Present strategies for addressing these safety concerns primarily involve data-driven approaches, requiring extensive human effort in data annotation and substantial training resources. Research indicates that the nature of these safety issues evolves over time, necessitating continual updates to data and model re-training—an approach that is both resource-intensive and time-consuming. This thesis introduces a novel, model-centric strategy for understanding and mitigating the safety issues of PLMs by …


Word Prediction Using Dynamic Skip Connections Along With Arabert And Lstm In Arabic Language, Ahad Almalki, Faris Kateb, Rayan Mosli Jan 2024

Word Prediction Using Dynamic Skip Connections Along With Arabert And Lstm In Arabic Language, Ahad Almalki, Faris Kateb, Rayan Mosli

ASEAN Journal on Science and Technology for Development

Natural Language Generation (NLG) plays a crucial role in modern digital tools, including chatbots, virtual support, content suggestions, and tailored marketing, making bots more responsive and reducing the need for human staff. While there's much research on NLG for languages like English, languages like Arabic, Urdu, and Chinese still face challenges. This study examines Arabic NLG's unique aspects, dialects, and word variations. With around 420 million Arabic speakers globally, it's crucial to advance NLG for this language. We compared three models: Long Short-Term Memory (LSTM), a mix of Bidirectional Encoder Representations from Transformers (BERT) and LSTM, and a version that …


Equipment Performance Cost Optimization Using Machine Learning (A Surface Condenser Case Study), Firdaus Basheer, Mohamed Saleem Haja Nazmudeen, Fadzliwati Mohiddin, Elango Natrajan Jan 2024

Equipment Performance Cost Optimization Using Machine Learning (A Surface Condenser Case Study), Firdaus Basheer, Mohamed Saleem Haja Nazmudeen, Fadzliwati Mohiddin, Elango Natrajan

ASEAN Journal on Science and Technology for Development

Equipment performance assessment or prediction has usually been done using the conventional approach. Organization is often too busy to focus on improvement opportunities for equipment performance. Opportunities identifications are heavily reliant on expert opinion and the methods used often vary from one person to another depending on the knowledge they possess. The benefits of simplistic and realistic equipment performance prediction would significantly improve maintenance costs and hence could help to reduce the total operating cost of the asset. In this research work, a surface condenser was used as a case study. The solution proposed in this research work is to …


Using Pose Estimation Software To Predict Actions In Sabre Fencing, Micah Edwin Peters Ii Jan 2024

Using Pose Estimation Software To Predict Actions In Sabre Fencing, Micah Edwin Peters Ii

Honors College Theses

Fencing is a combat sport that uses three different swords: epee, foil, and sabre. Due to its fast-paced nature and employment of right of way, sabre fencing is often considered the most difficult of the three to learn. Computer vision and pose estimation software can be used to lower the barrier of entry to sabre fencing by identifying the different actions in sabre fencing. This project focuses on using open-source software to design a program that can identify the sabre parries as well as the main sabre movements. This program could be used to help newer fencers and spectators better …


Applications Of Genetic Algorithms To Chess, Elliot M. Harris Jan 2024

Applications Of Genetic Algorithms To Chess, Elliot M. Harris

Senior Projects Spring 2024

This thesis discusses the use of genetic algorithms to tune the parameters of a chess engine, resulting in a significant increase in playing strength. The design of the genetic algorithms builds on the 2008-2011 work of David-Tabibi et al. and Vázquez-Fernández et al. The overwhelmingly positive result presented in this thesis not only suggests a promising potential for genetic algorithm use to improve computer chess, but also supports the efficacy and potential of applying genetic algorithms to a broader set of use cases.


Machine Learning And Natural Language Processing For Crossword Puzzles, Finn Brennan Jan 2024

Machine Learning And Natural Language Processing For Crossword Puzzles, Finn Brennan

Senior Projects Spring 2024

Senior Project submitted to The Division of Science, Mathematics and Computing of Bard College.


Effective Drag Coefficient Prediction On Single-View 2d Images Of Snowflakes, Cameron Hudson Jan 2024

Effective Drag Coefficient Prediction On Single-View 2d Images Of Snowflakes, Cameron Hudson

Graduate College Dissertations and Theses

The drag coefficient of snowflakes is an crucial particle descriptor that can quantify the relationships with the mass, shape, size, and fall speed of snowflake particles. Previous studies has relied on estimating and improving empirical correlations for the drag coefficient of particles, utilizing 3D images from the Multi-Angled Snowflake Camera Database (MASCDB) to estimate snowflake properties such as mass, geometry, shape classification, and rimming degree. However, predictions of the drag coefficient with single-view 2D images of snowflakes has proven to be a challenging problem, primarily due to the lack of data and time-consuming, expensive methods used to estimate snowflake shape …


Sparse Representation Learning For Temporal Networks, Maxwell Mcneil Jan 2024

Sparse Representation Learning For Temporal Networks, Maxwell Mcneil

Electronic Theses & Dissertations (2024 - present)

Temporal networks arise in many domains including activity of social network users, sensor network readings over time, and time course gene expression within the interaction network of a model organism. Data of this type contains a wealth of prior information such as the connectivity among nodes (e.g., a friendship graph), and prior knowledge of expected temporal patterns (e.g., periodicity). Modeling these temporal and network patterns jointly is essential for state-of-the-art performance in temporal network data analysis and mining. Sparse dictionary encoding is one modeling approach for such underlying patterns. However, most classical approaches consider only one dimension of the data …


Towards Algorithmic Justice: Human Centered Approaches To Artificial Intelligence Design To Support Fairness And Mitigate Bias In The Financial Services Sector, Jihyun Kim Jan 2024

Towards Algorithmic Justice: Human Centered Approaches To Artificial Intelligence Design To Support Fairness And Mitigate Bias In The Financial Services Sector, Jihyun Kim

CMC Senior Theses

Artificial Intelligence (AI) has positively transformed the Financial services sector but also introduced AI biases against protected groups, amplifying existing prejudices against marginalized communities. The financial decisions made by biased algorithms could cause life-changing ramifications in applications such as lending and credit scoring. Human Centered AI (HCAI) is an emerging concept where AI systems seek to augment, not replace human abilities while preserving human control to ensure transparency, equity and privacy. The evolving field of HCAI shares a common ground with and can be enhanced by the Human Centered Design principles in that they both put humans, the user, at …


Secure And Privacy-Preserving Federated Learning With Rapid Convergence In Leo Satellite Networks, Mohamed Elmahallawy Jan 2024

Secure And Privacy-Preserving Federated Learning With Rapid Convergence In Leo Satellite Networks, Mohamed Elmahallawy

Doctoral Dissertations

"The advancement of satellite technology has enabled the launch of small satellites equipped with high-resolution cameras into low Earth orbit (LEO), enabling the collection of extensive Earth data for training AI models. However, the conventional approach of downloading satellite-related data to a ground station (GS) for training a centralized machine learning (ML) model faces significant challenges. Firstly, the transmission of raw data raises security and privacy concerns, especially in military applications. Secondly, the download bandwidth is limited, which puts a stringent limit on image transmissions to the GS. Lastly, LEO satellites have sporadic visibility with the GS, and orbit the …


Ensemble Classification: An Analysis Of The Random Forest Model, Jarod Korn Jan 2024

Ensemble Classification: An Analysis Of The Random Forest Model, Jarod Korn

Williams Honors College, Honors Research Projects

The random forest model proposed by Dr. Leo Breiman in 2001 is an ensemble machine learning method for classification prediction and regression. In the following paper, we will conduct an analysis on the random forest model with a focus on how the model works, how it is applied in software, and how it performs on a set of data. To fully understand the model, we will introduce the concept of decision trees, give a summary of the CART model, explain in detail how the random forest model operates, discuss how the model is implemented in software, demonstrate the model by …


Flexible Attenuation Fields: Tomographic Reconstruction From Heterogeneous Datasets, Clifford S. Parker Jan 2024

Flexible Attenuation Fields: Tomographic Reconstruction From Heterogeneous Datasets, Clifford S. Parker

Theses and Dissertations--Computer Science

Traditional reconstruction methods for X-ray computed tomography (CT) are highly constrained in the variety of input datasets they admit. Many of the imaging settings -- the incident energy, field-of-view, effective resolution -- remain fixed across projection images, and the only real variance is in the detector's position and orientation with respect to the scene. In contrast, methods for 3D reconstruction of natural scenes are extremely flexible to the geometric and photometric properties of the input datasets, readily accepting and benefiting from images captured under varying lighting conditions, with different cameras, and at disparate points in time and space. Extending CT …


Advanced Mathematical Graph-Based Machine Learning And Deep Learning Models For Drug Design, Farjana Tasnim Mukta Jan 2024

Advanced Mathematical Graph-Based Machine Learning And Deep Learning Models For Drug Design, Farjana Tasnim Mukta

Theses and Dissertations--Mathematics

Drug discovery is a highly complicated and time-consuming process. One of the main challenges in drug development is predicting whether a drug-like molecule will interact with a specific target protein. This prediction accelerates target validation and drug development. Recent research in biomolecular sciences has shown significant interest in algebraic graph-based models for representing molecular complexes and predicting drug-target binding affinity. In this thesis, we present algebraic graph-based molecular representations to create data-driven scoring functions (SF) using extended atom types to capture wide-range interactions between targets and drug candidates. Our model employs multiscale weighted colored subgraphs for the protein-ligand complex, colored …


Wave Energy Converter Wave Force Prediction Using A Neural Network, Morgan Kline Jan 2024

Wave Energy Converter Wave Force Prediction Using A Neural Network, Morgan Kline

Dissertations, Master's Theses and Master's Reports

Due to the unpredictable nature of large bodies of water, wave energy can be a difficult renewable resource to rely on. One way to make Wave Energy Converters (WECs) more efficient is to apply a control strategy. In many control solutions, it is assumed that the wave excitation force is known into the future. In many instances, especially with complex waveforms, this is simply not the case. Simulation studies have shown the promise of wave force prediction using neural networks. This study demonstrates this experimentally and aims to characterize the important factors when designing such a network. Several wave elevation …


On Vulnerabilities Of Building Automation Systems, Michael Cash Jan 2024

On Vulnerabilities Of Building Automation Systems, Michael Cash

Graduate Thesis and Dissertation 2023-2024

Building automation systems (BAS) have become more commonplace in personal and commercial environments in recent years. They provide many functions for comfort and ease of use, from automating room temperature and shading, to monitoring equipment data and status. Even though their convenience is beneficial, their security has become an increased concerned in recent years. This research shows an extensive study on building automation systems and identifies vulnerabilities in some of the most common building communication protocols, BACnet and KNX. First, we explore the BACnet protocol, exploring its Standard BACnet objects and properties. An automation tool is designed and implemented to …


Data Driven And Machine Learning Based Modeling And Predictive Control Of Combustion At Reactivity Controlled Compression Ignition Engines, Behrouz Khoshbakht Irdmousa Jan 2024

Data Driven And Machine Learning Based Modeling And Predictive Control Of Combustion At Reactivity Controlled Compression Ignition Engines, Behrouz Khoshbakht Irdmousa

Dissertations, Master's Theses and Master's Reports

Reactivity Controlled Compression Ignition (RCCI) engines operates has capacity to provide higher thermal efficiency, lower particular matter (PM), and lower oxides of nitrogen (NOx) emissions compared to conventional diesel combustion (CDC) operation. Achieving these benefits is difficult since real-time optimal control of RCCI engines is challenging during transient operation. To overcome these challenges, data-driven machine learning based control-oriented models are developed in this study. These models are developed based on Linear Parameter-Varying (LPV) modeling approach and input-output based Kernelized Canonical Correlation Analysis (KCCA) approach. The developed dynamic models are used to predict combustion timing (CA50), indicated mean effective pressure (IMEP), …


An Unsupervised Machine Learning Algorithm For Clustering Low Dimensional Data Points In Euclidean Grid Space, Josef Lazar Jan 2024

An Unsupervised Machine Learning Algorithm For Clustering Low Dimensional Data Points In Euclidean Grid Space, Josef Lazar

Senior Projects Spring 2024

Clustering algorithms provide a useful method for classifying data. The majority of well known clustering algorithms are designed to find globular clusters, however this is not always desirable. In this senior project I present a new clustering algorithm, GBCN (Grid Box Clustering with Noise), which applies a box grid to points in Euclidean space to identify areas of high point density. Points within the grid space that are in adjacent boxes are classified into the same cluster. Conversely, if a path from one point to another can only be completed by traversing an empty grid box, then they are classified …


Ai-Based Defect Detection In Aerospace Ultrasonic Signals, Rami Issac Lake Jan 2024

Ai-Based Defect Detection In Aerospace Ultrasonic Signals, Rami Issac Lake

Graduate Research Theses & Dissertations

Ensuring the safety and integrity of materials and structures throughout the manufacturing cycle is a critical concern across various industries, including aerospace, automotive, oiland gas, and civil engineering. Non-Destructive Inspection (NDI) techniques allow for the examination of materials without causing damage or alteration, enabling the early detection of potential issues before materials are utilized in the field. The inspection of fuselage composites presents a particular challenge due to their complex structures, diverse materials, and differences in thickness, making defect detection a challenging yet crucial task. Moreover, defects of various types and causes can emerge across all depths of the material …


Leveraging Machine Learning & Deep Learning Methodologies To Detect Deepfakes, Aniruddha Tiwari Jan 2024

Leveraging Machine Learning & Deep Learning Methodologies To Detect Deepfakes, Aniruddha Tiwari

All Graduate Theses, Dissertations, and Other Capstone Projects

The rapid evolution of deep learning (DL) and machine learning (ML) techniques has facilitated the rise of highly convincing synthetic media, commonly referred to as deepfakes. These manipulative media artifacts, generated through advanced artificial intelligence algorithms, pose significant challenges in distinguishing them from authentic content. Given their potential to be disseminated widely across various online platforms, the imperative for robust detection methodologies becomes apparent. Accordingly, this study explores the efficacy of existing ML/DL-based approaches and aims to compare which type of methodology performs better in identifying deepfake content. In response to the escalating threat posed by deepfakes, previous research efforts …


Comparative Analysis Of Data Augmentation On Sentiment Analysis In Three Distinct Languages, Hyesu Lee Jan 2024

Comparative Analysis Of Data Augmentation On Sentiment Analysis In Three Distinct Languages, Hyesu Lee

All Graduate Theses, Dissertations, and Other Capstone Projects

Machine learning in natural language processing analyzes datasets to make future predictions for various filed in the real world. By training machine algorithms on the datasets of text, the model can learn patterns and structure of the text in many different languages. Then the model enables to perform the text classification, sentiment analysis, and other tasks. A large and balanced dataset is required to develop an accurate machine learning model. However, the collection of a reliable, large, and equally distributed dataset is a challenging and requires significant resources and time. As a solution to this challenge, a data augmentation technique …