Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Machine Learning

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 1 - 30 of 826

Full-Text Articles in Physical Sciences and Mathematics

What Do We Know About Hugging Face? A Systematic Literature Review And Quantitative Validation Of Qualitative Claims, Jason Jones, Wenxin Jiang, Nicholas Synovic, George K. Thiruvathukal, James C. Davis Oct 2024

What Do We Know About Hugging Face? A Systematic Literature Review And Quantitative Validation Of Qualitative Claims, Jason Jones, Wenxin Jiang, Nicholas Synovic, George K. Thiruvathukal, James C. Davis

Computer Science: Faculty Publications and Other Works

Background: Collaborative Software Package Registries (SPRs) are an integral part of the software supply chain. Much engineering work synthesizes SPR package into applications. Prior research has examined SPRs for traditional software, such as NPM (JavaScript) and PyPI (Python). Pre-Trained Model (PTM) Registries are an emerging class of SPR of increasing importance, because they support the deep learning supply chain.
Aims: Recent empirical research has examined PTM registries in ways such as vulnerabilities, reuse processes, and evolution. However, no existing research synthesizes them to provide a systematic understanding of the current knowledge. Some of the existing research includes qualitative …


Interoperability In Deep Learning: A User Survey And Failure Analysis Of Onnx Model Converters, Purvish Jajal, Wenxin Jiang, Arav Tewari, Erik Kocinare, Joseph Woo, Anusha Sarraf, Yung-Hsiang Lu, George K. Thiruvathukal, James C. Davis Sep 2024

Interoperability In Deep Learning: A User Survey And Failure Analysis Of Onnx Model Converters, Purvish Jajal, Wenxin Jiang, Arav Tewari, Erik Kocinare, Joseph Woo, Anusha Sarraf, Yung-Hsiang Lu, George K. Thiruvathukal, James C. Davis

Computer Science: Faculty Publications and Other Works

Software engineers develop, fine-tune, and deploy deep learning (DL) models using a variety of development frameworks and runtime environments. DL model converters move models between frameworks and to runtime environments. Conversion errors compromise model quality and disrupt deployment. However, the failure characteristics of DL model converters are unknown, adding risk when using DL interoperability technologies.
This paper analyzes failures in DL model converters. We survey software engineers about DL interoperability tools, use cases, and pain points (N=92). Then, we characterize failures in model converters associated with the main interoperability tool, ONNX (N=200 issues in PyTorch and TensorFlow). Finally, we formulate …


Rethinking Retrieval Automated Fine-Tuning In An Evolving Llm Landscape, Nicholas Sager, Timothy Cabaza, Matthew Cusack, Ryan Bass, Joaquin Dominguez Sep 2024

Rethinking Retrieval Automated Fine-Tuning In An Evolving Llm Landscape, Nicholas Sager, Timothy Cabaza, Matthew Cusack, Ryan Bass, Joaquin Dominguez

SMU Data Science Review

This study explores the utilization of Retrieval Augmented Fine-Tuning (RAFT) to enhance the performance of Large Language Models (LLMs) in domain-specific Retrieval Augmented Generation (RAG) tasks. By integrating domain-specific information during the retrieval process, RAG aims to reduce hallucination and improve the accuracy of LLM outputs. We investigate the use of RAFT, an approach that enhances LLMs by incorporating domain-specific knowledge and effectively handling distractor documents. This paper validates previous work, which found that RAFT can considerably improve the performance of Llama2-7B in specific domains. We also expand upon previous work into new state-of-the-art open-source models and other datasets with …


An Online Nanoinformatics Platform Empowering Computational Modeling Of Nanomaterials By Nanostructure Annotations And Machine Learning Toolkits., Tong Wang, Daniel P Russo, Philip Demokritou, Xuelian Jia, Heng Huang, Xinyu Yang, Hao Zhu Aug 2024

An Online Nanoinformatics Platform Empowering Computational Modeling Of Nanomaterials By Nanostructure Annotations And Machine Learning Toolkits., Tong Wang, Daniel P Russo, Philip Demokritou, Xuelian Jia, Heng Huang, Xinyu Yang, Hao Zhu

College of Science & Mathematics Departmental Research

Modern nanotechnology has generated numerous datasets from in vitro and in vivo studies on nanomaterials, with some available on nanoinformatics portals. However, these existing databases lack the digital data and tools suitable for machine learning studies. Here, we report a nanoinformatics platform that accurately annotates nanostructures into machine-readable data files and provides modeling toolkits. This platform, accessible to the public at https://vinas-toolbox.com/, has annotated nanostructures of 14 material types. The associated nanodescriptor data and assay test results are appropriate for modeling purposes. The modeling toolkits enable data standardization, data visualization, and machine learning model development to predict properties and bioactivities …


A Machine Learning Based Approach For The Identification Of Fake Bills, Tianyang Lu, Hongyang Pang Aug 2024

A Machine Learning Based Approach For The Identification Of Fake Bills, Tianyang Lu, Hongyang Pang

Rose-Hulman Undergraduate Mathematics Journal

Fake or counterfeiting currency, which has been around as long as money has existed, is a major economic problem. Since the US dollar is the most popular form of currency globally, it is the most popular currency to counterfeit. The United States Department of Treasury estimates that between $70 million and $200 million in fake bills are in circulation. The Federal Reserve Bank uses special banknote processing systems to count each bill deposited by the bank and examine them for the possibility of counterfeits. These machines have sensors designed to detect general quality of the bills, including paper type, quality …


Enhancing Fundraising Strategies In Higher Education Through Machine Learning, Laith Alatwah Aug 2024

Enhancing Fundraising Strategies In Higher Education Through Machine Learning, Laith Alatwah

Electrical Engineering Theses

This thesis presents a comprehensive application of machine learning techniques, namely Fine Gaussian SVM and RUS Boosted Trees, to enhance fundraising strategies in higher education institutions. Analyzing a rich dataset from Blackbaud Raiser's Edge NXT, spanning 2012 to 2022, the study focuses on donor profiles, including demographics, donation history, and engagement patterns. Key demographic insights include the increasing engagement of younger donors (20-29 age group) and significant contributions from older donors (70-99 age group). Geographical trends are also examined, revealing distinct patterns based on donors' city, state, and ZIP code. The Fine Gaussian SVM model demonstrates moderate discriminatory power, with …


Physics-Informed Machine Learning Methods For Inverse Design Of Multi-Phase Materials With Targeted Mechanical Properties, Yunpeng Wu Aug 2024

Physics-Informed Machine Learning Methods For Inverse Design Of Multi-Phase Materials With Targeted Mechanical Properties, Yunpeng Wu

All Dissertations

Advances in machine learning algorithms and applications have significantly enhanced engineering inverse design capabilities. This work focuses on the machine learning-based inverse design of material microstructures with targeted linear and nonlinear mechanical properties. It involves developing and applying predictive and generative physics-informed neural networks for both 2D and 3D multiphase materials.

The first investigation aims to develop a machine learning method for the inverse design of 2D multiphase materials, particularly porous materials. We first develop machine learning methods to understand the implicit relationship between a material's microstructure and its mechanical behavior. Specifically, we use ResNet-based models to predict the elastic …


Leveraging Generative Ai For Sustainable Farm Management Techniques Correspond To Optimization And Agricultural Efficiency Prediction, Samira Samrose Aug 2024

Leveraging Generative Ai For Sustainable Farm Management Techniques Correspond To Optimization And Agricultural Efficiency Prediction, Samira Samrose

All Graduate Reports and Creative Projects, Fall 2023 to Present

Sustainable farm management practice is a multifaceted challenge. Uncovering the optimal state for production while reduction of environmental negative impacts and guaranteed inter-generational assets supervision needs balanced management. Also, considering lots of different factors (cost, profit, employment etc), the agricultural based management technique requires rigorous concentration. In this project machine learning models are applied to develop, achieve and improve the farm management techniques. This experiment ensures the resultant impacts being environment friendly and necessary resource availability and efficiency. Predicting the type of crop and rotational recommendations will disclose potentiality of productive agricultural based farming. Additionally, this project is designed to …


Neural Networks For Decisions Under Uncertainty, Edwin Tomy George Aug 2024

Neural Networks For Decisions Under Uncertainty, Edwin Tomy George

Open Access Theses & Dissertations

Neural networks are used in many real-world applications, ranging from classification tasks to medical diagnostics. For each task, a neural network is typically able to make predictions due to its ability to extract meaningful patterns from processing large amounts of data. Thus, given the increases in available data in recent decades, the performance of neural networks in making accurate predictions has greatly increased. However, this data often comes with ingrained uncertainties due to measurement errors or the inherent variability of individual data points. Neural networks can learn despite the errors in the overall data, but what if we want them …


Personalized Driving Using Inverse Reinforcement Learning, Rodrigo J. Gonzalez Salinas Jul 2024

Personalized Driving Using Inverse Reinforcement Learning, Rodrigo J. Gonzalez Salinas

Theses and Dissertations

This thesis introduces an autonomous driving controller designed to replicate individual driving behaviors based on a provided demonstration. The controller employs Inverse Reinforcement Learning (IRL) to formulate the reward function associated with the provided demonstration. IRL is implemented through a dual-feedback loop system. The inner loop utilizes Q-learning, a model-free reinforcement learning technique, to optimize the Hamilton-Jacobi-Bellman (HJB) equation and derive an appropriate control solution. The outer loop leverages this derived control solution to generate parameters for the reward function, which are subsequently integrated into the HJB equation. The ultimate control policy is deduced from the final reward function obtained …


Detection And Classification Of Unauthorized Use Of Irrigation Motors In Agricultural Irrigation, Önder Ci̇velek, Sedat Görmüş, Hali̇l İbrahi̇m Okumuş, Orhan Gazi̇ Kederoglu Jul 2024

Detection And Classification Of Unauthorized Use Of Irrigation Motors In Agricultural Irrigation, Önder Ci̇velek, Sedat Görmüş, Hali̇l İbrahi̇m Okumuş, Orhan Gazi̇ Kederoglu

Turkish Journal of Electrical Engineering and Computer Sciences

The decarbonisation of electricity generation requires the real-time monitoring and control of grid components in order to efficiently and timely dispatch demand. This highly automated system, known as the Smart Grid, relies on smart or sensor-equipped distribution network components to optimise energy flow and minimise losses. However, energy theft, a major obstacle to efficient resource utilisation, poses a significant challenge to achieving this goal. This study proposes and evaluates a real-time telemetry and control system designed to mitigate energy theft in agricultural irrigation applications. The system increases energy efficiency by tracking the energy use in agricultural irrigation. The key challenge …


Vysion Software, Isaias Hernandez-Dominguez Jr, Chander Luderman Miller Jul 2024

Vysion Software, Isaias Hernandez-Dominguez Jr, Chander Luderman Miller

2024 Symposium

Vision loss presents significant challenges in daily life. Existing solutions for blind and visually impaired individuals are often limited in functionality, expensive, or complex to use. Vysion Software addresses this gap by developing a user-friendly, all-in-one AI companion app that provides features including text summarization, real-time audio descriptions, and AI-enhanced navigation. This project details the development plan, initial functionalities, and future vision for Vysion Software.


Survey Of Memory Consolidation Techniques For Video Question Answering, Matthew Couts, Pha Nguyen, Khoa Luu Jul 2024

Survey Of Memory Consolidation Techniques For Video Question Answering, Matthew Couts, Pha Nguyen, Khoa Luu

Inquiry: The University of Arkansas Undergraduate Research Journal

Video Question Answering (VideoQA) is a field of research focused on developing models that can engage in natural conversations with humans about the content of videos. Currently, the most successful approaches involve analyzing videos frame-by-frame, which is computationally and memory-intensive. To imitate human memory, the Atkinson-Shiffrin memory model can formulate the machine’s video understanding capability through Vision-Language Models. Reducing the number of frames processed by the model is a crucial operation in this approach category and can be handled by a memory consolidation algorithm. The memory consolidation algorithm should be able to determine the keyframes to transfer from short-term to …


Predictive Power Of Machine Learning Models On Degree Completion Among Adult Learners, Emily Barnes, James Hutson, Karriem Perry Jun 2024

Predictive Power Of Machine Learning Models On Degree Completion Among Adult Learners, Emily Barnes, James Hutson, Karriem Perry

Faculty Scholarship

The integration of machine learning (ML) into higher education has been recognized as a transformative force for adult learners, a growing demographic facing unique educational challenges. This study evaluates the predictive power of three ML models—Random Forest, Gradient-Boosting Machine, and Decision Trees—in forecasting degree completion among this group. Utilizing a dataset from the academic years 2013-14 to 2021-22, which includes demographic and academic performance metrics, the study employs accuracy, precision, recall, and F1 score to assess the efficacy of these models. The results indicate that the Gradient-Boosting Machine model outperforms others in predicting degree completion, suggesting that ML can significantly …


Investigating Bias In Mortgage-Rate Machine Learning Models, Will Kalikman May 2024

Investigating Bias In Mortgage-Rate Machine Learning Models, Will Kalikman

Computer Science Senior Theses

Banks and fintech lenders increasingly rely on computer-aided models in lending decisions. Traditional models were interpretable: decisions were based on observable factors, such as whether a borrower's credit score was above a threshold value, and explainable in terms of combinations of these factors. In contrast, modern machine learning models are opaque and non-interpretable. Their opaqueness and reliance on historical data that is the artifact of past racial discrimination means these new models risk embedding and exacerbating such discrimination, even if lenders do not intend to discriminate. We calibrate two random forest classifiers using publicly available HMDA loan data and publicly …


Building A Data Pipeline And Machine Learning Model For Insurance Data, Connor Weyers May 2024

Building A Data Pipeline And Machine Learning Model For Insurance Data, Connor Weyers

Honors Theses

Insurance telematics is an emerging and exciting field. It combines the advancements in GPS tracking, computational analytics, data processing, and machine learning into a useful tool to help insurance companies make the best product for their consumers. This is why National Indemnity looked to implement a telematics portion to their business processes of underwriting insurance policies and sponsored a School of Computing Senior Design project. In this report, we will first review existing solutions that been used to solve problems and subproblems similar to that we are given in this project. We then propose designs for the data pipeline and …


Unboxing The Complicated Near Term Climatic And Geomorphic History Of Mars, Joshua Matthew Williams May 2024

Unboxing The Complicated Near Term Climatic And Geomorphic History Of Mars, Joshua Matthew Williams

Earth and Planetary Sciences ETDs

It has long been thought that glacial processes were unlikely within the tropical regions of Mars. However, growing evidence, including this work has identified and quantified relic glacial forms within the equatorial regions. These findings have major implications for understanding Martian climate history and its sensitivity to changes in insolation. As well, the presence of ice in the equatorial region of Mars has significant implications for the past global redistribution of the water ice in the Martian cryosphere. In this manuscript, I clarify and refine our understanding of the morphology of glacial features in the equatorial zone by applying novel …


Machine Learning: Face Recognition, Mohammed E. Amin May 2024

Machine Learning: Face Recognition, Mohammed E. Amin

Publications and Research

This project explores the cutting-edge intersection of machine learning (ML) and face recognition (FR) technology, utilizing the OpenCV library to pioneer innovative applications in real-time security and user interface enhancement. By processing live video feeds, our system encodes visual inputs and employs advanced face recognition algorithms to accurately identify individuals from a database of photos. This integration of machine learning with OpenCV not only showcases the potential for bolstering security systems but also enriches user experiences across various technological platforms. Through a meticulous examination of unique facial features and the application of sophisticated ML algorithms and neural networks, our project …


Evaluating Neuroimaging Modalities In The A/T/N Framework: Single And Combined Fdg-Pet And T1-Weighted Mri For Alzheimer’S Diagnosis, Peiwang Liu May 2024

Evaluating Neuroimaging Modalities In The A/T/N Framework: Single And Combined Fdg-Pet And T1-Weighted Mri For Alzheimer’S Diagnosis, Peiwang Liu

McKelvey School of Engineering Theses & Dissertations

With the escalating prevalence of dementia, particularly Alzheimer's Disease (AD), the need for early and precise diagnostic techniques is rising. This study delves into the comparative efficacy of Fluorodeoxyglucose Positron Emission Tomography (FDG-PET) and T1-weighted Magnetic Resonance Imaging (MRI) in diagnosing AD, where the integration of multimodal models is becoming a trend. Leveraging data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), we employed linear Support Vector Machines (SVM) to assess the diagnostic potential of these modalities, both individually and in combination, within the AD continuum. Our analysis, under the A/T/N framework's 'N' category, reveals that FDG-PET consistently outperforms T1w-MRI across …


Toward The Integration Of Behavioral Sensing And Artificial Intelligence, Subigya K. Nepal May 2024

Toward The Integration Of Behavioral Sensing And Artificial Intelligence, Subigya K. Nepal

Dartmouth College Ph.D Dissertations

The integration of behavioral sensing and Artificial Intelligence (AI) has increasingly proven invaluable across various domains, offering profound insights into human behavior, enhancing mental health monitoring, and optimizing workplace productivity. This thesis presents five pivotal studies that employ smartphone, wearable, and laptop-based sensing to explore and push the boundaries of what these technologies can achieve in real-world settings. This body of work explores the innovative and practical applications of AI and behavioral sensing to capture and analyze data for diverse purposes. The first part of the thesis comprises longitudinal studies on behavioral sensing, providing a detailed, long-term view of how …


Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth May 2024

Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth

Electronic Theses, Projects, and Dissertations

The longstanding prevalence of hypertension, often undiagnosed, poses significant risks of severe chronic and cardiovascular complications if left untreated. This study investigated the causes and underlying risks of hypertension in females aged between 18-39 years. The research questions were: (Q1.) What factors affect the occurrence of hypertension in females aged 18-39 years? (Q2.) What machine learning algorithms are suited for effectively predicting hypertension? (Q3.) How can SHAP values be leveraged to analyze the factors from model outputs? The findings are: (Q1.) Performing Feature selection using binary classification Logistic regression algorithm reveals an array of 30 most influential factors at an …


Predicting Energy Expenditure From Physical Activity Videos Using Optical Flows And Deep Learning, Gayatri Kasturi May 2024

Predicting Energy Expenditure From Physical Activity Videos Using Optical Flows And Deep Learning, Gayatri Kasturi

Theses and Dissertations

This thesis presents a novel approach for predicting energy expenditure of physical activity from videos using optical flows and deep learning. Conventional approaches mainly rely on wearable sensors, which, despite being widely used, are constrained by practicality and accuracy concerns. This proposal introduces a new strategy that utilizes a three-dimensional Convolutional Neural Network (3D-CNN) to evaluate video data and accurately estimate energy costs in metabolic equivalents (METs). Our model utilizes optical flow extraction to analyze video, capturing complex motion patterns and their changes over time. The results are good indicating potential for this method to be deployed in various healthcare …


Database And Machine Learning Model For Classifying Autism Spectrum Disorder From Smartphone Based Electroretinography, Rory Harris May 2024

Database And Machine Learning Model For Classifying Autism Spectrum Disorder From Smartphone Based Electroretinography, Rory Harris

Honors Scholar Theses

Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder that negatively affects a patient’s cognitive and communication aptitude and, therefore, can severely impact that patient’s quality of life. Because of this, early diagnosis is paramount. In recent studies, electroretinography (ERG), which is a measure of the retina’s electrical response to a brief flash of light into the eye, has shown promise in detecting ASD. Access to these scans can provide early diagnosis, improving well-being. Current ERG devices are very expensive due to their on board processing capabilities. This paper aims to create an ERG device using a smartphone as the main …


Using Machine Learning To Identify Hate Speech And Offensive Language On Twitter., Mayara Lorens, Thayene Lorens May 2024

Using Machine Learning To Identify Hate Speech And Offensive Language On Twitter., Mayara Lorens, Thayene Lorens

BSc (Hons) in Computing in IT

The central theme of this project is the application of Machine Learning to identify both hate speech and offensive language on Twitter. We chose this topic for its ethical relevance in the technological environment and its business potential. This topic raises concerns such as cyberbullying and the existence of a hostile environment for users. For this reason, we sought to implement four different models to create an automated system capable of identifying and categorizing whether specific content is offensive, non-offensive or neutral.


Advancing Compact Modeling Of Electronic Devices: Machine Learning Approaches With Neural Networks, Mixture Density Networks, And Deep Symbolic Regression, Jack Robert Hutchins May 2024

Advancing Compact Modeling Of Electronic Devices: Machine Learning Approaches With Neural Networks, Mixture Density Networks, And Deep Symbolic Regression, Jack Robert Hutchins

Masters Theses

This thesis pioneers the integration of deep learning techniques into the realm of compact modeling, presenting three distinct approaches that enhance the precision, efficiency, and adaptability of compact models for electronic devices. The first method introduces a Generalized Multilayer Perception Compact Model, leveraging the function approximation capabilities of neural networks through a multilayer perception (MLP) framework. This approach utilizes hyperband tuning to optimize network hyperparameters, demonstrating its effectiveness on a HfOx memristor and establishing a versatile modeling strategy for both single-state and multistate devices.

The second approach explores the application of Mixture Density Networks (MDNs) to encapsulate the inherent stochasticity …


Murmurations And Root Numbers, Alexey Pozdnyakov May 2024

Murmurations And Root Numbers, Alexey Pozdnyakov

University Scholar Projects

We report on a machine learning investigation of large datasets of elliptic curves and L-functions. This leads to the discovery of murmurations, an unexpected correlation between the root numbers and Dirichlet coefficients of L-functions. We provide a formal definition of murmurations, describe the connection with 1-level density, and provide three examples for which the murmuration phenomenon has been rigorously proven. Using our understanding of murmurations, we then build new machine learning models in search of a polynomial time algorithm for predicting root numbers. Based on our models and several heuristic arguments, we conclude that it is unlikely for …


Learning Scene Semantics For 3d Scene Retrieval, Natalie Gleason May 2024

Learning Scene Semantics For 3d Scene Retrieval, Natalie Gleason

Honors Theses

This project presents a comprehensive exploration into semantics-driven 3D scene retrieval, aiming to bridge the gap between 2D sketches/images and 3D models. Through four distinct research objectives, this project endeavors to construct a foundational infrastructure, develop methodologies for quantifying semantic similarity, and advance a semantics-based retrieval framework for 2D scene sketch-based and image-based 3D scene retrieval. Leveraging WordNet as a foundational semantic ontology library, the research proposes the construction of an extensive hierarchical scene semantic tree, enriching 2D/3D scenes with encoded semantic information. The methodologies for semantic similarity computation utilize this semantic tree to bridge the semantic disparity between 2D …


Implementation Of Explainable Ai For Bearing Fault Classification, Mohammad Mundiwala May 2024

Implementation Of Explainable Ai For Bearing Fault Classification, Mohammad Mundiwala

Honors Scholar Theses

It is difficult to overstate the impact of artificial intelligence (AI) over the past decade. The rapid expansion of machine learning has stimulated a race to deploy AI in all facets of life, one such domain being machine health monitoring. There is no doubt that machine learning excels in prediction accuracy, but oftentimes, these models are cryptic and fail to provide valuable insight into their decisions. This paper presents an overview of a neural network and what it means to learn. Next, two distinct Explainable AI (XAI) techniques will be presented: Gradient Class Activation Mapping and SimplEx . Finally, these …


From Tweets To Token Sales: Assessing Ico Success Through Social Media Sentiments, Donghao Huang, S. Samuel, Quoc Toan Huynh, Zhaoxia Wang May 2024

From Tweets To Token Sales: Assessing Ico Success Through Social Media Sentiments, Donghao Huang, S. Samuel, Quoc Toan Huynh, Zhaoxia Wang

Research Collection School Of Computing and Information Systems

With the advent of social network technology, the influence of collective opinions has significantly impacted business, marketing, and fundraising. Particularly in the blockchain space, Initial Coin Offerings (ICOs) gain substantial exposure across various online platforms. Yet, the intricate relationships among these elements remain largely unexplored. This study aims to investigate the relationships between social media sentiment, engagement metrics, and ICO success. We hypothesize a positive correlation between favorable sentiment in ICO-related tweets and overall project success. Additionally, we recognize social media engagement indicators (mentions, retweets, likes, follower counts) as critical factors affecting ICO performance. Employing machine learning techniques, we conduct …


Optimizing Adult Learner Success: Applying Random Forest Classifier In Higher Education Predictive Analytics, Emily Barnes, James Hutson, Karriem Perry May 2024

Optimizing Adult Learner Success: Applying Random Forest Classifier In Higher Education Predictive Analytics, Emily Barnes, James Hutson, Karriem Perry

Faculty Scholarship

This study examines the application of the Random Forest Classifier (RF) model in predicting academic success among adult learners in higher education. It focuses on evaluating the model's effectiveness using key statistical measures like accuracy, precision, recall, and F1 score across a comprehensive dataset from 2013–14 to 2021–22, which includes variables such as age, ethnicity, gender, Pell Grant eligibility, and academic performance metrics. The research highlights the RF model's capability to handle large datasets with varying data types and demonstrates its superiority over traditional regression models in predictive accuracy. Through an iterative process, the study refines the RF model to …