Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Machine Learning

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 91 - 120 of 826

Full-Text Articles in Physical Sciences and Mathematics

Data-Driven 2d Materials Discovery For Next-Generation Electronics, Zeyu Zhang Aug 2023

Data-Driven 2d Materials Discovery For Next-Generation Electronics, Zeyu Zhang

Dissertations

The development of material discovery and design has lasted centuries in human history. After the concept of modern chemistry and material science was established, the strategy of material discovery relies on the experiments. Such a strategy becomes expensive and time-consuming with the increasing number of materials nowadays. Therefore, a novel strategy that is faster and more comprehensive is urgently needed. In this dissertation, an experiment-guided material discovery strategy is developed and explained using metal-organic frameworks (MOFs) as instances. The advent of 7r-stacked layered MOFs, which offer electrical conductivity on top of permanent porosity and high surface area, opened up new …


Global Cyber Attack Forecast Using Ai Techniques, Nusrat Kabir Samia Aug 2023

Global Cyber Attack Forecast Using Ai Techniques, Nusrat Kabir Samia

Electronic Thesis and Dissertation Repository

The advancement of internet technology and growing involvement in the cyber world have made us prone to cyber-attacks inducing severe damage to individuals and organizations, including financial loss, identity theft, and reputational damage. The rapid emergence and evolution of new networks and new opportunities for businesses and technologies are increasing threats to security vulnerabilities. Hence cyber-crime analysis is one of the wide range applications of Data Mining that can be eventually used to predict and detect crime. However, there are several constraints while analyzing cyber-attacks, which are yet to be resolved for more accurate cyber security inspection.

Although there are …


Predicting Network Failures With Ai Techniques, Chandrika Saha Aug 2023

Predicting Network Failures With Ai Techniques, Chandrika Saha

Electronic Thesis and Dissertation Repository

Network failure is the unintentional interruption of internet services, resulting in widespread client frustration. It is especially true for time-sensitive services in the healthcare industry, smart grid control, and mobility control, among others. In addition, the COVID-19 pandemic has compelled many businesses to operate remotely, making uninterrupted internet access essential. Moreover, Internet Service Providers (ISPs) lose millions of dollars annually due to network failure, which has a negative impact on their businesses. Currently, redundant network equipment is used as a restoration technique to resolve this issue of network failure. This technique requires a strategy for failure identification and prediction to …


How I Read An Article That Uses Machine Learning Methods, Aziz Nazha, Olivier Elemento, Shannon Mcweeney, Moses Miles, Torsten Haferlach Aug 2023

How I Read An Article That Uses Machine Learning Methods, Aziz Nazha, Olivier Elemento, Shannon Mcweeney, Moses Miles, Torsten Haferlach

Kimmel Cancer Center Faculty Papers

No abstract provided.


Towards Automated Mineral Identification In Martian Rocks From X-Ray Diffraction Patterns, Luke Tambakis Aug 2023

Towards Automated Mineral Identification In Martian Rocks From X-Ray Diffraction Patterns, Luke Tambakis

Electronic Thesis and Dissertation Repository

The CheMin (Chemistry and Mineralogy) instrument on the Curiosity rover has provided a rich set of X-ray diffraction (XRD) patterns from Martian rocks and regolith. These XRD patterns have allowed geologists to make exciting new discoveries about the mineralogy and the geological history of Mars. These discoveries pave the way for further Martian exploration and provide a deeper understanding of Martian geology. The Curiosity rover is very slow by design, travelling at about 4 cm/s. New, faster rovers are being developed to increase scientific throughput and exploration. XRD is valuable for future missions as it can produce new discov- eries …


Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler Aug 2023

Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler

SMU Data Science Review

In recent years, various new Machine Learning and Deep Learning algorithms have been introduced, claiming to offer better performance than traditional statistical approaches when forecasting time series. Studies seeking evidence to support the usage of ML/DL over statistical approaches have been limited to comparing the forecasting performance of univariate, linear time series data. This research compares the performance of traditional statistical-based and ML/DL methods for forecasting multivariate and nonlinear time series.


Application Of Crystal Engineering In Multicomponent Pharmaceutical Crystals: A Study Of Theory And Practice, Soroush Ahmadi Nasrabadi Aug 2023

Application Of Crystal Engineering In Multicomponent Pharmaceutical Crystals: A Study Of Theory And Practice, Soroush Ahmadi Nasrabadi

Electronic Thesis and Dissertation Repository

Multicomponent crystallization, a prominent strategy in crystal engineering, offers the ability to modify the physicochemical properties of crystals by introducing a secondary component to their lattice structure. Such multicomponent crystals have found widespread application in the pharmaceutical industry. This thesis explores the experimental screening, characterization, application, and theoretical prediction of multicomponent crystals of Active Pharmaceutical Ingredients (APIs).

The first case study investigates a new solvate of Dasatinib which exhibits high instability at room temperature and transforms into a different polymorph upon desolvation. The crystal structure of this compound is obtained, revealing insights into its transient nature and the potential application …


Secondary Features Of Importance For A Url Ranking, Atajan Abdyyev Aug 2023

Secondary Features Of Importance For A Url Ranking, Atajan Abdyyev

Dissertations and Theses

This paper investigates the impact of secondary ranking factors on webpage relevance and rankings in the context of Search Engine Optimization (SEO), focusing on the jewelry domain within the United States e-commerce market. By generating a keyword list related to jewelry and retrieving top URLs from Google's search results, the study employs machine learning models including XGBoost, CatBoost, and Linear Regression to identify key features influencing webpage relevance and rankings.The findings highlight specific optimal ranges for features like Outlinks, Unique Inlinks, Flesch Reading Ease Score, and others, indicating their significant impact on better rankings. Notably, Random Forest model performed best …


Innovations In Geospatial Technologies For Water And Environmental Resource Protection, Madeleine Bolick Aug 2023

Innovations In Geospatial Technologies For Water And Environmental Resource Protection, Madeleine Bolick

All Dissertations

New technologies and applications of technologies are critical to protecting environmental and water resources. Students need to be aware of these new technologies so they can be prepared to utilize them in their future careers. One such technology is using Unmanned Aerial Vehicles (UAVs) because it can be applied to a wide variety of fields such as engineering, construction, wildlife biology, agriculture, and many more. Chapter two discusses the creation of an online teaching module to introduce students to using UAVs in natural resource research and evaluates how well students respond to the education module. Overall, student familiarity with UAVs …


Bayesian Optimization With Switching Cost: Regret Analysis And Lookahead Variants, Peng Liu, Haowei Wang, Wei Qiyu Aug 2023

Bayesian Optimization With Switching Cost: Regret Analysis And Lookahead Variants, Peng Liu, Haowei Wang, Wei Qiyu

Research Collection Lee Kong Chian School Of Business

Bayesian Optimization (BO) has recently received increasing attention due to its efficiency in optimizing expensive-to-evaluate functions. For some practical problems, it is essential to consider the path-dependent switching cost between consecutive sampling locations given a total traveling budget. For example, when using a drone to locate cracks in a building wall or search for lost survivors in the wild, the search path needs to be efficiently planned given the limited battery power of the drone. Tackling such problems requires a careful cost-benefit analysis of candidate locations and balancing exploration and exploitation. In this work, we formulate such a problem as …


Genetic Programming To Optimize Performance Of Machine Learning Algorithms On Unbalanced Data Set, Asitha Thumpati Aug 2023

Genetic Programming To Optimize Performance Of Machine Learning Algorithms On Unbalanced Data Set, Asitha Thumpati

Electronic Theses, Projects, and Dissertations

Data collected from the real world is often imbalanced, meaning that the distribution of data across known classes is biased or skewed. When using machine learning classification models on such imbalanced data, predictive performance tends to be lower because these models are designed with the assumption of balanced classes or a relatively equal number of instances for each class. To address this issue, we employ data preprocessing techniques such as SMOTE (Synthetic Minority Oversampling Technique) for oversampling data and random undersampling for undersampling data on unbalanced datasets. Once the dataset is balanced, genetic programming is utilized for feature selection to …


Generalization Through Diversity: Improving Unsupervised Environment Design, Wenjun Li, Pradeep Varakantham, Dexun Li Aug 2023

Generalization Through Diversity: Improving Unsupervised Environment Design, Wenjun Li, Pradeep Varakantham, Dexun Li

Research Collection School Of Computing and Information Systems

Agent decision making using Reinforcement Learning (RL) heavily relies on either a model or simulator of the environment (e.g., moving in an 8x8 maze with three rooms, playing Chess on an 8x8 board). Due to this dependence, small changes in the environment (e.g., positions of obstacles in the maze, size of the board) can severely affect the effectiveness of the policy learned by the agent. To that end, existing work has proposed training RL agents on an adaptive curriculum of environments (generated automatically) to improve performance on out-of-distribution (OOD) test scenarios. Specifically, existing research has employed the potential for the …


An Interval-Valued Random Forests, Paul Gaona Partida Aug 2023

An Interval-Valued Random Forests, Paul Gaona Partida

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

There is a growing demand for the development of new statistical models and the refinement of established methods to accommodate different data structures. This need arises from the recognition that traditional statistics often assume the value of each observation to be precise, which may not hold true in many real-world scenarios. Factors such as the collection process and technological advancements can introduce imprecision and uncertainty into the data.

For example, consider data collected over a long period of time, where newer measurement tools may offer greater accuracy and provide more information than previous methods. In such cases, it becomes crucial …


Stressor: An R Package For Benchmarking Machine Learning Models, Samuel A. Haycock Aug 2023

Stressor: An R Package For Benchmarking Machine Learning Models, Samuel A. Haycock

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Many discipline specific researchers need a way to quickly compare the accuracy of their predictive models to other alternatives. However, many of these researchers are not experienced with multiple programming languages. Python has recently been the leader in machine learning functionality, which includes the PyCaret library that allows users to develop high-performing machine learning models with only a few lines of code. The goal of the stressor package is to help users of the R programming language access the advantages of PyCaret without having to learn Python. This allows the user to leverage R’s powerful data analysis workflows, while simultaneously …


On Training Neurons With Bounded Compilations, Lance Kennedy Jul 2023

On Training Neurons With Bounded Compilations, Lance Kennedy

Master of Science in Computer Science Theses

Knowledge compilation offers a formal approach to explaining and verifying the behavior of machine learning systems, such as neural networks. Unfortunately, compiling even an individual neuron into a tractable representation such as an Ordered Binary Decision Diagram (OBDD), is an NP-hard problem. In this thesis, we consider the problem of training a neuron from data, subject to the constraint that it has a compact representation as an OBDD. Our approach is based on the observation that a neuron can be compiled into an OBDD in polytime if (1) the neuron has integer weights, and (2) its aggregate weight is bounded. …


Using Machine Learning Techniques To Model Encoder/Decoder Pair For Non-Invasive Electroencephalographic Wireless Signal Transmission, Ernst Fanfan Jul 2023

Using Machine Learning Techniques To Model Encoder/Decoder Pair For Non-Invasive Electroencephalographic Wireless Signal Transmission, Ernst Fanfan

Master of Science in Computer Science Theses

This study investigated the application and enhancement of Non-Invasive Brain-Computer Interfaces (NI-BCIs), focused on enhancing the efficiency and effectiveness of this technology for individuals with severe physical limitations. The core research goal was to improve current limitations associated with wires, noise, and invasive procedures often associated with BCI technology. The key discussed solution involves developing an optimized Encoder/Decoder (E/D) pair using machine learning techniques, particularly those borrowed from Generative Adversarial Networks (GAN) and other Deep Neural Networks, to minimize data transmission and ensure robustness against data degradation. The study highlighted the crucial role of machine learning in self-adjusting and isolating …


Case Study: The Impact Of Emerging Technologies On Cybersecurity Education And Workforces, Austin Cusak Jul 2023

Case Study: The Impact Of Emerging Technologies On Cybersecurity Education And Workforces, Austin Cusak

Journal of Cybersecurity Education, Research and Practice

A qualitative case study focused on understanding what steps are needed to prepare the cybersecurity workforces of 2026-2028 to work with and against emerging technologies such as Artificial Intelligence and Machine Learning. Conducted through a workshop held in two parts at a cybersecurity education conference, findings came both from a semi-structured interview with a panel of experts as well as small workgroups of professionals answering seven scenario-based questions. Data was thematically analyzed, with major findings emerging about the need to refocus cybersecurity STEM at the middle school level with problem-based learning, the disconnects between workforce operations and cybersecurity operators, the …


Extending The Convolution In Graph Neural Networks To Solve Materials Science And Node Classification Problems, Steph-Yves Mike Louis Jul 2023

Extending The Convolution In Graph Neural Networks To Solve Materials Science And Node Classification Problems, Steph-Yves Mike Louis

Theses and Dissertations

The usage of graph to represent one's data in machine learning has grown in popularity in both academia and the industry due to its inherent benefits. With its flexible nature and immediate translation to real life observed objects, graph representation had a considerable contribution in advancing the state-of-the-art performance of machine learning in materials.

In this dissertation proposal, we discuss how machines can learn from graph encoded data and provide excellent results through graph neural networks (GNN). Notably, we focus our adaptation of graph neural networks on three tasks: predicting crystal materials properties, nullifying the negative impact of inferior graph …


Generation Of Cardiac Chamber Models Using Interpretable Generative Neural Networks For Electrophysiology Studies, Sunil Mathew Jul 2023

Generation Of Cardiac Chamber Models Using Interpretable Generative Neural Networks For Electrophysiology Studies, Sunil Mathew

Dissertations (1934 -)

An Electrophysiology study is conducted to diagnose and treat heart rhythm disorders, such as arrhythmias (abnormal heartbeat) like atrial fibrillation. A catheter is inserted into the chamber of interest to acquire 3D location and electrical information to create an electroanatomical map. This dissertation explores the design of a mapping system based on interpretable generative neural networks for generating patient specific cardiac models. Chapter 1 provides an introduction to electroanatomical mapping, the need for interpretability in neural networks and other relevant topics that are discussed in detail in the chapters that follow. Neural networks are often very large models with millions …


Predicting Material Structures And Properties Using Deep Learning And Machine Learning Algorithms, Yuqi Song Jul 2023

Predicting Material Structures And Properties Using Deep Learning And Machine Learning Algorithms, Yuqi Song

Theses and Dissertations

Discovering new materials and understanding their crystal structures and chemical properties are critical tasks in the material sciences. Although computational methodologies such as Density Functional Theory (DFT), provide a convenient means for calculating certain properties of materials or predicting crystal structures when combined with search algorithms, DFT is computationally too demanding for structure prediction and property calculation for most material families, especially for those materials with a large number of atoms. This dissertation aims to address this limitation by developing novel deep learning and machine learning algorithms for effective prediction of material crystal structures and properties. Our data-driven machine learning …


Computational Studies Of Bond Dissociation Energies And Organic Reaction Mechanisms, Shehani Thishakkya Wetthasinghe Jul 2023

Computational Studies Of Bond Dissociation Energies And Organic Reaction Mechanisms, Shehani Thishakkya Wetthasinghe

Theses and Dissertations

This dissertation presents the progress of two independent projects. Chapter 2 and Chapter 3 focus on the first project, which involves material exploration utilizing machine learning techniques. We explore the potential use of cobaltocenium (CoCp+2) derivatives as metal cations in anion exchange membranes (AEMs) for alkaline fuel cells, highlighting their superior thermal and alkaline stability compared to ammonium derivatives. The stability of CoCp+2 can be fine-tuned by varying the substituent groups attached to the cyclopentadienyl ring (Cp) in CoCp+2 .These derivatives encompass a variety of electron-donating and electron-withdrawing groups as substituents on both …


Exploratory Data-Driven Models For Water Quality: A Case Study For Tampa Bay Water, Sandra Sekyere Jun 2023

Exploratory Data-Driven Models For Water Quality: A Case Study For Tampa Bay Water, Sandra Sekyere

USF Tampa Graduate Theses and Dissertations

Water, a crucial resource for sustaining life, covers approximately 70% of the earth's surface. Nonetheless, the quality of water is deteriorating rapidly due to the rapid growth of urban areas and industries, which is a worrying trend causing harm to human health and the ecosystem. Water quality forecasting has a key role in water resources management by enabling effective pollution control, ecosystem monitoring, and decision-making.

Previously, traditional statistical models were used to forecast water quality, but they were unable to examine the non-linear relationships between water quality parameters, and they assumed that all datasets were distributed normally. This study uses …


Insect Classification And Explainability From Image Data Via Deep Learning Techniques, Tanvir Hossain Bhuiyan Jun 2023

Insect Classification And Explainability From Image Data Via Deep Learning Techniques, Tanvir Hossain Bhuiyan

USF Tampa Graduate Theses and Dissertations

Since the dawn of the Industrial Revolution, humanity has always tried to make labor more efficient and automated, and this trend is only continuing in the modern digital age. With the advent of artificial intelligence (AI) techniques in the latter part of the 20th century, the speed and scale with which AI has been leveraged to automate tasks defy human imagination. Many people deeply entrenched in the technology field are genuinely intrigued and concerned about how AI may change many of the ways in which humans have been living for millennia. Only time will provide the answers. This dissertation is …


Stereotypes And Language Models: Understanding How Language Models Encode Stereotypes, Debiasing Language Models, And Examining How Stereotypes Affect Conversations, Brian C. Wang Jun 2023

Stereotypes And Language Models: Understanding How Language Models Encode Stereotypes, Debiasing Language Models, And Examining How Stereotypes Affect Conversations, Brian C. Wang

Computer Science Senior Theses

This thesis describes a variety of approaches in examining how language models encode stereotypes (understanding stereotypes from a model point-of-view), debiasing language models, and using language models to understand how stereotypes affect conversations (understanding stereotypes from a conversational point-of-view). We present a novel approach for textual clues analysis that makes language models more interpretable, combining the understanding of what stereotypes the internal structures of language models have encoded during their initial training (via attention-based analysis) and understanding what textual clues are most relevant to identifying stereotypes for models trained to detect stereotypes (via SHAP-based analysis). We find that different pre-trained …


Stream-Evolving Bot Detection Framework Using Graph-Based And Feature-Based Approaches For Identifying Social Bots On Twitter, Eiman Alothali Jun 2023

Stream-Evolving Bot Detection Framework Using Graph-Based And Feature-Based Approaches For Identifying Social Bots On Twitter, Eiman Alothali

Dissertations

This dissertation focuses on the problem of evolving social bots in online social networks, particularly Twitter. Such accounts spread misinformation and inflate social network content to mislead the masses. The main objective of this dissertation is to propose a stream-based evolving bot detection framework (SEBD), which was constructed using both graph- and feature-based models. It was built using Python, a real-time streaming engine (Apache Kafka version 3.2), and our pretrained model (bot multi-view graph attention network (Bot-MGAT)). The feature-based model was used to identify predictive features for bot detection and evaluate the SEBD predictions. The graph-based model was used to …


A Novel Approach To Extending Music Using Latent Diffusion, Keon Roohparvar, Franz J. Kurfess Jun 2023

A Novel Approach To Extending Music Using Latent Diffusion, Keon Roohparvar, Franz J. Kurfess

Master's Theses

Using deep learning to synthetically generate music is a research domain that has gained more attention from the public in the past few years. A subproblem of music generation is music extension, or the task of taking existing music and extending it. This work proposes the Continuer Pipeline, a novel technique that uses deep learning to take music and extend it in 5 second increments. It does this by treating the musical generation process as an image generation problem; we utilize latent diffusion models (LDMs) to generate spectrograms, which are image representations of music. The Continuer Pipeline is able to …


Realizing Molecular Machine Learning Through Communications For Biological Ai: Future Directions And Challenges, Sasitharan Balasubramaniam, Samitha Somathilaka, Sehee Sun, Adrian Ratwatte, Massimiliano Pierobon Jun 2023

Realizing Molecular Machine Learning Through Communications For Biological Ai: Future Directions And Challenges, Sasitharan Balasubramaniam, Samitha Somathilaka, Sehee Sun, Adrian Ratwatte, Massimiliano Pierobon

School of Computing: Faculty Publications

Artificial Intelligence (AI) and Machine Learning (ML) are weaving their way into the fabric of society, where they are playing a crucial role in numerous facets of our lives. As we witness the increased deployment of AI and ML in various types of devices, we benefit from their use into energy-efficient algorithms for low powered devices. In this paper, we investigate a scale and medium that is far smaller than conventional devices as we move towards molecular systems that can be utilized to perform machine learning functions, i.e., Molecular Machine Learning (MML). Fundamental to the operation of MML is the …


Machine Learning And Network Embedding Methods For Gene Co-Expression Networks, Niloofar Aghaieabiane May 2023

Machine Learning And Network Embedding Methods For Gene Co-Expression Networks, Niloofar Aghaieabiane

Dissertations

High-throughput technologies such as DNA microarrays and RNA-seq are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed into Gene Co-expression Networks (GCNs). GCNs are analyzed to discover gene modules. GCN construction and analysis is a well-studied topic, for nearly two decades. While new types of sequencing and the corresponding data are now available, the software package WGCNA and its most recent variants are still widely used, contributing to biological discovery.

The discovery of biologically significant modules of genes from raw expression data is …


Ai Approaches To Understand Human Deceptions, Perceptions, And Perspectives In Social Media, Chih-Yuan Li May 2023

Ai Approaches To Understand Human Deceptions, Perceptions, And Perspectives In Social Media, Chih-Yuan Li

Dissertations

Social media platforms have created virtual space for sharing user generated information, connecting, and interacting among users. However, there are research and societal challenges: 1) The users are generating and sharing the disinformation 2) It is difficult to understand citizens' perceptions or opinions expressed on wide variety of topics; and 3) There are overloaded information and echo chamber problems without overall understanding of the different perspectives taken by different people or groups.

This dissertation addresses these three research challenges with advanced AI and Machine Learning approaches. To address the fake news, as deceptions on the facts, this dissertation presents Machine …


An Algorithmic Approach To Jazz Guitar Voice-Leading Chord Fingerings, Matthew B. Keating May 2023

An Algorithmic Approach To Jazz Guitar Voice-Leading Chord Fingerings, Matthew B. Keating

Computer Science Senior Theses

A problem in guitar practice is choosing chord voicings that fit together in sequence, a process known as voice leading. In jazz, a guitarist follows voice leading by maintaining stepwise or limited motion for smoother harmony. The main avenues to learn jazz guitar voice leading theory are through a guitar instructor or chord books. To our knowledge, no computational method of generating voice-leading given chord labels exists. First, we demonstrate the complexity of this problem by presenting a graph search algorithm to optimize for a simplified version of voice leading. Then, we present a novel approach to algorithmically derive tablature …