Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Computer Sciences (648)
- Artificial Intelligence and Robotics (297)
- Engineering (201)
- Data Science (148)
- Statistics and Probability (88)
-
- Computer Engineering (74)
- Databases and Information Systems (57)
- Electrical and Computer Engineering (53)
- Social and Behavioral Sciences (53)
- Other Computer Sciences (51)
- Life Sciences (47)
- Medicine and Health Sciences (45)
- Mathematics (43)
- Software Engineering (43)
- Theory and Algorithms (42)
- Applied Mathematics (40)
- Numerical Analysis and Scientific Computing (40)
- Information Security (33)
- Physics (30)
- Business (26)
- Earth Sciences (24)
- Bioinformatics (23)
- Statistical Models (23)
- Applied Statistics (22)
- Environmental Sciences (19)
- Graphics and Human Computer Interfaces (18)
- Mechanical Engineering (17)
- Operations Research, Systems Engineering and Industrial Engineering (16)
- Chemistry (15)
- Institution
-
- Singapore Management University (30)
- California Polytechnic State University, San Luis Obispo (28)
- Southern Methodist University (28)
- Western University (28)
- University of Texas at El Paso (27)
-
- Technological University Dublin (26)
- San Jose State University (25)
- University of South Florida (23)
- University of Wisconsin Milwaukee (23)
- University of Kentucky (22)
- City University of New York (CUNY) (20)
- Missouri University of Science and Technology (19)
- West Virginia University (19)
- University of Tennessee, Knoxville (18)
- Dartmouth College (17)
- University of Arkansas, Fayetteville (17)
- University of Nebraska - Lincoln (16)
- Utah State University (16)
- Northern Illinois University (15)
- Washington University in St. Louis (15)
- Wright State University (15)
- Claremont Colleges (14)
- University of South Carolina (12)
- Chapman University (11)
- Kennesaw State University (11)
- Selected Works (11)
- University of Nevada, Las Vegas (11)
- Virginia Commonwealth University (11)
- Clemson University (10)
- Purdue University (9)
- Publication Year
- Publication
-
- Theses and Dissertations (58)
- SMU Data Science Review (28)
- Open Access Theses & Dissertations (27)
- Master's Theses (25)
- Research Collection School Of Computing and Information Systems (25)
-
- Electronic Theses and Dissertations (24)
- Electronic Thesis and Dissertation Repository (24)
- Master's Projects (23)
- USF Tampa Graduate Theses and Dissertations (23)
- Doctoral Dissertations (19)
- Graduate Theses, Dissertations, and Problem Reports (18)
- Graduate Theses and Dissertations (15)
- Conference papers (14)
- Dissertations (14)
- Graduate Research Theses & Dissertations (13)
- McKelvey School of Engineering Theses & Dissertations (13)
- Browse all Theses and Dissertations (12)
- Masters Theses (12)
- Dissertations, Theses, and Capstone Projects (11)
- All Graduate Theses and Dissertations, Spring 1920 to Summer 2023 (10)
- UNLV Theses, Dissertations, Professional Papers, and Capstones (10)
- CCE Theses and Dissertations (8)
- CMC Senior Theses (8)
- Dissertations and Theses (8)
- Electronic Theses, Projects, and Dissertations (8)
- Theses and Dissertations--Computer Science (8)
- Computer Science Senior Theses (7)
- Department of Computer Science and Engineering: Dissertations, Theses, and Student Research (7)
- Dissertations, Master's Theses and Master's Reports (7)
- FIU Electronic Theses and Dissertations (7)
- Publication Type
- File Type
Articles 481 - 510 of 826
Full-Text Articles in Physical Sciences and Mathematics
Analysis On Suicidal Ideation Among Adolescents (12-17 Years) In The Usa, Himani Raturi
Analysis On Suicidal Ideation Among Adolescents (12-17 Years) In The Usa, Himani Raturi
Electronic Theses, Projects, and Dissertations
Suicide is one of the leading health concerns in United States among adolescents and the presence of suicidal ideation (SI) is quite high, with ~20-30% of adolescents reporting it at some point. Though we have seen growth and development in the prevention of suicide, there is limited research on the ability to identify the adolescents which might be at risk for SI. The objective behind the project is to identify adolescents with SI using machine learning.
The project shows statistics from different articles on adolescents in the U.S. For this study, adolescent data was taken from NSDUH 2018. Moreover, detailed …
Groundwater Storage Loss Associated With Land Subsidence In Western United States Mapped Using Machine Learning, Ryan G. Smith, Sayantan Majumdar
Groundwater Storage Loss Associated With Land Subsidence In Western United States Mapped Using Machine Learning, Ryan G. Smith, Sayantan Majumdar
Geosciences and Geological and Petroleum Engineering Faculty Research & Creative Works
Land subsidence caused by groundwater extraction has numerous negative consequences, such as loss of groundwater storage and damage to infrastructure. Understanding the magnitude, timing, and locations of land subsidence, as well as the mechanisms driving it, is crucial to implementing mitigation strategies, yet the complex, nonlinear processes causing subsidence are difficult to quantify. Physical models relating groundwater flux to aquifer compaction exist but require substantial hydrological data sets and are time consuming to calibrate. Land deformation can be measured using interferometric synthetic aperture radar (InSAR) and GPS, but the former is computationally expensive to estimate at scale and is subject …
What Was Written Vs. Who Read It: News Media Profiling Using Text Analysis And Social Media Context, Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav. Nakov
What Was Written Vs. Who Read It: News Media Profiling Using Text Analysis And Social Media Context, Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass, Preslav. Nakov
Research Collection School Of Computing and Information Systems
Predicting the political bias and the factuality of reporting of entire news outlets are critical elements of media profiling, which is an understudied but an increasingly important research direction. The present level of proliferation of fake, biased, and propagandistic content online has made it impossible to fact-check every single suspicious claim, either manually or automatically. Thus, it has been proposed to profile entire news outlets and to look for those that are likely to publish fake or biased content. This makes it possible to detect likely “fake news” the moment they are published, by simply checking the reliability of their …
Machine Learning For The Internet Of Things: Applications, Implementation, And Security, Vishalini Laguduva Ramnath
Machine Learning For The Internet Of Things: Applications, Implementation, And Security, Vishalini Laguduva Ramnath
USF Tampa Graduate Theses and Dissertations
Artificial intelligence and ubiquitous sensor systems have seen tremendous advances in recent times, resulting in groundbreaking impact across domains such as healthcare, entertainment, and transportation through a collective ecosystem called the Internet of Things. The advent of 5G and improved wireless networks will further accelerate the research and development of tools in deep learning, sensor systems, and computing platforms by providing improved network latency and bandwidth. While tremendous progress has been made in the Internet of Things, current work has largely focused on building robust applications that leverage the data collected through ubiquitous sensor nodes to provide actionable rules and …
Combining Machine Learning And Empirical Engineering Methods Towards Improving Oil Production Forecasting, Andrew J. Allen
Combining Machine Learning And Empirical Engineering Methods Towards Improving Oil Production Forecasting, Andrew J. Allen
Master's Theses
Current methods of production forecasting such as decline curve analysis (DCA) or numerical simulation require years of historical production data, and their accuracy is limited by the choice of model parameters. Unconventional resources have proven challenging to apply traditional methods of production forecasting because they lack long production histories and have extremely variable model parameters. This research proposes a data-driven alternative to reservoir simulation and production forecasting techniques. We create a proxy-well model for predicting cumulative oil production by selecting statistically significant well completion parameters and reservoir information as independent predictor variables in regression-based models. Then, principal component analysis (PCA) …
Algorithmic Robot Design: Label Maps, Procrustean Graphs, And The Boundary Of Non-Destructiveness, Shervin Ghasemlou
Algorithmic Robot Design: Label Maps, Procrustean Graphs, And The Boundary Of Non-Destructiveness, Shervin Ghasemlou
Theses and Dissertations
This dissertation is focused on the problem of algorithmic robot design. The process of designing a robot or a team of robots that can reliably accomplish a task in an environment requires several key elements. How the problem is formulated can play a big role in the design process. The ability of the model to correctly reflect the environment, the events, and different pieces of the problem is crucial. Another key element is the ability of the model to show the relationship between different designs of a single system. These two elements can enable design algorithms to navigate through the …
Using Group Affinity To Predict Community Formation In Social Networks, Joseph Leung
Using Group Affinity To Predict Community Formation In Social Networks, Joseph Leung
Undergraduate Honors Theses
A well-studied topic in network theory is detecting the communities found in real-world networks. Community detection is a technique to better understand the way in which small dense substructures appear in these networks. Such substructures can often tell important information about groups that form in such systems. A prominent feature of many networks is that they evolve over time, forming and dissolving new edges between different nodes that appear. In this thesis, we consider how we can use the community structure of a network at a certain point in time to predict the state of a network’s communities at some …
A Hybrid Approach To Procedural Dungeon Generation, Mathias Paul Babin
A Hybrid Approach To Procedural Dungeon Generation, Mathias Paul Babin
Electronic Thesis and Dissertation Repository
This thesis presents a novel approach to the Procedural Content Generation (PCG) of both maze and dungeon environments. The solution we propose in this thesis borrows techniques from both Procedural Content Generation via Machine Learning as well as Constructive PCG methods. The approach we take involves decomposing the problem of level generation into a series of stages which begins with the production of macro-level functional structures and ends with micro-level aesthetic details; specifically, we train a Deep Convolutional Neural Network to produce high-quality mazes, which in turn, are transformed into the rooms of larger dungeon levels using a constructive algorithm. …
Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa
Machine Learning With Digital Signal Processing For Rapid And Accurate Alignment-Free Genome Analysis: From Methodological Design To A Covid-19 Case Study, Gurjit Singh Randhawa
Electronic Thesis and Dissertation Repository
In the field of bioinformatics, taxonomic classification is the scientific practice of identifying, naming, and grouping of organisms based on their similarities and differences. The problem of taxonomic classification is of immense importance considering that nearly 86% of existing species on Earth and 91% of marine species remain unclassified. Due to the magnitude of the datasets, the need exists for an approach and software tool that is scalable enough to handle large datasets and can be used for rapid sequence comparison and analysis. We propose ML-DSP, a stand-alone alignment-free software tool that uses Machine Learning and Digital Signal Processing to …
Evidence-Based Detection Of Pancreatic Canc, Rajeshwari Deepak Chandratre
Evidence-Based Detection Of Pancreatic Canc, Rajeshwari Deepak Chandratre
Master's Projects
This study is an effort to develop a tool for early detection of pancreatic cancer using evidential reasoning. An evidential reasoning model predicts the likelihood of an individual developing pancreatic cancer by processing the outputs of a Support Vector Classifier, and other input factors such as smoking history, drinking history, sequencing reads, biopsy location, family and personal health history. Certain features of the genomic data along with the mutated gene sequence of pancreatic cancer patients was obtained from the National Cancer Institute (NIH) Genomic Data Commons (GDC). This data was used to train the SVC. A prediction accuracy of ~85% …
Computational Astronomy: Classification Of Celestial Spectra Using Machine Learning Techniques, Gayatri Milind Hungund
Computational Astronomy: Classification Of Celestial Spectra Using Machine Learning Techniques, Gayatri Milind Hungund
Master's Projects
Lightyears beyond the Planet Earth there exist plenty of unknown and unexplored stars and Galaxies that need to be studied in order to support the Big Bang Theory and also make important astronomical discoveries in quest of knowing the unknown. Sophisticated devices and high-power computational resources are now deployed to make a positive effort towards data gathering and analysis. These devices produce massive amount of data from the astronomical surveys and the data is usually in terabytes or petabytes. It is exhaustive to process this data and determine the findings in short period of time. Many details can be missed …
Vikingbot: The Starcraft Artificial Intelligence, Tyler Barger, Daniel Peterson
Vikingbot: The Starcraft Artificial Intelligence, Tyler Barger, Daniel Peterson
Scholars Week
VikingBot is an automated AI that plays StarCraft by using a combination of machine learning and artificial intelligence. High level strategies are planned using the Brown-UMBC Reinforcement Learning and Planning (BURLAP), library which implements planning algorithms and provides interfaces for defining a domain and models of that domain for planning. For the planning, we used the BURLAP implementation of the sparse sampling algorithm because the time complexity is independent of the size of the state space, and we have to plan quickly in real time. SARSA reinforcement learning is used for a machine learning model that controls combat units. Various …
Network Traffic Based Botnet Detection Using Machine Learning, Anand Ravindra Vishwakarma
Network Traffic Based Botnet Detection Using Machine Learning, Anand Ravindra Vishwakarma
Master's Projects
The field of information and computer security is rapidly developing in today’s world as the number of security risks is continuously being explored every day. The moment a new software or a product is launched in the market, a new exploit or vulnerability is exposed and exploited by the attackers or malicious users for different motives. Many attacks are distributed in nature and carried out by botnets that cause widespread disruption of network activity by carrying out DDoS (Distributed Denial of Service) attacks, email spamming, click fraud, information and identity theft, virtual deceit and distributed resource usage for cryptocurrency mining. …
Sensor Data Analysis In Smart Buildings, Manuel A. Mane Penton
Sensor Data Analysis In Smart Buildings, Manuel A. Mane Penton
Publications and Research
Data analysis and Machine Learning are destined to evolve the current technology infrastructure by solving technology and economy demands present mainly in developed cities like New York. This research proposes a machine learning (ML) based solution to alleviate one of the main issues that big buildings such as CUNY campuses have, that is the waste of energy resources. The analysis of data coming from the readings of different deployed sensors such as CO2, humidity and temperature can be used to estimate occupancy in a specific room and building in general. The outcome of this research established a relationship between the …
Toward The Automatic Classification Of Self-Affirmed Refactoring, Mohamed Wiem Mkaouer, Eman Abdullah Alomar, Ali Ouni
Toward The Automatic Classification Of Self-Affirmed Refactoring, Mohamed Wiem Mkaouer, Eman Abdullah Alomar, Ali Ouni
Articles
The concept of Self-Affirmed Refactoring (SAR) was introduced to explore how developers document their refactoring activities in commit messages, i.e., developers explicit documentation of refactoring operations intentionally introduced during a code change. In our previous study, we have manually identified refactoring patterns and defined three main common quality improvement categories including internal quality attributes, external quality attributes, and code smells, by only considering refactoring-related commits. However, this approach heavily depends on the manual inspection of commit messages. In this paper, we propose a two-step approach to first identify whether a commit describes developer-related refactoring events, then to classify it according …
Integrated Machine Learning And Bioinformatics Approaches For Prediction Of Cancer-Driving Gene Mutations, Oluyemi Odeyemi
Integrated Machine Learning And Bioinformatics Approaches For Prediction Of Cancer-Driving Gene Mutations, Oluyemi Odeyemi
Computational and Data Sciences (PhD) Dissertations
Cancer arises from the accumulation of somatic mutations and genetic alterations in cell division checkpoints and apoptosis, this often leads to abnormal tumor proliferation. Proper classification of cancer-linked driver mutations will considerably help our understanding of the molecular dynamics of cancer. In this study, we compared several cancer-specific predictive models for prediction of driver mutations in cancer-linked genes that were validated on canonical data sets of functionally validated mutations and applied to a raw cancer genomics data. By analyzing pathogenicity prediction and conservation scores, we have shown that evolutionary conservation scores play a pivotal role in the classification of cancer …
Chaff From The Wheat: Characterizing And Determining Valid Bug Reports, Yuanrui Fan, Xin Xia, David Lo, Ahmed E. Hassan
Chaff From The Wheat: Characterizing And Determining Valid Bug Reports, Yuanrui Fan, Xin Xia, David Lo, Ahmed E. Hassan
Research Collection School Of Computing and Information Systems
Developers use bug reports to triage and fix bugs. When triaging a bug report, developers must decide whether the bug report is valid (i.e., a real bug). A large amount of bug reports are submitted every day, with many of them end up being invalid reports. Manually determining valid bug report is a difficult and tedious task. Thus, an approach that can automatically analyze the validity of a bug report and determine whether a report is valid can help developers prioritize their triaging tasks and avoid wasting time and effort on invalid bug reports. In this study, motivated by the …
Early Warning Solar Storm Prediction, Ian D. Lumsden, Marvin Joshi, Matthew Smalley, Aiden Rutter, Ben Klein
Early Warning Solar Storm Prediction, Ian D. Lumsden, Marvin Joshi, Matthew Smalley, Aiden Rutter, Ben Klein
Chancellor’s Honors Program Projects
No abstract provided.
Knot Flow Classification And Its Applications In Vehicular Ad-Hoc Networks (Vanet), David Schmidt
Knot Flow Classification And Its Applications In Vehicular Ad-Hoc Networks (Vanet), David Schmidt
Electronic Theses and Dissertations
Intrusion detection systems (IDSs) play a crucial role in the identification and mitigation for attacks on host systems. Of these systems, vehicular ad hoc networks (VANETs) are difficult to protect due to the dynamic nature of their clients and their necessity for constant interaction with their respective cyber-physical systems. Currently, there is a need for a VANET-specific IDS that meets this criterion. To this end, a spline-based intrusion detection system has been pioneered as a solution. By combining clustering with spline-based general linear model classification, this knot flow classification method (KFC) allows for robust intrusion detection to occur. Due its …
Achieving Causal Fairness In Machine Learning, Yongkai Wu
Achieving Causal Fairness In Machine Learning, Yongkai Wu
Graduate Theses and Dissertations
Fairness is a social norm and a legal requirement in today's society. Many laws and regulations (e.g., the Equal Credit Opportunity Act of 1974) have been established to prohibit discrimination and enforce fairness on several grounds, such as gender, age, sexual orientation, race, and religion, referred to as sensitive attributes. Nowadays machine learning algorithms are extensively applied to make important decisions in many real-world applications, e.g., employment, admission, and loans. Traditional machine learning algorithms aim to maximize predictive performance, e.g., accuracy. Consequently, certain groups may get unfairly treated when those algorithms are applied for decision-making. Therefore, it is an imperative …
Dynamic Fraud Detection Via Sequential Modeling, Panpan Zheng
Dynamic Fraud Detection Via Sequential Modeling, Panpan Zheng
Graduate Theses and Dissertations
The impacts of information revolution are omnipresent from life to work. The web services have signicantly changed our living styles in daily life, such as Facebook for communication and Wikipedia for knowledge acquirement. Besides, varieties of information systems, such as data management system and management information system, make us work more eciently. However, it is usually a double-edged sword. With the popularity of web services, relevant security issues are arising, such as fake news on Facebook and vandalism on Wikipedia, which denitely impose severe security threats to OSNs and their legitimate participants. Likewise, oce automation incurs another challenging security issue, …
Advanced Techniques To Detect Complex Android Malware, Zhiqiang Li
Advanced Techniques To Detect Complex Android Malware, Zhiqiang Li
Department of Computer Science and Engineering: Dissertations, Theses, and Student Research
Android is currently the most popular operating system for mobile devices in the world. However, its openness is the main reason for the majority of malware to be targeting Android devices. Various approaches have been developed to detect malware.
Unfortunately, new breeds of malware utilize sophisticated techniques to defeat malware detectors. For example, to defeat signature-based detectors, malware authors change the malware’s signatures to avoid detection. As such, a more effective approach to detect malware is by leveraging malware’s behavioral characteristics. However, if a behavior-based detector is based on static analysis, its reported results may contain a large number of …
Subsurface Analytics: Contribution Of Artificial Intelligence And Machine Learning To Reservoir Engineering, Reservoir Modeling, And Reservoir Management, Shahab D. Mohaghegh
Subsurface Analytics: Contribution Of Artificial Intelligence And Machine Learning To Reservoir Engineering, Reservoir Modeling, And Reservoir Management, Shahab D. Mohaghegh
Faculty & Staff Scholarship
Subsurface Analytics is a new technology that changes the way reservoir simulation and modeling is performed. Instead of starting with the construction of mathematical equations to model the physics of the fluid flow through porous media and then modification of the geological models in order to achieve history match, Subsurface Analytics that is a completely AI-based reservoir simulation and modeling technology takes a completely different approach. In AI-based reservoir modeling, field measurements form the foundation of the reservoir model. Using data-driven, pattern recognition technologies; the physics of the fluid flow through porous media is modeled through discovering the best, most …
Finding Critical And Gradient-Flat Points Of Deep Neural Network Loss Functions, Charles Gearhart Frye '09
Finding Critical And Gradient-Flat Points Of Deep Neural Network Loss Functions, Charles Gearhart Frye '09
Doctoral Dissertations
Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-based optimization algorithms converge to approximately the same performance from many random initial points. This makes neural networks easy to train, which, combined with their high representational capacity and implicit and explicit regularization strategies, leads to machine-learned algorithms of high quality with reasonable computational cost in a wide variety of domains.
One thread of work has focused on explaining this phenomenon by numerically characterizing the local curvature at critical points of the loss function, where gradients are zero. Such studies have reported that the loss functions …
Explainable Deep Learning For Medical Image Analysis, Brennan Rhoadarmer
Explainable Deep Learning For Medical Image Analysis, Brennan Rhoadarmer
UCARE Research Products
Explainable Deep Learning for Medical Image Analysis is a project focused on improving the ability for deep learning models to explain the reasoning behind their classification in order to improve their viability in the medical field, where explanations of decisions is critical for the care of patients. In order to explore this topic, we work to implement GradCAM, which is a new method of determining the cause classification in models by tracing back through the model layers to the input.
Robust Neural Machine Translation, Abdul Rafae Khan
Robust Neural Machine Translation, Abdul Rafae Khan
Dissertations, Theses, and Capstone Projects
This thesis aims for general robust Neural Machine Translation (NMT) that is agnostic to the test domain. NMT has achieved high quality on benchmarks with closed datasets such as WMT and NIST but can fail when the translation input contains noise due to, for example, mismatched domains or spelling errors. The standard solution is to apply domain adaptation or data augmentation to build a domain-dependent system. However, in real life, the input noise varies in a wide range of domains and types, which is unknown in the training phase. This thesis introduces five general approaches to improve NMT accuracy and …
Glacier Segmentation In Satellite Images For Hindu Kush Himalaya Region, Bibek Aryal
Glacier Segmentation In Satellite Images For Hindu Kush Himalaya Region, Bibek Aryal
Open Access Theses & Dissertations
Climate change poses a risk to individuals whose livelihoods depend on the health of glacier ecosystems. Monitoring glaciers in the Himalayan Hindu Kush (HKH) region is of high importance especially when we consider the impact of recent climate change on them. Our work aims to provide an automated method to outline glaciers using machine learning techniques and publicly available remote sensing imagery.In this work, we present ways to delineate glaciers from Landsat-7 imagery using various machine learning and computer vision techniques. The multi-step methodology that we present in this work is generalizable across different types of satellite and overhead imagery, …
Benchmarking Machine Learning Methods For Molecular Property Prediction, Govinda Bahadur Kc
Benchmarking Machine Learning Methods For Molecular Property Prediction, Govinda Bahadur Kc
Open Access Theses & Dissertations
Machine learning (ML) techniques have been widely applied in a variety of areas ranging from pattern recognition, natural language processing, and computer games to self-driving cars, clinical diagnostics, and molecular structure prediction easing day to day life of human beings. Drug discovery is an expensive, complex, and time taking process. Currently, the pharma industry is hoping to leverage machine learning methods in expediting the drug discovery process. Molecular property prediction is one of the most important tasks in drug discovery. While developing a new drug relies on a proper understanding of molecular properties, there has been great interest in the …
Artificial Neural Network Models For Pattern Discovery From Ecg Time Series, Mehakpreet Kaur
Artificial Neural Network Models For Pattern Discovery From Ecg Time Series, Mehakpreet Kaur
Electronic Theses and Dissertations
Artificial Neural Network (ANN) models have recently become de facto models for deep learning with a wide range of applications spanning from scientific fields such as computer vision, physics, biology, medicine to social life (suggesting preferred movies, shopping lists, etc.). Due to advancements in computer technology and the increased practice of Artificial Intelligence (AI) in medicine and biological research, ANNs have been extensively applied not only to provide quick information about diseases, but also to make diagnostics accurate and cost-effective. We propose an ANN-based model to analyze a patient's electrocardiogram (ECG) data and produce accurate diagnostics regarding possible heart diseases …
Brexit: Psychometric Profiling The Political Salubrious Through Machine Learning: Predicting Personality Traits Of Boris Johnson Through Twitter Political Text, James Usher, Pierpaolo Dondio
Brexit: Psychometric Profiling The Political Salubrious Through Machine Learning: Predicting Personality Traits Of Boris Johnson Through Twitter Political Text, James Usher, Pierpaolo Dondio
Conference papers
Whilst the CIA have been using psychometric profiling for decades, Cambridge Analytica showed that people's psychological characteristics can be accurately predicted from their digital footprints, such as their Facebook or Twitter accounts. To exploit this form of psychological assessment from digital footprints, we propose machine learning methods for assessing political personality from Twitter. We have extracted the tweet content of Prime Minster Boris Johnson’s Twitter account and built three predictive personality models based on his Twitter political content. We use a Multi-Layer Perceptron Neural network, a Naive Bayes multinomial model and a Support Machine Vector model to predict the OCEAN …