Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons


Machine learning



Full-Text Articles in Physical Sciences and Mathematics

Exploring Cyberterrorism, Topic Models And Social Networks Of Jihadists Dark Web Forums: A Computational Social Science Approach, Vivian Fiona Guetler Jan 2022


Graduate Theses, Dissertations, and Problem Reports

This three-article dissertation focuses on cyber-related topics on terrorist groups, specifically Jihadists’ use of technology, the application of natural language processing, and social networks in analyzing text data derived from terrorists' Dark Web forums. The first article explores cybercrime and cyberterrorism. As technology progresses, it facilitates new forms of behavior, including tech-related crimes known as cybercrime and cyberterrorism. In this article, I provide an analysis of the problems of cybercrime and cyberterrorism within the field of criminology by reviewing existing literature focusing on (a) the issues in defining terrorism, cybercrime, and cyberterrorism, (b) ways that cybercriminals commit a crime in …


Using Landsat-Based Phenology Metrics, Terrain Variables, And Machine Learning For Mapping And Probabilistic Prediction Of Forest Community Types In West Virginia, Faith M. Hartley Jan 2022


Graduate Theses, Dissertations, and Problem Reports

This study investigates the mapping of forest community types for the entire state of West Virginia, USA using Global Land Analysis and Discovery (GLAD) Phenology Metrics analysis ready data (ARD) derived from the Landsat time series and digital terrain variables derived from a digital terrain model (DTM). Both classifications and probabilistic predictions were made using random forest (RF) machine learning (ML) and training data derived from ground plots provided by the West Virginia Natural Heritage Program (WVNHP). The primary goal of this study is to explore the use of globally consistent ARD data for operational forest type mapping over a …


Deeply Learning Deep Inelastic Scattering Kinematics, Markus Diefenthaler, Abdullah Farhat, Andrii Verbytskyi, Yuesheng Xu Jan 2022


Mathematics & Statistics Faculty Publications

We study the use of deep learning techniques to reconstruct the kinematics of the neutral current deep inelastic scattering (DIS) process in electron–proton collisions. In particular, we use simulated data from the ZEUS experiment at the HERA accelerator facility, and train deep neural networks to reconstruct the kinematic variables Q² and x. Our approach is based on the information used in the classical construction methods, the measurements of the scattered lepton, and the hadronic final state in the detector, but is enhanced through correlations and patterns revealed with the simulated data sets. We show that, with the appropriate selection …


Data-Driven Methods For Low-Energy Nuclear Theory, Jordan M.R. Fox Jan 2022


CGU Theses & Dissertations

The term data-driven describes computational methods for numerical problem solving which have been developed by the field of data science; these are at the intersection of computer science, mathematics, and statistics. When applied to a domain science like nuclear physics, especially with the goal of deepening scientific insight, data-driven methods form a core pillar of the computational science endeavor. In this dissertation I explore two problems related to theoretical nuclear physics: one in the framework of numerical statistics, and the other in the framework of machine learning. I) Historically our understanding of the structure of the atomic nucleus, the quantum many-body problem, has been …


On Performance Optimization And Prediction Of Parallel Computing Frameworks In Big Data Systems, Haifa Alquwaiee Dec 2021


Dissertations

A wide spectrum of big data applications in science, engineering, and industry generate large datasets, which must be managed and processed in a timely and reliable manner for knowledge discovery. These tasks are now commonly executed in big data computing systems exemplified by Hadoop based on parallel processing and distributed storage and management. For example, many companies and research institutions have developed and deployed big data systems on top of NoSQL databases such as HBase and MongoDB, and parallel computing frameworks such as MapReduce and Spark, to ensure timely data analyses and efficient result delivery for decision making and business …


Parameter Estimation And Inference Of Spatial Autoregressive Model By Stochastic Gradient Descent, Gan Luan Dec 2021


Dissertations

Stochastic gradient descent (SGD) is a popular iterative method for model parameter estimation in large-scale data and online learning settings since it goes through the data in only one pass. While SGD has been well studied for independent data, its application to spatially-correlated data largely remains unexplored. This dissertation develops SGD-based parameter estimation and statistical inference algorithms for the spatial autoregressive (SAR) model, a common model for spatial lattice data.

This research contains three parts. (I) The first part concerns SGD estimation and inference for the SAR mean regression model. A new SGD algorithm based on maximum likelihood estimator (MLE) …
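The one-pass property mentioned in this abstract can be illustrated with a minimal sketch. It assumes independent observations and an ordinary least-squares linear model, not the SAR model or the algorithms developed in the dissertation; the function names, the step-size schedule `lr0 / sqrt(t)`, and the synthetic data are all hypothetical choices made for illustration.

```python
import random

def one_pass_sgd(stream, dim, lr0=0.5):
    """Single-pass SGD for least-squares linear regression.

    Each observation (x, y) is visited exactly once, so memory use
    does not grow with the number of observations.
    """
    theta = [0.0] * dim
    for t, (x, y) in enumerate(stream, start=1):
        pred = sum(w * xi for w, xi in zip(theta, x))
        err = pred - y                # gradient of 0.5 * err**2 is err * x
        lr = lr0 / (t ** 0.5)         # decaying step size (assumed schedule)
        theta = [w - lr * err * xi for w, xi in zip(theta, x)]
    return theta

def make_stream(n, true_theta, noise=0.1, seed=0):
    """Yield n synthetic (x, y) pairs; x[0] is a constant intercept term."""
    rng = random.Random(seed)
    for _ in range(n):
        x = [1.0] + [rng.uniform(-1, 1) for _ in true_theta[1:]]
        y = sum(w * xi for w, xi in zip(true_theta, x)) + rng.gauss(0, noise)
        yield x, y

true_theta = [2.0, -1.0, 0.5]
estimate = one_pass_sgd(make_stream(20000, true_theta), dim=3)
print(estimate)
```

Because the data arrive as a generator, the estimator never stores the full dataset, which is the practical appeal of SGD in large-scale and online settings.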


Machine Learning And Computer Vision In Solar Physics, Haodi Jiang Dec 2021


Dissertations

In recent decades, the difficult task of understanding and predicting violent solar eruptions and their terrestrial impacts has become a strategic national priority, as it affects many aspects of human life, including communication, transportation, the power grid, national defense, space travel, and more. This dissertation explores new machine learning and computer vision techniques to tackle this difficult task. Specifically, the dissertation addresses four interrelated problems in solar physics: magnetic flux tracking, fibril tracing, Stokes inversion, and vector magnetogram generation.

First, the dissertation presents a new deep learning method, named SolarUnet, to identify and track solar magnetic flux elements in …


Long Term Predictive Modeling On Big Spatio-Temporal Data, Yong Zhuang Dec 2021


Graduate Doctoral Dissertations

In the era of massive data, one of the most promising research fields involves the analysis of large-scale Spatio-temporal databases to discover exciting and previously unknown but potentially useful patterns from data collected over time and space. A modeling process in this domain must take temporal and spatial correlations into account, but with the dimensionality of the time and space measurements increasing, the number of elements potentially contributing to a target sharply grows, making the target's long-term behavior highly complex, chaotic, highly dynamic, and hard to predict. Therefore, two different considerations are taken into account in this work: one is …


A Hybrid Machine Learning Framework For Predicting Students’ Performance In Virtual Learning Environment, Edmund Evangelista Dec 2021


All Works

Virtual Learning Environments (VLE), such as Moodle and Blackboard, store vast data to help identify students' performance and engagement. As a result, researchers have been focusing their efforts on assisting educational institutions in providing machine learning models to predict at-risk students and improve their performance. However, it requires an efficient approach to construct a model that can ultimately provide accurate predictions. Consequently, this study proposes a hybrid machine learning framework to predict students' performance using eight classification algorithms and three ensemble methods (Bagging, Boosting, Voting) to determine the best-performing predictive model. In addition, this study used filter-based and wrapper-based feature …
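Of the three ensemble methods named in this abstract, voting is the simplest to sketch. The example below shows hard (majority) voting over the label predictions of several classifiers. It is an illustration only, not the study's framework; the function name, the labels, and the three prediction lists are hypothetical.

```python
from collections import Counter

def majority_vote(model_predictions):
    """Combine per-model label lists by hard (majority) voting.

    model_predictions: list of lists, one list of labels per model,
    all of the same length (one label per student).
    """
    n_samples = len(model_predictions[0])
    combined = []
    for i in range(n_samples):
        votes = Counter(preds[i] for preds in model_predictions)
        # most_common(1) returns the label with the highest vote count
        combined.append(votes.most_common(1)[0][0])
    return combined

# Hypothetical predictions from three classifiers for four students
preds_a = ["pass", "fail", "pass", "fail"]
preds_b = ["pass", "pass", "pass", "fail"]
preds_c = ["fail", "fail", "pass", "fail"]
ensemble = majority_vote([preds_a, preds_b, preds_c])
print(ensemble)
```

Using an odd number of models for a binary label, as here, avoids ties; with an even number a tie-breaking rule would need to be chosen explicitly.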


Aspect-Based Sentiment Analysis Of Movie Reviews, Samuel Onalaja, Eric Romero, Bosang Yun Dec 2021


SMU Data Science Review

This study compares classification models used to determine the sentiment of aspect-separated text and to predict the binary sentiment of movie reviews, with genre- and aspect-specific driving factors. To gain a broader classification analysis, five machine and deep learning algorithms were compared: Logistic Regression (LR), Naive Bayes (NB), Support Vector Machine (SVM), and Recurrent Neural Network Long-Short-Term Memory (RNN LSTM). The various movie aspects that are utilized to separate the sentences are determined through aggregating aspect words from lexicon-based, supervised and unsupervised learning. The driving factors are randomly assigned to various movie aspects and their impact tied to …


Clinical Diagnosis Support With Convolutional Neural Network By Transfer Learning, Spencer Fogleman, Jeremy Otsap, Sangrae Cho Dec 2021


SMU Data Science Review

Breast cancer is prevalent among women in the United States. Breast cancer screening is standard but requires a radiologist to review screening images to make a diagnosis. Diagnosis through the traditional screening method of mammography currently has an accuracy of about 78% for women of all ages and demographics. A more recent and precise technique called Digital Breast Tomosynthesis (DBT) has been shown to be more promising but is less well studied. A machine learning model trained on DBT images has the potential to increase the success of identifying breast cancer and reduce the time it takes to diagnose a patient, …


Machine Learning And Radiomic Features To Predict Overall Survival Time For Glioblastoma Patients, Lina Chato, Shahram Latifi Dec 2021


Electrical & Computer Engineering Faculty Research

Glioblastoma is an aggressive brain tumor with a low survival rate. Understanding tumor behavior by predicting prognosis outcomes is a crucial factor in deciding a proper treatment plan. In this paper, an automatic overall survival time prediction system (OST) for glioblastoma patients is developed on the basis of radiomic features and machine learning (ML). This system is designed to predict prognosis outcomes by classifying a glioblastoma patient into one of three survival groups: short-term, mid-term, and long-term. To develop the prediction system, a medical dataset based on imaging information from magnetic resonance imaging (MRI) and non-imaging information is used. A …


Quantification Of Mineral Reactivity Using Machine Learning Interpretation Of Micro-XRF Data, Julie J. Kim, Florence Ling, Dan A. Plattenberger, Andres F. Clarens, Catherine A. Peters Dec 2021

Environmental Science Faculty Work

Accurate characterizations of mineral reactivity require mapping of spatial heterogeneity, and quantifications of mineral abundances, elemental content, and mineral accessibility. Reactive transport models require such information at the grain-scale to accurately simulate coupled processes of mineral reactions, aqueous solution speciation, and mass transport. In this work, millimeter-scale mineral maps are generated using a neural network approach for 2D mineral mapping based on synchrotron micro x-ray fluorescence (μXRF) data. The approach is called Synchrotron-based Machine learning Approach for RasTer (SMART) mapping, which reads μXRF scans and provides mineral maps of the same size and resolution. The SMART mineral classifier is trained …


Machine Learning For Stock Prediction Based On Fundamental Analysis, Yuxuan Huang, Luiz Fernando Capretz, Danny Ho Dec 2021


Electrical and Computer Engineering Publications

The application of machine learning to stock prediction has attracted a lot of attention in recent years. A large amount of research has been conducted in this area, and multiple existing results have shown that machine learning methods can be successfully used for stock prediction using stocks’ historical data. Most of these existing approaches have focused on short-term prediction using stocks’ historical price and technical indicators. In this paper, we prepared 22 years’ worth of stock quarterly financial data and investigated three machine learning algorithms: Feed-forward Neural Network (FNN), Random Forest (RF) and Adaptive Neural Fuzzy Inference System (ANFIS) for …


Intelligent Resource Prediction For HPC And Scientific Workflows, Benjamin Shealy Dec 2021

All Dissertations

Scientific workflows and high-performance computing (HPC) platforms are critically important to modern scientific research. In order to perform scientific experiments at scale, domain scientists must have knowledge and expertise in software and hardware systems that are highly complex and rapidly evolving. While computational expertise will be essential for domain scientists going forward, any tools or practices that reduce this burden for domain scientists will greatly increase the rate of scientific discoveries. One challenge that exists for domain scientists today is knowing the resource usage patterns of an application for the purpose of resource provisioning. A tool that accurately estimates these …


Comparing Machine Learning Techniques With State-Of-The-Art Parametric Prediction Models For Predicting Soybean Traits, Susweta Ray Dec 2021


Department of Statistics: Dissertations, Theses, and Student Work

Soybean is a significant source of protein and oil, and is also widely used as animal feed. Thus, developing lines that are superior in terms of yield, protein and oil content is important to feed the ever-growing population. In contrast to high-cost phenotyping, genotyping is both cost- and time-efficient for breeders, whereas evaluating new lines in different environments (location-year combinations) can be costly. Several genomic prediction (GP) methods have been developed to use the marker and environment data effectively to predict the yield or other relevant phenotypic traits of crops. Our study compares a conventional GP method (GBLUP), a …


Translation Of Array-Based Loop Programs To Optimized SQL-Based Distributed Programs, Md Hasanuzzaman Noor Dec 2021

Computer Science and Engineering Dissertations

Most programs written to operate on data are expressed in terms of array operations in sequential loops. However, these programs do not scale to the large amounts of data generated by scientific experiments and by industrial and commercial markets. Given the success of machine learning algorithms on large amounts of data and the recent shift of industries to data-driven decision making, data scientists who are not familiar with Big Data frameworks have to rewrite sequential programs as distributed data-parallel programs by hand. We present a novel framework, called SQLgen, that automatically translates sequential loops to distributed data-parallel programs. SQLgen …


On Predicting Omnidirectional Honey Bee Traffic Using Weather And Electromagnetic Radiation, Daniel G. Hornberger Dec 2021


All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Honey bees are responsible for pollinating many important crops in the United States. However, honey bee populations have declined significantly since 1961. While some causes of this decline are known, others are not. By utilizing electronic bee hive monitoring (EBM) systems, bee keepers and researchers have an added resource in determining the causes of these declines so that the issues can be remedied. For nearly five months (May through October) during the 2020 honey bee foraging season in Logan, Utah, USA, we collected on-site weather and electromagnetic radiation (EMR) readings and videos of the hive entrances of six bee hives …


Regulating New Tech: Problems, Pathways, And People, Cary Coglianese Dec 2021


All Faculty Scholarship

New technologies bring with them many promises, but also a series of new problems. Even though these problems are new, they are not unlike the types of problems that regulators have long addressed in other contexts. The lessons from regulation in the past can thus guide regulatory efforts today. Regulators must focus on understanding the problems they seek to address and the causal pathways that lead to these problems. Then they must undertake efforts to shape the behavior of those in industry so that private sector managers focus on their technologies’ problems and take actions to interrupt the causal pathways. …


Estimation And Interpretation Of Machine Learning Models With Customized Surrogate Model, Mudabbir Ali, Asad Masood Khattak, Zain Ali, Bashir Hayat, Muhammad Idrees, Zeeshan Pervez, Kashif Rizwan, Tae Eung Sung, Ki Il Kim Dec 2021


All Works

Machine learning has the potential to predict unseen data and thus improve the productivity and processes of daily life activities. Notwithstanding its adaptiveness, several sensitive applications based on such technology cannot compromise our trust in them; thus, highly accurate machine learning models require reason. Such models are black boxes for end-users. Therefore, the concept of interpretability plays the role of assisting users in a couple of ways. Interpretable models are models that possess the quality of explaining predictions. Different strategies have been proposed for the aforementioned concept but some of these require an excessive amount of effort, lack generalization, are …


Determining States Of Movement In Humans Using Minimally Processed EEG Signals And Various Classification Methods, Maurice Barnett Dec 2021

All Theses

Electroencephalography (EEG) is a non-invasive technique used in both clinical and research settings to record neuronal signaling in the brain. The location of an EEG signal as well as the frequencies at which its neuronal constituents fire correlate with behavioral tasks, including discrete states of motor activity. Due to the number of channels and fine temporal resolution of EEG, a dense, high-dimensional dataset is collected. Transcranial direct current stimulation (tDCS) is a treatment that has been suggested to improve motor functions of Parkinson’s disease and chronic stroke patients when stimulation occurs during a motor task. tDCS is commonly administered without …


Quantum State Estimation And Tracking For Superconducting Processors Using Machine Learning, Shiva Lotfallahzadeh Barzili Dec 2021


Computational and Data Sciences (PhD) Dissertations

Quantum technology has been rapidly growing; in particular, the experiments that have been performed with superconducting qubits and circuit QED have allowed us to explore the light-matter interaction at its most fundamental level. The study of coherent dynamics between two-level systems and resonator modes can provide insight into fundamental aspects of quantum physics, such as how the state of a system evolves while being continuously observed. To study such an evolving quantum system, experimenters need to verify the accuracy of state preparation and control since quantum systems are very fragile and sensitive to environmental disturbance. In this thesis, I look …


Adaptive Human-Robot Motion Transfer For Complete Body Imitation, Francisco Villa Dec 2021


Computer Science and Engineering Theses

Programming robot systems to perform certain tasks is a big challenge especially if such programming is to be performed by persons who are not experts in robotics. For example, when programming a robot to serve as an exercise trainer, the person defining the motions might more naturally be a person in the exercise domain rather than a robotics expert. To address this, this thesis investigates programming by demonstration or teleoperation using full direct body motion. The goal is to reproduce gaits, gestures, and postures on a humanoid robot from observed human demonstrations. Fine motor movements such as movement of fingers …


The Potential Of Remotely Sensed Vegetation Indices For Monitoring Pasture Condition, Pouria Ramzi, Karen Holmes Dec 2021


Resource management technical reports

The Department of Primary Industries and Regional Development (DPIRD) is developing an integrated monitoring system using remote sensing and on-ground measurements to track pasture condition across Western Australia’s pastoral region. We extended and adapted the methods developed in the Pastoral Lease Assessment Using Geospatial Analysis (PLAGA) project (Robinson et al. 2012), which combined remotely sensed vegetation indices (VIs) with on-ground pasture condition observations to assess the potential of using different vegetation indices in a statewide condition monitoring system.

There were 6 regions in WA’s pastoral rangelands with DPIRD on-ground condition traverse points: Kimberley and Broome, Pilbara, Yalgoo and Sandstone, Goldfields, …
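As a rough illustration of what a remotely sensed vegetation index is, the sketch below computes the Normalized Difference Vegetation Index (NDVI) from near-infrared and red reflectance. The abstract does not say which indices the report evaluated, so NDVI is an assumed example chosen because it is widely used; the function name and reflectance values are hypothetical.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index: (NIR - Red) / (NIR + Red).

    nir, red: surface reflectance values in the near-infrared and red bands.
    Returns None when both bands are zero (no usable signal).
    """
    denom = nir + red
    if denom == 0:
        return None
    return (nir - red) / denom

# Hypothetical reflectances: dense green vegetation vs. sparse cover
dense = ndvi(nir=0.5, red=0.1)
sparse = ndvi(nir=0.25, red=0.2)
print(dense, sparse)
```

Healthy vegetation reflects strongly in the near-infrared and absorbs red light, so higher values indicate greener cover. A monitoring system would compare such values against on-ground condition observations.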


Factors Influencing Intent To Take A COVID-19 Test In The United States, Sheila Rutto Dec 2021

Theses and Dissertations

In 2020, COVID-19 became the first pandemic in the world’s history that brought the entire world to an abrupt and unexpected halt. Since the first reported case of the disease to date, the novel coronavirus has been able to wreak havoc in literally every corner of the globe and left an ever-growing number of unprecedented fatalities. The normal way of life has been disrupted, and the level of uncertainty about the end of this pandemic continues to manifest to many. Due to the urgency to bring this pandemic under control, medical officers have been able to recommend actions that people …


Modeling Of Groundwater Potential Using Cloud Computing Platform: A Case Study From Nineveh Plain, Northern Iraq, Ali Za. Al-Ozeer, Alaa M. Al-Abadi, Tariq Abed Hussain, Alan E. Fryar, Biswajeet Pradhan, Abdullah Alamri, Khairul Nizam Abdul Maulud Nov 2021


Earth and Environmental Sciences Faculty Publications

Knowledge of the groundwater potential, especially in an arid region, can play a major role in planning the sustainable management of groundwater resources. In this study, nine machine learning (ML) algorithms—namely, Artificial Neural Network (ANN), Decision Jungle (DJ), Averaged Perceptron (AP), Bayes Point Machine (BPM), Decision Forest (DF), Locally-Deep Support Vector Machine (LD-SVM), Boosted Decision Tree (BDT), Logistic Regression (LG), and Support Vector Machine (SVM)—were run on the Microsoft Azure cloud computing platform to model the groundwater potential. We investigated the relationship between 512 operating boreholes with a specified specific capacity and 14 groundwater-influencing occurrence factors. The unconfined aquifer in …


Deep Learning For Multiclass Classification, Predictive Modeling And Segmentation Of Disease Prone Regions In Alzheimer’s Disease, Maryamossadat Aghili Nov 2021

FIU Electronic Theses and Dissertations

One of the challenges facing accurate diagnosis and prognosis of Alzheimer’s Disease (AD) is identifying the subtle changes that define the early onset of the disease. This dissertation investigates three of the main challenges confronted when such subtle changes are to be identified in the most meaningful way. These are (1) the missing data challenge, (2) longitudinal modeling of disease progression, and (3) the segmentation and volumetric calculation of disease-prone brain areas in medical images. The scarcity of sufficient data compounded by the missing data challenge in many longitudinal samples exacerbates the problem as we seek statistical meaningfulness in multiclass …


Experimental Analysis Of GBM To Expand The Time Horizon Of Irish Electricity Price Forecasts, Conor Lynch, Christian O'Leary, Preetham Goving Kolar Sundareshan, Yavuz Akin Nov 2021

NIMBUS Articles

In response to the inherent challenges of generating cost-effective electricity consumption schedules for dynamic systems, this paper espouses the use of GBM or Gradient Boosting Machine-based models for electricity price forecasting. These models are applied to data streams from the Irish electricity market and achieve favorable results, relative to the current state-of-the-art. Presently, electricity prices are published 10 h in advance of the trade day of interest. Using the forecasting methodology outlined in this paper, an estimation of these prices can be made available one day in advance of the official price publication, thus extending the time available to plan …


From MDP To AlphaZero, David Robert Sewell Nov 2021

Dissertations and Theses

In this paper I will explain the AlphaGo family of algorithms starting from first principles and requiring little previous knowledge from the reader. The focus will be upon one of the more recent versions, AlphaZero, but I hope to explain the core principles that allowed these algorithms to be so successful. I will generally refer to AlphaZero as this core set of principles and will make it clear when I am referring to a specific algorithm of the AlphaGo family. AlphaZero in short combines Monte Carlo Tree Search (MCTS) with deep learning and self-play. We will see how these …
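One place where tree search and a learned network meet is the rule for choosing which move to explore next. The sketch below shows a PUCT-style selection score as it is commonly described for AlphaZero-type search; it is not taken from this thesis. The function name, the exploration constant `c_puct=1.5`, and the example statistics are assumptions for illustration.

```python
import math

def puct_select(priors, visit_counts, q_values, c_puct=1.5):
    """Pick the action maximizing Q(a) + U(a), where
    U(a) = c_puct * P(a) * sqrt(total visits) / (1 + N(a)).

    priors:       action -> prior probability from the policy network
    visit_counts: action -> number of times the action was explored
    q_values:     action -> mean value estimate from earlier simulations
    """
    total_visits = sum(visit_counts.values())

    def score(action):
        exploration = (c_puct * priors[action] * math.sqrt(total_visits)
                       / (1 + visit_counts[action]))
        return q_values[action] + exploration

    return max(priors, key=score)

# Hypothetical node statistics: "b" is unvisited, so its exploration bonus wins
choice = puct_select(
    priors={"a": 0.6, "b": 0.4},
    visit_counts={"a": 10, "b": 0},
    q_values={"a": 0.1, "b": 0.0},
)
print(choice)
```

The exploration term shrinks as an action accumulates visits, so the search gradually shifts from following the network's prior to following the measured values.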


Improving Accurate Candidates For Missing Data Using Benefit Performance Of (ML-SOM), Abeer Abdullah Al-Mohdar, Mohamed Abdullah Bamatraf Nov 2021

Hadhramout University Journal of Natural & Applied Sciences

Missing data is one of the major challenges in extracting and analyzing knowledge from datasets. Training quality is affected by the presence of missing data in a dataset. For this reason, there is a need for a quick and reliable method to find possible solutions in order to provide an accurate system. Previous studies have demonstrated the robust ability of the Self Organizing Map (SOM) algorithm to deal with missing values [6, 20]. However, it has a drawback: the error rate (ERR) on the missing values, which increases with huge datasets. This study is mainly based on …