Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Computer Sciences (1330)
- Artificial Intelligence and Robotics (516)
- Engineering (356)
- Computer Engineering (169)
- Data Science (148)
-
- Social and Behavioral Sciences (143)
- Electrical and Computer Engineering (139)
- Statistics and Probability (129)
- Medicine and Health Sciences (117)
- Life Sciences (102)
- Databases and Information Systems (100)
- Earth Sciences (79)
- Theory and Algorithms (74)
- Mathematics (72)
- Physics (70)
- Environmental Sciences (69)
- Information Security (69)
- Numerical Analysis and Scientific Computing (69)
- Software Engineering (68)
- Other Computer Sciences (64)
- Business (58)
- Applied Mathematics (51)
- Arts and Humanities (45)
- Education (40)
- Medical Specialties (36)
- Chemistry (34)
- Applied Statistics (32)
- Operations Research, Systems Engineering and Industrial Engineering (32)
- Oceanography and Atmospheric Sciences and Meteorology (30)
- Institution
-
- Old Dominion University (115)
- Singapore Management University (105)
- Brigham Young University (74)
- Air Force Institute of Technology (66)
- TÜBİTAK (61)
-
- Zayed University (48)
- University of Texas at Arlington (44)
- New Jersey Institute of Technology (42)
- Technological University Dublin (40)
- Portland State University (38)
- University of Nebraska - Lincoln (38)
- Edith Cowan University (30)
- Western University (30)
- Chapman University (27)
- San Jose State University (26)
- City University of New York (CUNY) (25)
- University of Kentucky (25)
- University of South Florida (24)
- Boise State University (21)
- Utah State University (21)
- Louisiana State University (19)
- University of Texas Rio Grande Valley (19)
- University at Albany, State University of New York (18)
- University of Louisville (18)
- Wright State University (18)
- Southern Methodist University (17)
- University of Nevada, Las Vegas (17)
- University of Tennessee, Knoxville (17)
- California Polytechnic State University, San Luis Obispo (16)
- Dartmouth College (16)
- Publication Year
- Publication
-
- Theses and Dissertations (152)
- Research Collection School Of Computing and Information Systems (86)
- Electronic Theses and Dissertations (65)
- Turkish Journal of Electrical Engineering and Computer Sciences (60)
- Dissertations (51)
-
- All Works (48)
- Faculty Publications (40)
- Computer Science and Engineering Dissertations (24)
- Electrical & Computer Engineering Faculty Publications (24)
- Electronic Thesis and Dissertation Repository (23)
- Dissertations and Theses (22)
- Master's Projects (21)
- Conference papers (20)
- Doctoral Dissertations (20)
- Computer Science Faculty Publications (19)
- Computer Science and Engineering Theses (19)
- Legacy Theses & Dissertations (2009 - 2024) (18)
- Articles (17)
- USF Tampa Graduate Theses and Dissertations (17)
- Master's Theses (16)
- Browse all Theses and Dissertations (15)
- Research outputs 2022 to 2026 (15)
- SMU Data Science Review (15)
- Boise State University Theses and Dissertations (14)
- Dissertations, Theses, and Capstone Projects (13)
- LSU Doctoral Dissertations (13)
- Mathematics, Physics, and Computer Science Faculty Articles and Research (13)
- CCE Theses and Dissertations (12)
- Honors Theses (12)
- Journal of System Simulation (12)
- Publication Type
Articles 1141 - 1170 of 1687
Full-Text Articles in Physical Sciences and Mathematics
Stock Market Analysis: A Review And Taxonomy Of Prediction Techniques, Dev Shah, Haruna Isah, Farhana Zulkernine
Stock Market Analysis: A Review And Taxonomy Of Prediction Techniques, Dev Shah, Haruna Isah, Farhana Zulkernine
Publications and Scholarship
Stock market prediction has always caught the attention of many analysts and researchers. Popular theories suggest that stock markets are essentially a random walk and it is a fool’s game to try and predict them. Predicting stock prices is a challenging problem in itself because of the number of variables which are involved. In the short term, the market behaves like a voting machine but in the longer term, it acts like a weighing machine and hence there is scope for predicting the market movements for a longer timeframe. Application of machine learning techniques and other algorithms for stock price …
Using Computer Vision To Quantify Coral Reef Biodiversity, Niket Bhodia
Using Computer Vision To Quantify Coral Reef Biodiversity, Niket Bhodia
Master's Projects
The preservation of the world’s oceans is crucial to human survival on this planet, yet we know too little to begin to understand anthropogenic impacts on marine life. This is especially true for coral reefs, which are the most diverse marine habitat per unit area (if not overall) as well as the most sensitive. To address this gap in knowledge, simple field devices called autonomous reef monitoring structures (ARMS) have been developed, which provide standardized samples of life from these complex ecosystems. ARMS have now become successful to the point that the amount of data collected through them has outstripped …
Deep Learning On Graphs Using Graph Convolutional Networks, Saurabh Mithe
Deep Learning On Graphs Using Graph Convolutional Networks, Saurabh Mithe
Master's Projects
Graphs are a powerful way to model network data with the objects as nodes and the relationship between the various objects as links. Such graphs contain a plethora of valuable information about the underlying data which can be extracted, analyzed, and visualized using Machine Learning (ML). The challenge to this task is that graphs are non-Euclidean structures which means that they cannot be directly used with ML techniques because ML techniques only work with Euclidean structures like grids or sequences. In order to overcome this challenge, the graph structure first needs to be encoded into an equivalent Euclidean representation in …
Intelligent Log Analysis For Anomaly Detection, Steven Yen
Intelligent Log Analysis For Anomaly Detection, Steven Yen
Master's Projects
Computer logs are a rich source of information that can be analyzed to detect various issues. The large volumes of logs limit the effectiveness of manual approaches to log analysis. The earliest automated log analysis tools take a rule-based approach, which can only detect known issues with existing rules. On the other hand, anomaly detection approaches can detect new or unknown issues. This is achieved by looking for unusual behavior different from the norm, often utilizing machine learning (ML) or deep learning (DL) models. In this project, we evaluated various ML and DL techniques used for log anomaly detection. We …
Multifamily Malware Models, Samanvitha Basole
Multifamily Malware Models, Samanvitha Basole
Master's Projects
When training a machine learning model, there is likely to be a tradeoff between the accuracy of the model and the generality of the dataset. Previous research has shown that if we train a model to detect one specific malware family, we obtain stronger results as compared to a case where we train a single model on multiple diverse families. During the detection phase, it would be more efficient to have a single model that could detect multiple families, rather than having to score each sample against multiple models. In this research, we conduct experiments to quantify the relationship between …
Sensor - Based Human Activity Recognition Using Smartphones, Mustafa Badshah
Sensor - Based Human Activity Recognition Using Smartphones, Mustafa Badshah
Master's Projects
It is a significant technical and computational task to provide precise information regarding the activity performed by a human and find patterns of their behavior. Countless applications can be molded and various problems in domains of virtual reality, health and medical, entertainment and security can be solved with advancements in human activity recognition (HAR) systems. HAR is an active field for research for more than a decade, but certain aspects need to be addressed to improve the system and revolutionize the way humans interact with smartphones. This research provides a holistic view of human activity recognition system architecture and discusses …
Stock Market Prediction Using Ensemble Of Graph Theory, Machine Learning And Deep Learning Models, Pratik Patil
Stock Market Prediction Using Ensemble Of Graph Theory, Machine Learning And Deep Learning Models, Pratik Patil
Master's Projects
Efficient Market Hypothesis (EMH) is the cornerstone of the modern financial theory and it states that it is impossible to predict the price of any stock using any trend, fundamental or technical analysis. Stock trading is one of the most important activities in the world of finance. Stock price prediction has been an age-old problem and many researchers from academia and business have tried to solve it using many techniques ranging from basic statistics to machine learning using relevant information such as news sentiment and historical prices. Even though some studies claim to get prediction accuracy higher than a random …
Machine Learning Versus Deep Learning For Malware Detection, Parth Jain
Machine Learning Versus Deep Learning For Malware Detection, Parth Jain
Master's Projects
It is often claimed that the primary advantage of deep learning is that such models can continue to learn as more data is available, provided that sufficient computing power is available for training. In contrast, for other forms of machine learning it is claimed that models ‘‘saturate,’’ in the sense that no additional learning can occur beyond some point, regardless of the amount of data or computing power available. In this research, we compare the accuracy of deep learning to other forms of machine learning for malware detection, as a function of the training dataset size. We experiment with a …
Smart Home Simulation In The Virtual World, Thomas Jones-Moore, David Son
Smart Home Simulation In The Virtual World, Thomas Jones-Moore, David Son
Scholars Week
The goal of this project is to produce a 'smart home' by using IoT and RFID like things in the virtual world to help solve problems. Some of these problems can be CPR training, etc. Used as an evaluation platform of suggested hardware to get a desired (or best fit) set of smart objects, or combinations with computer vision. Cost model to determine best fit based on: accuracy, lowest cost, easiest deployment, etc.
Magnetic Borophenes From An Evolutionary Search, Meng-Hong Zhu, Xiao-Ji Weng, Guoying Gao, Shuai Dong, Ling-Fang Lin, Wei-Hua Wang, Qiang Zhu, Artem R. Oganov, Xiao Dong, Yongjun Tian, Xiang-Feng Zhou, Hui-Tian Wang
Magnetic Borophenes From An Evolutionary Search, Meng-Hong Zhu, Xiao-Ji Weng, Guoying Gao, Shuai Dong, Ling-Fang Lin, Wei-Hua Wang, Qiang Zhu, Artem R. Oganov, Xiao Dong, Yongjun Tian, Xiang-Feng Zhou, Hui-Tian Wang
Physics & Astronomy Faculty Research
A computational methodology based on ab initio evolutionary algorithms and spin-polarized density functional theory was developed to predict two-dimensional magnetic materials. Its application to a model system borophene reveals an unexpected rich magnetism and polymorphism. A metastable borophene with nonzero thickness is an antiferromagnetic semiconductor from first-principles calculations, and can be further tuned into a half-metal by finite electron doping. In this borophene, the buckling and coupling among three atomic layers are not only responsible for magnetism, but also result in an out-of-plane negative Poisson's ratio under uniaxial tension, making it the first elemental material possessing auxetic and magnetic properties …
Depressiongnn: Depression Prediction Using Graph Neural Network On Smartphone And Wearable Sensors, Param Bidja
Depressiongnn: Depression Prediction Using Graph Neural Network On Smartphone And Wearable Sensors, Param Bidja
Honors Scholar Theses
Depression prediction is a complicated classification problem because depression diagnosis involves many different social, physical, and mental signals. Traditional classification algorithms can only reach an accuracy of no more than 70% given the complexities of depression. However, a novel approach using Graph Neural Networks (GNN) can be used to reach over 80% accuracy, if a graph can represent the depression data set to capture differentiating features. Building such a graph requires 1) the definition of node features, which must be highly correlated with depression, and 2) the definition for edge metrics, which must also be highly correlated with depression. In …
Exploring And Expanding The One-Pixel Attack, Umairullah Khan, Walt Woods, Christof Teuscher
Exploring And Expanding The One-Pixel Attack, Umairullah Khan, Walt Woods, Christof Teuscher
Student Research Symposium
In machine learning research, adversarial examples are normal inputs to a classifier that have been specifically perturbed to cause the model to misclassify the input. These perturbations rarely affect the human readability of an input, even though the model’s output is drastically different. Recent work has demonstrated that image-classifying deep neural networks (DNNs) can be reliably fooled with the modification of a single pixel in the input image, without knowledge of a DNN’s internal parameters. This “one-pixel attack” utilizes an iterative evolutionary optimizer known as differential evolution (DE) to find the most effective pixel to perturb, via the evaluation of …
Every Data Point Counts: Political Elections In The Age Of Digital Analytics, Julian Kehle, Samir Naimi
Every Data Point Counts: Political Elections In The Age Of Digital Analytics, Julian Kehle, Samir Naimi
Honors Thesis
Synthesizing the investigative research and cautionary messages from experts in the fields of technology, political science, and behavioral science, this project explores the ways in which digital analytics has begun to influence the American political arena. Historically, political parties have constructed systems to target voters and win elections. However, rapid changes in the field of technology (such as big data, artificial intelligence, and the prevalence of social media) threaten to undermine the integrity of elections themselves. Future political campaigns will utilize profiling to micro-target individuals in order to manipulate and persuade them with hyper-personalized political content. Most dangerously, the average …
Predicting Hospital Length Of Stay In Intensive Care Unit, Namita Singh
Predicting Hospital Length Of Stay In Intensive Care Unit, Namita Singh
Theses and Dissertations
In this thesis, we investigate the performance of a series of classification methods for the
Prediction of the hospital Length of Stay (LoS) in Intensive Care Unit (ICU). Predicting
LOS for an inpatient in an hospital is a challenging task but is essential for the operational
success of a hospital. Since hospitals are faced with severely limited resources including
beds to hold admitted patients, prediction of LoS will assist the hospital staff for better
planning and management of hospital resources. The goal of this project is to create a
machine learning model that predicts the length-of stay for each patient …
Watersheds For Semi-Supervised Classification, Aditya Challa, Sravan Danda, B. S.Daya Sagar, Laurent Najman
Watersheds For Semi-Supervised Classification, Aditya Challa, Sravan Danda, B. S.Daya Sagar, Laurent Najman
Journal Articles
Watershed technique from mathematical morphology (MM) is one of the most widely used operators for image segmentation. Recently watersheds are adapted to edge weighted graphs, allowing for wider applicability. However, a few questions remain to be answered - How do the boundaries of the watershed operator behave? Which loss function does the watershed operator optimize? How does watershed operator relate with existing ideas from machine learning. In this letter, a framework is developed, which allows one to answer these questions. This is achieved by generalizing the maximum margin principle to maximum margin partition and proposing a generic solution, morphMedian, resulting …
Distilling Managerial Insights And Lessons From Ai Projects At Singapore's Changi Airport (Part 2), Steve Lee, Steven M. Miller
Distilling Managerial Insights And Lessons From Ai Projects At Singapore's Changi Airport (Part 2), Steve Lee, Steven M. Miller
Asian Management Insights
Since 2017, Changi Airport group (CAG) has initiated a host of pilot projects that use connective and intelligent technologies to enable its move towards digital transformation and SMART Airport Vision. This has resulted in a first wave of deployment of AI and Machine Learning-enabled applications across various functions that can better sense, analyse, predict, and interact with people.
Clustering Of Multiple Instance Data., Andrew D. Karem
Clustering Of Multiple Instance Data., Andrew D. Karem
Electronic Theses and Dissertations
An emergent area of research in machine learning that aims to develop tools to analyze data where objects have multiple representations is Multiple Instance Learning (MIL). In MIL, each object is represented by a bag that includes a collection of feature vectors called instances. A bag is positive if it contains at least one positive instance, and negative if no instances are positive. One of the main objectives in MIL is to identify a region in the instance feature space with high correlation to instances from positive bags and low correlation to instances from negative bags -- this region is …
Learning Representations Using Reinforcement Learning, Sourabh Bose
Learning Representations Using Reinforcement Learning, Sourabh Bose
Computer Science and Engineering Dissertations
The framework of reinforcement learning is a powerful suite of algorithms that can learn generalized solutions to complex decision making problems. However, the applications of reinforcement learning algorithms to traditional machine learning problems such as clustering, classification and representation learning, have rarely been explored. With the advent of large amounts of data, robust models are required which can extract meaningful representations from the data that can potentially be applied to new unseen tasks. The presented work investigates the applications of reinforcement learning algorithms in the perspective of transfer learning by applying algorithms in the framework of reinforcement learning to address …
Ai Gets Real At Singapore's Changi Airport (Part 1), Steve Lee, Steven M. Miller
Ai Gets Real At Singapore's Changi Airport (Part 1), Steve Lee, Steven M. Miller
Asian Management Insights
Ranked as the best airport for seven consecutive years, Singapore’s Changi Airport is lauded the world over for the efficient, safe, pleasurable and seamless service it offers the millions of passengers that pass through its facilities annually. Much of Changi Airport’s success can be attributed to the organisation’s customer-oriented business focus and deeply embedded culture of service excellence, combined with a host of advanced technologies operating invisibly in the background. The framework for this technology enablement is Changi Airport Group’s (CAG’s) SMART Airport Vision—an enterprise-wide approach to connective technologies that leverages sensors, data fusion, data analytics, and artificial intelligence (AI), …
Deep Reinforcement Learning-Based Portfolio Management, Nitin Kanwar
Deep Reinforcement Learning-Based Portfolio Management, Nitin Kanwar
Computer Science and Engineering Theses
Machine Learning is at the forefront of every field today. The subfields of Machine Learning called Reinforcement Learning and Deep Learning, when combined have given rise to advanced algorithms which have been successful at reaching or surpassing the human-level performance at playing Atari games to defeating multiple times champion at Go. These successes of Machine Learning have attracted the interest of the financial community and have raised the question if these techniques could also be applied in detecting patterns in the financial markets. Until recently, mathematical formulations of dynamical systems in the context of Signal Processing and Control Theory have …
From Body To Brain: Using Artificial Intelligence To Identify User Skills & Intentions In Interactive Scenarios, Michalis Papakostas
From Body To Brain: Using Artificial Intelligence To Identify User Skills & Intentions In Interactive Scenarios, Michalis Papakostas
Computer Science and Engineering Dissertations
Artificial Intelligence has probably been the most rapidly evolving field of science during the last decade. Its numerous real-life applications have radically altered the way we experience daily-living with great impact in some of the most basic aspects of human lives including but not limited to health and well-being, communication and interaction, education, driving, daily, and entertainment. Human-Computer Interaction (HCI) is the field of Computer Science lying in the epicenter of this evolution and is responsible for transforming rudimentary research findings and theoretical principles into intuitive tools, responsible for enhancing human performance, increasing productivity and ensuring safety. Two of the …
Studying And Handling Iterated Algorithmic Biases In Human And Machine Learning Interaction., Wenlong Sun
Studying And Handling Iterated Algorithmic Biases In Human And Machine Learning Interaction., Wenlong Sun
Electronic Theses and Dissertations
Algorithmic bias consists of biased predictions born from ingesting unchecked information, such as biased samples and biased labels. Furthermore, the interaction between people and algorithms can exacerbate bias such that neither the human nor the algorithms receive unbiased data. Thus, algorithmic bias can be introduced not only before and after the machine learning process but sometimes also in the middle of the learning process. With a handful of exceptions, only a few categories of bias have been studied in Machine Learning, and there are few, if any, studies of the impact of bias on both human behavior and algorithm performance. …
Predicting Hospital Length Of Stay In Intensive Care Unit, Namita Singh
Predicting Hospital Length Of Stay In Intensive Care Unit, Namita Singh
Theses and Dissertations
In this thesis, we investigate the performance of a series of classification methods for the
Prediction of the hospital Length of Stay (LoS) in Intensive Care Unit (ICU). Predicting
LOS for an inpatient in an hospital is a challenging task but is essential for the operational
success of a hospital. Since hospitals are faced with severely limited resources including
beds to hold admitted patients, prediction of LoS will assist the hospital staff for better
planning and management of hospital resources. The goal of this project is to create a
machine learning model that predicts the length-of stay for each patient …
Seeing Eye To Eye: A Machine Learning Approach To Automated Saccade Analysis, Maigh Attre
Seeing Eye To Eye: A Machine Learning Approach To Automated Saccade Analysis, Maigh Attre
Honors Scholar Theses
Abnormal ocular motility is a common manifestation of many underlying pathologies particularly those that are neurological. Dynamics of saccades, when the eye rapidly changes its point of fixation, have been characterized for many neurological disorders including concussions, traumatic brain injuries (TBI), and Parkinson’s disease. However, widespread saccade analysis for diagnostic and research purposes requires the recognition of certain eye movement parameters. Key information such as velocity and duration must be determined from data based on a wide set of patients’ characteristics that may range in eye shapes and iris, hair and skin pigmentation [36]. Previous work on saccade analysis has …
Machine Learning Classification Of Primary Tissue Origin Of Cancer From Dna Methylation Markers, Sravani Gannavarapu Surya Naga
Machine Learning Classification Of Primary Tissue Origin Of Cancer From Dna Methylation Markers, Sravani Gannavarapu Surya Naga
UNLV Theses, Dissertations, Professional Papers, and Capstones
Cancer is one of the leading causes of death globally and was responsible for approximately 9.6 million deaths in 2018. One of the main reason for deaths from cancer is late-stage presentation and inaccessible diagnosis and treatment. Cancer often spreads from the part of the body where it started (primary site) to a different part of the body (metastatic site). Identifying the primary site of cancer plays a key role as it directs the appropriate treatment. Cancer which spreads needs the same treatment as its origin. Having this knowledge can help doctors to decide the type of treatment.
All cancers …
A Comparison Of Machine Learning Techniques For Taxonomic Classification Of Teeth From The Family Bovidae, Gregory J. Matthews, Juliet K. Brophy, Maxwell Luetkemeier, Hongie Gu, George K. Thiruvathukal
A Comparison Of Machine Learning Techniques For Taxonomic Classification Of Teeth From The Family Bovidae, Gregory J. Matthews, Juliet K. Brophy, Maxwell Luetkemeier, Hongie Gu, George K. Thiruvathukal
George K. Thiruvathukal
This study explores the performance of machine learning algorithms on the classification of fossil teeth in the Family Bovidae. Isolated bovid teeth are typically the most common fossils found in southern Africa and they often constitute the basis for paleoenvironmental reconstructions. Taxonomic identification of fossil bovid teeth, however, is often imprecise and subjective. Using modern teeth with known taxons, machine learning algorithms can be trained to classify fossils. Previous work by Brophy et al. [Quantitative morphological analysis of bovid teeth and implications for paleoenvironmental reconstruction of plovers lake, Gauteng Province, South Africa, J. Archaeol. Sci. 41 (2014), pp. …
Machine Learning For Stock Prediction Based On Fundamental Analysis, Yuxuan Huang
Machine Learning For Stock Prediction Based On Fundamental Analysis, Yuxuan Huang
Electronic Thesis and Dissertation Repository
Application of machine learning for stock prediction is attracting a lot of attention in recent years. A large amount of research has been conducted in this area and multiple existing results have shown that machine learning methods could be successfully used toward stock predicting using stocks’ historical data. Most of these existing approaches have focused on short term prediction using stocks’ historical price and technical indicators. In this thesis, we prepared 22 years’ worth of stock quarterly financial data and investigated three machine learning algorithms: Feed-forward Neural Network (FNN), Random Forest (RF) and Adaptive Neural Fuzzy Inference System (ANFIS) for …
Applicability Of Recurrent Neural Networks To Player Data Analysis In Freemium Video Games, Jonathan Tan
Applicability Of Recurrent Neural Networks To Player Data Analysis In Freemium Video Games, Jonathan Tan
Electronic Thesis and Dissertation Repository
We demonstrate the applicability and practicality of recurrent neural networks (RNNs), a machine learning methodology suited for sequential data, on player data from the mobile video game My Singing Monsters. Since this data comes in as a stream of events, RNNs are a natural solution for analyzing this data with minimal preprocessing. We apply RNNs to monitor and forecast game metrics, predict player conversion, estimate lifetime player value, and cluster player behaviours. In each case, we discuss why the results are interesting, how the trained models can be applied in a business setting, and how the preliminary work can …
Improved Evolutionary Support Vector Machine Classifier For Coronary Artery Heart Disease Prediction Among Diabetic Patients, Narasimhan B, Malathi A Dr
Improved Evolutionary Support Vector Machine Classifier For Coronary Artery Heart Disease Prediction Among Diabetic Patients, Narasimhan B, Malathi A Dr
Library Philosophy and Practice (e-journal)
Soft computing paves way many applications including medical informatics. Decision support system has gained a major attention that will aid medical practitioners to diagnose diseases. Diabetes mellitus is hereditary disease that might result in major heart disease. This research work aims to propose a soft computing mechanism named Improved Evolutionary Support Vector Machine classifier for CAHD risk prediction among diabetes patients. The attribute selection mechanism is attempted to build with the classifier in order to reduce the misclassification error rate of the conventional support vector machine classifier. Radial basis kernel function is employed in IESVM. IESVM classifier is evaluated through …
Deepreview: Automatic Code Review Using Deep Multi-Instance Learning, Hengyi Li, Shuting Shi, Ferdian Thung, Xuan Huo, Bowen Xu, Ming Li, David Lo
Deepreview: Automatic Code Review Using Deep Multi-Instance Learning, Hengyi Li, Shuting Shi, Ferdian Thung, Xuan Huo, Bowen Xu, Ming Li, David Lo
Research Collection School Of Computing and Information Systems
Code review, an inspection of code changes in order to identify and fix defects before integration, is essential in Software Quality Assurance (SQA). Code review is a time-consuming task since the reviewers need to understand, analysis and provide comments manually. To alleviate the burden of reviewers, automatic code review is needed. However, this task has not been well studied before. To bridge this research gap, in this paper, we formalize automatic code review as a multi-instance learning task that each change consisting of multiple hunks is regarded as a bag, and each hunk is described as an instance. We propose …