Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons


Deep learning


Articles 121 - 150 of 890

Full-Text Articles in Physical Sciences and Mathematics

Rosas: Deep Semi-Supervised Anomaly Detection With Contamination-Resilient Continuous Supervision, Hongzuo Xu, Yijie Wang, Guansong Pang, Songlei Jian, Ning Liu, Yongjun Wang Sep 2023


Research Collection School Of Computing and Information Systems

Semi-supervised anomaly detection methods leverage a few anomaly examples to yield drastically improved performance compared to unsupervised models. However, they still suffer from two limitations: 1) unlabeled anomalies (i.e., anomaly contamination) may mislead the learning process when all the unlabeled data are employed as inliers for model training; 2) only discrete supervision information (such as binary or ordinal data labels) is exploited, which leads to suboptimal learning of anomaly scores that essentially take on a continuous distribution. Therefore, this paper proposes a novel semi-supervised anomaly detection method, which devises contamination-resilient continuous supervisory signals. Specifically, we propose a mass interpolation method …


ArduinoProg: Towards Automating Arduino Programming, Imam Nur Bani Yusuf, Diyanah Binte Abdul Jamal, Lingxiao Jiang Sep 2023


Research Collection School Of Computing and Information Systems

Writing code for Arduino poses unique challenges. A developer must 1) have hardware-specific knowledge about the interface configuration between the Arduino controller and the I/O hardware, 2) identify a suitable driver library for the I/O hardware, and 3) follow certain usage patterns of the driver library in order to use it properly. In this work, based on a study of real-world user queries posted in the Arduino forum, we propose ArduinoProg to address such challenges. ArduinoProg consists of three components, i.e., Library Retriever, Configuration Classifier, and Pattern Generator. Given a query, Library Retriever retrieves library names relevant to the I/O hardware identified …


Methods Of Evaluating Quantum Phase Estimation Circuit Output, Charles A. Woodrum Sep 2023


Theses and Dissertations

The quantum phase estimation (QPE) algorithm is one of the most important quantum computing algorithms that has been developed. The QPE algorithm estimates the phase or phases of the eigenvalue or eigenvalues of a unitary operator. It is a critical step for applications like Shor’s algorithm for factoring and the HHL algorithm for solving linear systems of equations, but it remains difficult to implement on current quantum computers due to small numbers of logical qubits and high error rates. This investigation derives a more accurate estimation of the phase of a unitary operator than would otherwise be attained with the …
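For orientation, the textbook, noise-free behaviour of QPE can be sketched classically. The snippet below is a minimal illustration, not the estimation method derived in this thesis: it assumes an exact eigenstate with eigenvalue exp(2πiθ) and an ideal t-qubit counting register, and the function names are hypothetical. It computes the probability of each readout k, which encodes the estimate k / 2^t.

```python
import cmath
import math


def qpe_distribution(theta, t):
    """Ideal (noise-free) readout probabilities for a t-qubit QPE register.

    theta is the eigenphase in [0, 1), i.e. U|psi> = exp(2*pi*i*theta)|psi>.
    Entry k is the probability of reading the integer k, which encodes
    the phase estimate k / 2**t.
    """
    n = 2 ** t
    probs = []
    for k in range(n):
        # Amplitude after the inverse quantum Fourier transform:
        # a geometric sum over the n basis states of the register.
        amp = sum(
            cmath.exp(2j * math.pi * j * (theta - k / n)) for j in range(n)
        ) / n
        probs.append(abs(amp) ** 2)
    return probs


def most_likely_estimate(theta, t):
    """Return the phase estimate k / 2**t with the highest probability."""
    probs = qpe_distribution(theta, t)
    k = max(range(len(probs)), key=probs.__getitem__)
    return k / len(probs)
```

When θ is exactly representable in t bits, a single outcome carries all the probability; otherwise the distribution spreads over neighbouring outcomes, which is one reason post-processing of the circuit output can matter.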


A Proposed Artificial Intelligence Model For Android-Malware Detection, Fatma Taher, Omar Al Fandi, Mousa Al Kfairy, Hussam Al Hamadi, Saed Alrabaee Aug 2023


All Works

There are a variety of reasons why smartphones have grown so pervasive in our daily lives. While their benefits are undeniable, Android users must be vigilant against malicious apps. The goal of this study was to develop a broad framework for detecting Android malware using multiple deep learning classifiers; this framework was given the name DroidMDetection. To provide precise, dynamic, Android malware detection and clustering of different families of malware, the framework makes use of unique methodologies built based on deep learning and natural language processing (NLP) techniques. When compared to other similar works, DroidMDetection (1) uses API calls and …


Self-Supervised Pretraining And Transfer Learning On fMRI Data With Transformers, Sean Paulsen Aug 2023


Dartmouth College Ph.D Dissertations

Transfer learning is a machine learning technique founded on the idea that knowledge acquired by a model during “pretraining” on a source task can be transferred to the learning of a target task. Successful transfer learning can result in improved performance, faster convergence, and reduced demand for data. This technique is particularly desirable for the task of brain decoding in the domain of functional magnetic resonance imaging (fMRI), wherein even the most modern machine learning methods can struggle to decode labelled features of brain images. This challenge is due to the highly complex underlying signal, physical and neurological differences between …


The Student Becomes The Teacher: Training High-Performance Language Models More Sample-Efficiently From Small Models Via Superstilling, Chaz Allen Gundry Aug 2023


Theses and Dissertations

Recent advances including the Transformer architecture have revolutionized the Natural Language Processing community by providing immense performance improvements across many tasks, including the development of Large Language Models (LLMs). LLMs show enormous promise as few-shot learners, common-sense knowledge repositories, conversational agents, writing assistants, and coding tools, and are gaining widespread traction in commercial industry. However, LLMs are expensive and time-consuming to train, requiring many passes over terabytes of data for the largest models. In this paper, we present Superstilling, a method for reducing the sample complexity of language model training by distilling the knowledge from a previously-trained model (the teacher) …


A Hybrid Ensemble Of Learning Models, Bivin Sadler, Dhruba Dey, Duy Nguyen, Tavin Weeda Aug 2023


SMU Data Science Review

Statistical models in time series forecasting have long been challenged by, and risk being superseded by, deep learning models. This research proposes a new hybrid ensemble of forecasting models that combines the strengths of several strong candidates from these two model types. The proposed ensemble aims to improve the accuracy of forecasts and reduce computational complexity by leveraging the strengths of each candidate model.


More Is Better Than One: The Effect Of Ensembling On Deep Learning Performance In Biochemical Prediction Problems, Jacob A. Stern Aug 2023


Theses and Dissertations

This thesis presents two papers addressing important biochemical prediction challenges. The first paper focuses on accurate protein distance predictions and introduces updates to the ProSPr network. We evaluate its performance in the Critical Assessment of techniques for Protein Structure Prediction (CASP14) competition, investigating its accuracy dependence on sequence length and multiple sequence alignment depth. The ProSPr network, an ensemble of three convolutional neural networks (CNNs), demonstrates superior performance compared to individual networks. The second paper addresses the issue of accurate ligand ranking in virtual screening for drug discovery. We propose MILCDock, a machine learning consensus docking tool that leverages predictions …


Optimizing Collective Communication For Scalable Scientific Computing And Deep Learning, Jiali Li Aug 2023


Doctoral Dissertations

In the realm of distributed computing, collective operations involve coordinated communication and synchronization among multiple processing units, enabling efficient data exchange and collaboration. Scientific applications, such as simulations, computational fluid dynamics, and scalable deep learning, require complex computations that can be parallelized across multiple nodes in a distributed system. These applications often involve data-dependent communication patterns, where collective operations are critical for achieving high performance in data exchange. Optimizing collective operations for scientific applications and deep learning involves improving the algorithms, communication patterns, and data distribution strategies to minimize communication overhead and maximize computational efficiency.

Within the context of this …


Insights Into The Application Of Deep Reinforcement Learning In Healthcare And Materials Science, Benjamin R. Smith Aug 2023


Doctoral Dissertations

Reinforcement learning (RL) is a type of machine learning designed to optimize sequential decision-making. While controlled environments have served as a foundation for RL research, due to the growth in data volumes and deep learning methods, it is now increasingly being applied to real-world problems. In our work, we explore and attempt to overcome challenges that occur when applying RL to solve problems in healthcare and materials science.

First, we explore how issues in bias and data completeness affect healthcare applications of RL. To understand how bias has already been considered in this area, we survey the literature for existing …


The Development Of Artificial Intelligence-Based Tools For Expert Peer Review Of Radiotherapy Treatment Plans, Mary Gronberg Aug 2023


Dissertations & Theses (Open Access)

Creating a patient-specific radiation treatment plan is a time-consuming and operator-dependent manual process. The treatment planner adjusts the planning parameters in a trial-and-error fashion in an effort to balance the competing clinical objectives of tumor coverage and normal tissue sparing. Often, a plan is selected because it meets basic organ at risk dose thresholds for severe toxicity; however, it is evident that a plan with a decreased risk of normal tissue complication probability could be achieved. This discrepancy between “acceptable” and “best possible” plan is magnified if either the physician or treatment planner lacks focal expertise in the disease site. …


Increasing The Efficiency And Accuracy Of Collective Intelligence Methods For Image Classification, Md Mahmudulla Hassan Aug 2023


Open Access Theses & Dissertations

Collective intelligence has emerged as a powerful methodology for annotating and classifying challenging data that pose difficulties for automated classifiers. It works by leveraging the concept of "wisdom of the crowds" which approximates a ground truth after aggregating experts' feedback and filtering out noise. However, challenges arise when certain applications, such as medical image classification, security threat detection, and financial fraud detection, demand accurate and reliable data annotation. The unreliability of experts due to inconsistent expertise and competencies, coupled with the associated cost and time-consuming judgment extraction, presents additional challenges.

Input aggregation is the process of consolidating and combining multiple …
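As a point of reference, the simplest form of input aggregation is a majority vote over the labels that several annotators give to the same item. The sketch below illustrates that baseline only; the abstract does not state which aggregation method the thesis uses, and the function name and example labels are hypothetical.

```python
from collections import Counter


def majority_vote(labels):
    """Aggregate one item's annotations by majority vote.

    Returns (winning_label, agreement), where agreement is the fraction
    of annotators who chose the winning label. Ties go to the label
    that appears first in the input.
    """
    if not labels:
        raise ValueError("need at least one annotation")
    counts = Counter(labels)
    winner, votes = counts.most_common(1)[0]
    return winner, votes / len(labels)
```

For example, `majority_vote(["benign", "malignant", "benign"])` selects "benign" with two of three annotators in agreement. A low agreement value is a cheap signal that an item may need additional expert input.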


Multi-Granularity Detector For Vulnerability Fixes, Truong Giang Nguyen, Cong, Thanh Le, Hong Jin Kang, Ratnadira Widyasari, Chengran Yang, Zhipeng Zhao, Bowen Xu, Jiayuan Zhou, Xin Xia, Ahmed E. Hassan, David Lo Aug 2023


Research Collection School Of Computing and Information Systems

With the increasing reliance on Open Source Software, users are exposed to third-party library vulnerabilities. Software Composition Analysis (SCA) tools have been created to alert users of such vulnerabilities. SCA requires the identification of vulnerability-fixing commits. Prior works have proposed methods that can automatically identify such vulnerability-fixing commits. However, identifying such commits is highly challenging, as only a very small minority of commits are vulnerability fixing. Moreover, code changes can be noisy and difficult to analyze. We observe that noise can occur at different levels of detail, making it challenging to detect vulnerability fixes accurately. To address these challenges and …


Evaluating ChatGPT For Recommendation: How Does The Ability To Converse Impact Recommendation?, Kyle Spurlock Aug 2023


Electronic Theses and Dissertations

Recommendation algorithms have become an absolute necessity in the modern world to avoid information overload. However, the interaction between the human and the system is largely superficial and without any real contact. If you are given poor recommendations, you have no choice but to sift through mountains of content on your own until the model learns to accommodate your tastes more. This is bad for business as well as the consumer. Recently, large language models like ChatGPT have seen a significant rise in popularity due to their ease of use and wide range of knowledge. It has now become nearly …


Mastering Stock Markets With Efficient Mixture Of Diversified Trading Experts, Shuo Sun, Xinrun Wang, Wanqi Xue, Xiaoxuan Lou, Bo An Aug 2023


Research Collection School Of Computing and Information Systems

Quantitative stock investment is a fundamental financial task that relies heavily on accurate prediction of market status and profitable investment decision making. Although recent advances in deep learning (DL) have shown stellar performance in capturing trading opportunities in the stochastic stock market, the performance of existing DL methods is unstable and sensitive to network initialization and hyperparameter selection. One major limitation of existing works is that investment decisions are made based on one individual neural network predictor with high uncertainty, which is inconsistent with the workflow in real-world trading firms. To tackle this limitation, we propose AlphaMix, a novel three-stage …


Robust Digital Nucleic Acid Memory, Golam Md Mortuza Aug 2023


Boise State University Theses and Dissertations

The rapid growth of data generation from electronic devices has created a critical demand for efficient and sustainable data storage solutions. Traditional storage systems face challenges regarding reliability, energy consumption, and scalability, necessitating the exploration of alternative technologies. This dissertation explores the potential of Deoxyribonucleic Acid (DNA) as an alternative storage medium, along with the associated challenges and potential solutions.

This dissertation focuses on Digital Nucleic Acid Memory (dNAM), which utilizes Single Molecule Localization Microscopy (SMLM) to encode and store data within DNA structures called DNA origami. SMLM surpasses the limitations of light’s diffraction limit, enabling the imaging of biological …


Accuracy Vs. Energy: An Assessment Of Bee Object Inference In Videos From On-Hive Video Loggers With YOLOv3, YOLOv4-Tiny, And YOLOv7-Tiny, Vladimir A. Kulyukin, Aleksey V. Kulyukin Jul 2023


Computer Science Faculty and Staff Publications

A continuing trend in precision apiculture is to use computer vision methods to quantify characteristics of bee traffic in managed colonies at the hive's entrance. Since traffic at the hive's entrance is a contributing factor to the hive's productivity and health, we assessed the potential of three open-source convolutional network models, YOLOv3, YOLOv4-tiny, and YOLOv7-tiny, to quantify omnidirectional traffic in videos from on-hive video loggers on regular, unmodified one- and two-super Langstroth hives and compared their accuracies, energy efficacies, and operational energy footprints. We trained and tested the models with a 70/30 split on a dataset of 23,173 flying bees …


Empowering Patient Similarity Networks Through Innovative Data-Quality-Aware Federated Profiling, Alramzana Nujum Navaz, Mohamed Adel Serhani, Hadeel T. El Kassabi, Ikbal Taleb Jul 2023


All Works

Continuous monitoring of patients involves collecting and analyzing sensory data from a multitude of sources. To overcome communication overhead, ensure data privacy and security, reduce data loss, and maintain efficient resource usage, the processing and analytics are moved close to where the data are located (e.g., the edge). However, data quality (DQ) can be degraded because of imprecise or malfunctioning sensors, dynamic changes in the environment, transmission failures, or delays. Therefore, it is crucial to keep an eye on data quality and spot problems as quickly as possible, so that they do not mislead clinical judgments and lead to the …


Bertastic At SemEval-2023 Task 3: Fine-Tuning Pretrained Multilingual Transformers – Does Order Matter?, Tarek Mahmoud, Preslav Nakov Jul 2023


Natural Language Processing Faculty Publications

The naïve approach for fine-tuning pretrained deep learning models on downstream tasks involves feeding them mini-batches of randomly sampled data. In this paper, we propose a more elaborate method for fine-tuning Pretrained Multilingual Transformers (PMTs) on multilingual data. Inspired by the success of curriculum learning approaches, we investigate the significance of fine-tuning PMTs on multilingual data in a sequential fashion language by language. Unlike the curriculum learning paradigm where the model is presented with increasingly complex examples, we do not adopt a notion of “easy” and “hard” samples. Instead, our experiments draw insight from psychological findings on how the human …


Lightweight Deep Neural Network Models For Electromyography Signal Recognition For Prosthetic Control, Ahmet Mert Jul 2023


Turkish Journal of Electrical Engineering and Computer Sciences

In this paper, lightweight deep learning methods are proposed to recognize multichannel electromyography (EMG) signals against varying contraction levels. Classical machine learning and signal processing methods, namely linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), root mean square (RMS), and waveform length (WL), are adapted to a convolutional neural network (CNN) and a long short-term memory neural network (LSTM). Eight-channel recordings of nine amputees from a publicly available dataset are used for training and testing the proposed models considering prosthetic control strategies. Six-class hand movements with three contraction levels are applied to WL and RMS-based feature extraction. After that, they …


Improving Unet Segmentation Performance Using An Ensemble Model In Images Containing Railway Lines, Mehmet Sevi̇, İlhan Aydin Jul 2023


Turkish Journal of Electrical Engineering and Computer Sciences

This study aims to make sense of the autonomous system and the railway environment for railway vehicles. For this purpose, by determining the railway line, information about the general condition of the line can be obtained along the way. In addition, objects such as pedestrian crossings, people, cars, and traffic signs on the line will be extracted. The rails and the rail environment in the images will be segmented with a semantic segmentation network. In order to ensure the safety of rail transport, computer vision and deep learning-based methods are increasingly used to inspect railway tracks and surrounding objects. In …


Learning Deep Time-Index Models For Time Series Forecasting, Jiale Gerald Woo, Chenghao Liu, Doyen Sahoo, Akshat Kumar, Steven Hoi Jul 2023


Research Collection School Of Computing and Information Systems

Deep learning has been actively applied to time series forecasting, leading to a deluge of new methods, belonging to the class of historical-value models. Yet, despite the attractive properties of time-index models, such as being able to model the continuous nature of underlying time series dynamics, little attention has been given to them. Indeed, while naive deep time-index models are far more expressive than the manually predefined function representations of classical time-index models, they are inadequate for forecasting, being unable to generalize to unseen time steps due to the lack of inductive bias. In this paper, we propose DeepTime, a …


Deep Learning Methods For Some Problems In Scientific Computing, Yuankai Teng Jul 2023


Theses and Dissertations

Deep learning has emerged as a powerful approach for solving complex problems in scientific computing due to the increasing availability of large-scale data and computational resources. This thesis explores the potential of deep learning methods for three specific problems in scientific computing: (i) reducing the dimensions of variables in function approximation, (ii) solving linear reaction-diffusion equations, and (iii) finding the parametric representations of parameters in the numerical schemes for solving time-dependent partial differential equations.

For the first problem, a novel deep learning architecture is developed for reducing the dimensions of variables in function approximation. The proposed method achieves state-of-the-art performance …


Mine Water Inrush Prediction Method Based On VMD-DBN Model, Liu Hui, Liu Guiqin, Ning Dianyan, Fan Juan, Chen Weiming Jun 2023


Coal Geology & Exploration

In coal mining, the loss of life and property caused by mine water inrush is extremely serious. To prevent water inrush accidents and understand how water inrush changes over time, water inrush prediction, especially the accurate estimation of mine water inrush, is very important; it is also an important task in the prevention and control of mine water damage. To increase the prediction accuracy of mine water inrush, an efficient time series prediction model combining Variational Mode Decomposition (VMD) and Deep Belief Network (DBN) was proposed for the series of …


Utilizing Few-Shot Meta Learning Algorithms For Medical Image Segmentation, Nick Littlefield Jun 2023


Thinking Matters Symposium

Deep learning models can be difficult to train because they require large amounts of data, which are often unavailable or too expensive to collect or annotate. To overcome this problem, we can use few-shot meta-learning, which allows us to train deep learning models with little data. Using a few examples, meta-learning, or learning-to-learn, aims to use the experience learned during training to generalize to unknown tasks. Medical imaging is a field where this is particularly useful, as there is limited publicly available data due to patient privacy concerns and annotating costs.

This project examines how meta-learning performs …
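Few-shot meta-learning is commonly trained on small "episodes", each built from a handful of labelled examples. The sketch below shows one generic way to sample an N-way K-shot episode; it is an illustration under assumptions, not this project's pipeline. The function name, the `data_by_class` layout, and the parameter names are all hypothetical, and for segmentation each example would be an image and mask pair.

```python
import random


def sample_episode(data_by_class, n_way, k_shot, n_query, rng=None):
    """Draw one N-way K-shot episode.

    data_by_class maps a class label to a list of examples.
    Returns (support, query), each a list of (example, label) pairs.
    The support set holds k_shot examples per class, the query set
    holds n_query further examples per class, with no overlap.
    """
    rng = rng or random.Random()
    # Pick the classes that define this episode's task.
    classes = rng.sample(sorted(data_by_class), n_way)
    support, query = [], []
    for label in classes:
        # Sampling without replacement keeps support and query disjoint.
        picked = rng.sample(data_by_class[label], k_shot + n_query)
        support += [(x, label) for x in picked[:k_shot]]
        query += [(x, label) for x in picked[k_shot:]]
    return support, query
```

The model adapts on the support set and is scored on the query set, so every episode mimics the situation of meeting a new task with only a few labelled examples.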


Sarcasm Detection In English And Arabic Tweets Using Transformer Models, Rishik Lad Jun 2023


Computer Science Senior Theses

This thesis describes our approach toward the detection of sarcasm and its various types in English and Arabic Tweets through methods in deep learning. There are five problems we attempted: (1) detection of sarcasm in English Tweets, (2) detection of sarcasm in Arabic Tweets, (3) determining the type of sarcastic speech subcategory for English Tweets, (4) determining which of two semantically equivalent English Tweets is sarcastic, and (5) determining which of two semantically equivalent Arabic Tweets is sarcastic. All tasks were framed as classification problems, and our contributions are threefold: (a) we developed an English binary classifier system with RoBERTa, …


Patient Movement Monitoring Based On IMU And Deep Learning, Mohsen Sharifi Renani Jun 2023


Electronic Theses and Dissertations

Osteoarthritis (OA) is the leading cause of disability among the aging population in the United States and is frequently treated by replacing deteriorated joints with metal and plastic components. Developing better quantitative measures of movement quality to track patients longitudinally in their own homes would enable personalized treatment plans and hasten the advancement of promising new interventions. Wearable sensors and machine learning used to quantify patient movement could revolutionize the diagnosis and treatment of movement disorders. The purpose of this dissertation was to overcome technical challenges associated with the use of wearable sensors, specifically Inertial Measurement Units (IMUs), as a …


Connecting Linguistic Expressions And Pain Relief Through Transformer Model Construction And Analysis, Sarah M. Chacko May 2023


Computer Science Senior Theses

Chronic pain is a widespread problem that significantly impacts quality of life. Overprescription and abuse of pain medication continues to be a major public health issue and can further burden patients due to a fragmented health care system. Previous research has suggested a possible psychological basis to pain and the potential for safer, non-pharmacological alternatives for pain relief. This project leverages language models to study chronic pain development and relief through psychological treatments, which will be assessed through responses to post-treatment interviews. A transformer-based natural language processing model is employed to identify connections between language expressions and pain on a …


Deep Learning For Skin Photoaging, Gokul Srinivasan May 2023


Computer Science Senior Theses

Skin photoaging is the premature aging of skin that results from ultraviolet light exposure. It is a major risk factor for the development of skin cancer, among other malignant skin pathologies. Accordingly, understanding its etiology is important for both preventative and reparative clinical action. In this study, skin samples obtained from patients with varying solar elastosis grades – a proxy for skin photoaging – were sequenced using next-generation sequencing techniques to further understand the genomic, epigenomic, and histological signs and signals of skin photoaging. The results of this study suggest that tissues with severe photoaging exhibit increases in the frequency …


How Object Segmentation And Perceptual Grouping Emerge In Noisy Variational Autoencoders, Ben Lonnqvist, Zhengqing Wu, Michael H. Herzog May 2023


MODVIS Workshop

Many animals and humans can recognize and segment objects from their backgrounds. Whether object segmentation is necessary for object recognition has long been a topic of debate. Deep neural networks (DNNs) excel at object recognition, but not at segmentation tasks - this has led to the belief that object recognition and segmentation are separate mechanisms in visual processing. Here, however, we show evidence that in variational autoencoders (VAEs), segmentation and faithful representation of data can be interlinked. VAEs are encoder-decoder models that learn to represent independent generative factors of the data as a distribution in a very small bottleneck layer; …