Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Institution
Keyword
Publication Year
Publication
Publication Type
File Type

Articles 301 - 330 of 13244

Full-Text Articles in Physical Sciences and Mathematics

Ensemble Classification: An Analysis Of The Random Forest Model, Jarod Korn Jan 2024

Ensemble Classification: An Analysis Of The Random Forest Model, Jarod Korn

Williams Honors College, Honors Research Projects

The random forest model proposed by Dr. Leo Breiman in 2001 is an ensemble machine learning method for classification prediction and regression. In the following paper, we will conduct an analysis on the random forest model with a focus on how the model works, how it is applied in software, and how it performs on a set of data. To fully understand the model, we will introduce the concept of decision trees, give a summary of the CART model, explain in detail how the random forest model operates, discuss how the model is implemented in software, demonstrate the model by …


Tropical Fish Study In Tahiti, French Polynesia, Miranda Brainard, Caitlyn Swango, Paityn Houglan, Richard Londraville Jan 2024

Tropical Fish Study In Tahiti, French Polynesia, Miranda Brainard, Caitlyn Swango, Paityn Houglan, Richard Londraville

Williams Honors College, Honors Research Projects

In May of 2023, I embarked on an exciting research journey to Moorea, French Polynesia, alongside fellow students and faculty members from the University of Akron and Syracuse University. This expedition was part of the university-sponsored Tropical Vertebrate Biology course, where we delved into the exploration of various tropical species inhabiting the island, including sea urchins, geckos, and my primary focus, the blackspotted rockskipper.

My research team, composed of my co-authors and me, was particularly intrigued by the unique refuge-seeking behavior displayed by blackspotted rockskippers. These amphibious fish are renowned for their remarkable ability to inhabit tide pools and rocky …


Statistical Modeling Of Bankruptcy Data, Andrew Elsfelder Jan 2024

Statistical Modeling Of Bankruptcy Data, Andrew Elsfelder

Williams Honors College, Honors Research Projects

My project uses a dataset of bankrupt and non-bankrupt companies in Taiwan from 1999 to 2009. This data was collected from the Taiwan Economic Journal. The statistical methods I used to model the data are CHAID, CART, and logistic regression. The models created are tools that can predict if a company is bankrupt, or not-bankrupt based on other data about the company. I created multiple models for each of the methods to find the best model for each method. I then analyzed the output from each method. Lastly, I determined which model was the best for this data based on …


A Copula Discretization Of Time Series-Type Model For Examining Climate Data, Dimuthu Fernando, Olivia Atutey, Norou Diawara Jan 2024

A Copula Discretization Of Time Series-Type Model For Examining Climate Data, Dimuthu Fernando, Olivia Atutey, Norou Diawara

Mathematics & Statistics Faculty Publications

The study presents a comparative analysis of climate data under two scenarios: a Gaussian copula marginal regression model for count time series data and a copula-based bivariate count time series model. These models, built after comprehensive simulations, offer adaptable autocorrelation structures considering the daily average temperature and humidity data observed at a regional airport in Mobile, AL.


Reinforcement Learning: Applying Low Discrepancy Action Selection To Deep Deterministic Policy Gradient, Aleksandr Svishchev Jan 2024

Reinforcement Learning: Applying Low Discrepancy Action Selection To Deep Deterministic Policy Gradient, Aleksandr Svishchev

Electronic Theses and Dissertations

Reinforcement learning (RL) is a subfield of machine learning concerned with agents learning to behave optimally by interacting with an environment. One of the most important topics in RL is how the agent should explore, that is, how to choose actions in order to rate their impact on long-term reward. For example, a simple baseline strategy might be uniformly random action selection. This thesis investigates the heuristic idea that agents will learn faster if they explore by factoring the environment’s state into their decision and intentionally choose actions which are as different as possible from what they have previously observed. …


Performing Holt-Winters Time Series Forecasting Using Neural Network Based Models, Kazeem Olanrewaju Bankole Jan 2024

Performing Holt-Winters Time Series Forecasting Using Neural Network Based Models, Kazeem Olanrewaju Bankole

Electronic Theses and Dissertations

We show how to create Artificial Neural Network based models for performing the well- known Holt-Winters time series analysis. Our work fares well compared to the well-known Holt-Winter time series prediction method while avoiding the burden of searching for the parameters of the model. We present the theoretical justification of the connection between the two models and experimental results showing the similarities of these models


A Comparative Analysis Of A Family Of Advanced Iterative Optimization Methods In Nonlinear Regression, Tanmoy Kumar Debnath Jan 2024

A Comparative Analysis Of A Family Of Advanced Iterative Optimization Methods In Nonlinear Regression, Tanmoy Kumar Debnath

Electronic Theses and Dissertations

Classical statistical supervised learning optimization techniques like the Gauss-Newton Iterative Method (GNIM), Weighted Gauss-Newton Iterative Method (WGNIM), Reweighted Gauss-Newton Iterative Method (RGNIM), and Levenberg-Marquart (LM) algorithm extend the nonlinear least squares method. The WGNIM improves model fitting by controlling heteroscedasticity in the linear and nonlinear models. A comparative analysis of the GNIM, WGNIM, RGNIM, and LM methods for fitting nonlinear models is presented. A step-wise diagnosis for structural multicollinearity in the reweighted linearized model is investigated via the Variance Inflation Factor (VIF) to determine variance inflation in the sequence of estimators for the model parameters. Under restricted multicollinearity levels in …


A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox Jan 2024

A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox

Dartmouth College Master’s Theses

The end-Cretaceous mass extinction was marked by both the Chicxulub impact and the ongoing emplacement of the Deccan Traps flood basalt province. Both of these events perturbed the environment by the emission of climate-active volatiles, primarily CO2 and SO2. To understand the mechanism of extinction, we must disentangle the timing, duration, and intensity of volcanic and meteoritic environmental forcings. In this thesis, we used a parallel Markov chain Monte Carlo approach to invert for the aforementioned volatile emissions, export productivity, and remineralization from 67 to 65 million years ago using the LOSCAR (Long-term Ocean-atmosphere-Sediment CArbon cycle Reservoir) model. The parallel …


Advancing Deep Learning With Graph-Based Structural Insights: From Graph Classification To Semantic Segmentation, Xin Ma Jan 2024

Advancing Deep Learning With Graph-Based Structural Insights: From Graph Classification To Semantic Segmentation, Xin Ma

Computer Science and Engineering Dissertations

Deep learning has profoundly transformed machine learning by offering sophisticated data representations, yet effectively incorporating structural information remains a challenge. Structural data, whether explicit or implicit, has the potential to significantly enhance the performance of deep learning tasks. This research investigates the benefits of structural information across three crucial tasks: classification, clustering, and segmentation. For explicit structural data, where inputs are directly represented as graphs, we investigate graph-level classification in brain connectivity networks. We introduce the Multi-resolution Edge Network (MENET), a novel framework designed to identify disease-specific connectomic benchmarks with high discriminatory power across diagnostic categories. MENET leverages graph-level representations …


Self-Exciting Point Processes In Real Estate, Ian Fraser Jan 2024

Self-Exciting Point Processes In Real Estate, Ian Fraser

Theses and Dissertations (Comprehensive)

This thesis introduces a novel approach to analyzing residential property sales through the lens of stochastic processes by employing point processes. Herein, property sales are treated as point patterns, using self-exciting point process models and a variety of statistical tools to uncover underlying patterns in the data. Key findings include the identification and explanation of clustering in both space and time, and the efficacy of a temporal Hawkes process with a sinusoidal background in predicting home sale occurrences. The temporal analysis starts by employing the state of art techniques for time series data like regression, autoregressive, and autoregressive integrated moving …


Regional Price Index 2023, Department Of Primary Industries And Regional Development, Western Australia Jan 2024

Regional Price Index 2023, Department Of Primary Industries And Regional Development, Western Australia

Statistics

The 2023 Regional Price Index (RPI) is the eleventh State Government Index contrasting the cost of a common basket of goods and services at a number of regional locations to the Perth metropolitan region. The RPI is used as the basis for the construction of the public sector district allowance, and by the private sector when considering remuneration packages for remotely located staff.

The RPI provides an insight into differences in regional consumer costs. The 2023 RPI basket of 185 goods and services was priced in 39 regional centres around Western Australia.

The 2023 RPI results show that, overall, prices …


Bayesian Estimation Of Hierarchical Linear Models From Incomplete Data: Cluster-Level Non-Linear Effects And Small Sample Sizes, Dongho Shin Jan 2024

Bayesian Estimation Of Hierarchical Linear Models From Incomplete Data: Cluster-Level Non-Linear Effects And Small Sample Sizes, Dongho Shin

Theses and Dissertations

We consider Bayesian estimation of a hierarchical linear model (HLM) from small sample sizes. The continuous response Y and covariates C are partially observed and assumed missing at random. With C having linear effects, the HLM may be efficiently estimated by available methods. When C includes cluster-level covariates having interactive or other nonlinear effects given small sample sizes, however, maximum likelihood estimation is suboptimal, and existing Gibbs samplers are based on a Bayesian joint distribution compatible with the HLM, but impute missing values of C by a Metropolis algorithm via a proposal density having a constant variance while the target …


Exact Testing For Heteroscedasticity In A Two-Way Layout In Variety Frost Trials When Incorporating A Covariate, Angelika A. Pilkington, Brenton R. Clarke, Dean A. Diepeveen Jan 2024

Exact Testing For Heteroscedasticity In A Two-Way Layout In Variety Frost Trials When Incorporating A Covariate, Angelika A. Pilkington, Brenton R. Clarke, Dean A. Diepeveen

Grain and Other Field Crops Research Articles

Two-way layouts are common in grain industry research where it is often the case that there are one or more covariates. It is widely recognised that when estimating fixed effect parameters, one should also examine for possible extra error variance structure. An exact test for heteroscedasticity, when there is a covariate, is illustrated for a data set from frost trials in Western Australia. While the general algebra for the test is known, albeit in past literature, there are computational aspects of implementing the test for the two way when there are covariates. In this scenario the test is shown to …


The Distribution Of The Significance Level, Paul O. Monnu Jan 2024

The Distribution Of The Significance Level, Paul O. Monnu

Electronic Theses and Dissertations

Reporting the p-value is customary when conducting a test of hypothesis or significance. The likelihood of getting a fictitious second sample and presuming the null hypothesis is correct is the p-value. The significance level is a statistic that interests us to investigate. Being a statistic, it has a distribution. For the F-test in a one-way ANOVA and the t-tests for population means, we define the significance level, its observed value, and the observed significance level. It is possible to derive the significance level distribution. The t-test and the F-test are not without controversy. Specifically, we demonstrate that as sample size …


Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, Syed Tahmid Hussan Jan 2024

Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, Syed Tahmid Hussan

Electronic Theses and Dissertations

The understanding of Bender Element mechanism and utilization of Particle Flow Code (PFC) to simulate the seismic wave behavior is important to test the dynamic behavior of soil particles. Both discrete and finite element methods can be used to simulate wave behavior. However, Discrete Element Method (DEM) is mostly suitable, as the micro scaled soil particle cannot be fully considered as continuous specimen like a piece of rod or aluminum. Recently DEM has been widely used to study mechanical properties of soils at particle level considering the particles as balls. This study represents a comparative analysis of Voigt and Best …


Influence Of Attack Performance On The Ovc Volleyball Regular Seasons 2022 & 2023, Ignacio Valdemoros Jan 2024

Influence Of Attack Performance On The Ovc Volleyball Regular Seasons 2022 & 2023, Ignacio Valdemoros

Masters Theses

Understanding the outcome of volleyball games is necessary for coaches before, after, and during a season. There are several ways to gain this understanding, but statistical analysis is fundamental to see the minimum patterns of behavior that influence wins and losses in Volleyball. Furthermore, this analysis helps identify the optimal approach to achieving a goal and determining the most effective alternative to success. Scoring points in Volleyball involves three key skills: serving, blocking, and attacking. Among these skills, attacking plays the most relevant role in determining the outcome of a match. The position on the court (e.g. Outside Hitter, Middle …


A Quantitative Analysis Of Seaplane Accidents From 1982-2021, David C. Ison Jan 2024

A Quantitative Analysis Of Seaplane Accidents From 1982-2021, David C. Ison

International Journal of Aviation, Aeronautics, and Aerospace

This study aimed to assess and analyze all historical National Transportation Safety Board accident reports since 1982. For analysis, reports were bisected into seaplane (float, amphibian, and hull) and non-seaplane groups. Findings showed that there is a deficiency in the level of available detail on the seaplane fleet and cadre of seaplane pilots in the U.S. During the most recent ten years of complete data (2012-2021) showed a negative trend in all accidents and fatal accidents, although only the latter being statistically convincing. During this timeframe, seaplane accident pilots had significantly higher total time and age than other groups (non-seaplane …


Differential Impacts Of Weather Anomalies On Household Energy Expenditure Shares: A Comparison Of Clustered Panel Analysis Methods, Jordan Champion Jan 2024

Differential Impacts Of Weather Anomalies On Household Energy Expenditure Shares: A Comparison Of Clustered Panel Analysis Methods, Jordan Champion

Theses and Dissertations--Agricultural Economics

Recent emphasis on environmental justice has highlighted deficiencies in our energy system that produce disparities in accessibility and affordability for the most vulnerable. Meanwhile, the realities of a gradually warming climate and the onset of a global energy crisis (IEA 2022) have coincidently contributed to spikes in both energy prices and demand. These implications threaten to further exacerbate existing disparities for income-constrained and vulnerable populations, enhancing their risk of falling into prolonged insecurity. To ensure our transition to a just, sustainable future, we must first ensure equitable access to affordable and reliable energy for everyone. Combining household-level panel and state-level …


Deep Learning One-Class Classification With Support Vector Methods, Hayden D. Hampton Jan 2024

Deep Learning One-Class Classification With Support Vector Methods, Hayden D. Hampton

Graduate Thesis and Dissertation 2023-2024

Through the specialized lens of one-class classification, anomalies–irregular observations that uncharacteristically diverge from normative data patterns–are comprehensively studied. This dissertation focuses on advancing boundary-based methods in one-class classification, a critical approach to anomaly detection. These methodologies delineate optimal decision boundaries, thereby facilitating a distinct separation between normal and anomalous observations. Encompassing traditional approaches such as One-Class Support Vector Machine and Support Vector Data Description, recent adaptations in deep learning offer a rich ground for innovation in anomaly detection. This dissertation proposes three novel deep learning methods for one-class classification, aiming to enhance the efficacy and accuracy of anomaly detection in …


On A Fully Coupled Nonlocal Multipoint Boundary Value Problem For A Dual Hybrid System Of Nonlinear Q -Fractional Differential Equations, Ahmed Alsaedi, Martin Bohner, Bashir Ahmad, Boshra Alharbi Jan 2024

On A Fully Coupled Nonlocal Multipoint Boundary Value Problem For A Dual Hybrid System Of Nonlinear Q -Fractional Differential Equations, Ahmed Alsaedi, Martin Bohner, Bashir Ahmad, Boshra Alharbi

Mathematics and Statistics Faculty Research & Creative Works

A new class of nonlocal multipoint boundary value problems involving a dual hybrid system of nonlinear Riemann-Liouville-type q-fractional differential equations is studied in this paper. Existence and uniqueness results for the given problem are derived by applying the Leray-Schauder nonlinear alternative and the Banach contraction mapping principle. Examples are presented for illustrating the obtained results. The work established in this paper is a useful contribution to the existing literature on q-fractional differential equations. Some interesting special cases are also discussed.


Critical Point Approaches To Nonlinear Square Root Laplacian Equations, Martin Bohner, Giuseppe Caristi, Shapour Heidarkhani, Amjad Salari Jan 2024

Critical Point Approaches To Nonlinear Square Root Laplacian Equations, Martin Bohner, Giuseppe Caristi, Shapour Heidarkhani, Amjad Salari

Mathematics and Statistics Faculty Research & Creative Works

This work is devoted to the study of multiplicity results of solutions for a class of nonlinear equations involving the square root of the Laplacian. Indeed, we will use variational methods for smooth functionals, defined on reflexive Banach spaces, in order to achieve the existence of at least three solutions for the equations. Moreover, assuming that the nonlinear terms are nonnegative, we will prove that the solutions are nonnegative. Finally, by presenting an example, we will ensure the applicability of our results.


An Unsupervised Machine Learning Algorithm For Clustering Low Dimensional Data Points In Euclidean Grid Space, Josef Lazar Jan 2024

An Unsupervised Machine Learning Algorithm For Clustering Low Dimensional Data Points In Euclidean Grid Space, Josef Lazar

Senior Projects Spring 2024

Clustering algorithms provide a useful method for classifying data. The majority of well known clustering algorithms are designed to find globular clusters, however this is not always desirable. In this senior project I present a new clustering algorithm, GBCN (Grid Box Clustering with Noise), which applies a box grid to points in Euclidean space to identify areas of high point density. Points within the grid space that are in adjacent boxes are classified into the same cluster. Conversely, if a path from one point to another can only be completed by traversing an empty grid box, then they are classified …


Ms Environmental Biology Capstone Project, Denise Corona Jan 2024

Ms Environmental Biology Capstone Project, Denise Corona

Regis University Student Publications (comprehensive collection)

Land-use change (LUC) is a key driver of biodiversity loss, altering the structure and function of ecosystems through human activities such as urbanization and agriculture. This change has led to habitat loss and fragmentation, resulting in the rapid decline of avian populations globally. Wildlife rehabilitation centers are the primary responders for injured birds and their records provide valuable data to monitor potential factors impacting bird populations. However, these datasets are underutilized in research. This study examined how LUC in the Front Range affects the likelihood and circumstances of admission of injured birds to the Rocky Mountain Wildlife Alliance (RMWA) in …


On Generative Models And Joint Architectures For Document-Level Relation Extraction, Aviv Brokman Jan 2024

On Generative Models And Joint Architectures For Document-Level Relation Extraction, Aviv Brokman

Theses and Dissertations--Statistics

Biomedical text is being generated at a high rate in scientific literature publications and electronic health records. Within these documents lies a wealth of potentially useful information in biomedicine. Relation extraction (RE), the process of automating the identification of structured relationships between entities within text, represents a highly sought-after goal in biomedical informatics, offering the potential to unlock deeper insights and connections from this vast corpus of data. In this dissertation, we tackle this problem with a variety of approaches.

We review the recent history of the field of document-level RE. Several themes emerge. First, graph neural networks dominate the …


Contrastive Learning, With Application To Forensic Identification Of Source, Cole Ryan Patten Jan 2024

Contrastive Learning, With Application To Forensic Identification Of Source, Cole Ryan Patten

Electronic Theses and Dissertations

Forensic identification of source problems often fall under the category of verification problems, where recent advances in deep learning have been made by contrastive learning methods. Many forensic identification of source problems deal with a scarcity of data, an issue addressed by few-shot learning. In this work, we make specific what makes a neural network a contrastive network. We then consider the use of contrastive neural networks for few-shot learning classification problems and compare them to other statistical and deep learning methods. Our findings indicate similar performance between models trained by contrastive loss and models trained by cross-entropy loss. We …


Forecasting The One Month Ahead Joint Distribution Of The Carhart Four Factor Model Using A Copula Method, Michael Nebor Jan 2024

Forecasting The One Month Ahead Joint Distribution Of The Carhart Four Factor Model Using A Copula Method, Michael Nebor

Graduate Research Theses & Dissertations

The Fama and French created factors of Market minus Risk Free (Mkt-RF), Small minusBig (SMB), High minus Low (HML), and the Carhart developed factor of Momentum (MOM) are frequently used together as a four factor model to explain the variation in returns of financial equities. This paper analyzes the joint distribution of these factors by forecasting the one month ahead joint distribution using various copula based methods. The VaR quantiles are calculated throughout the forecasted distribution at the .01, .05, .25, .50, .75, .95, and .99 quantiles. The forecasted distributions are compared against each other by analyzing the exceedances of …


The Effects Of Bureaucratic Corruption On The Financial Constraints Of Nigerian Small Businesses, Obinna Franklin Ezeibekwe Jan 2024

The Effects Of Bureaucratic Corruption On The Financial Constraints Of Nigerian Small Businesses, Obinna Franklin Ezeibekwe

Graduate Research Theses & Dissertations

In this paper, I conduct the first empirical analysis to examine the impact of bureaucratic corruption on small business financial constraints in Nigeria. This study is also the first to compare the average treatment effect (ATE) with the local average treatment effect (LATE) framework to account for potential variations in treatment effects for small Nigerian firms with less than 100 employees. Using the bivariate probit method and two binary instruments, I find that corruption significantly increases the likelihood of financial constraints for a typical micro, small, and medium enterprise (MSME) by approximately 64 to 68 percentage points. When I use …


Synergetic Effects Of Democracy And Economic Development On Income Inequality And The Role Of Relative Political Capacity, Nazif Sali Jan 2024

Synergetic Effects Of Democracy And Economic Development On Income Inequality And The Role Of Relative Political Capacity, Nazif Sali

CGU Theses & Dissertations

Diving into the complex dynamics of income inequality, this dissertation studies the multifaceted relationships between democracy, economic development, and inequality, placing a spotlight on the mediating influence of Relative Political Capacity (RPC). Uncovering the pivotal role of RPC as democracies advance and economies progress, this research navigates the contours of income inequality. By dissecting distinct sub-samples of OECD and non-OECD countries, nuanced insights surface. Employing robust methodologies such as ordinary least squares regressions and fixed effects analysis, I argue that targeted policies addressing inequality can not only foster inclusive economic growth but also fortify the foundations of democratic institutions. As …


Effects Of Voice Pitch On Social Perceptions Vary With Relational Mobility And Homicide Rate, Toe Aung, Et. Al Jan 2024

Effects Of Voice Pitch On Social Perceptions Vary With Relational Mobility And Homicide Rate, Toe Aung, Et. Al

Research Collection School of Social Sciences

Fundamental frequency (fo) is the most perceptually salient vocal acoustic parameter, yet little is known about how its perceptual influence varies across societies. We examined how fo affects key social perceptions and how socioecological variables modulate these effects in 2,647 adult listeners sampled from 44 locations across 22 nations. Low male fo increased men’s perceptions of formidability and prestige, especially in societies with higher homicide rates and greater relational mobility in which male intrasexual competition may be more intense and rapid identification of highstatus competitors may be exigent. High female fo increased women’s perceptions of flirtatiousness where relational mobility was …


Multiple Imputation For Robust Cluster Analysis To Address Missingness In Medical Data, Arnold Harder, Gayla R. Olbricht, Godwin Ekuma, Daniel B. Hier, Tayo Obafemi-Ajayi Jan 2024

Multiple Imputation For Robust Cluster Analysis To Address Missingness In Medical Data, Arnold Harder, Gayla R. Olbricht, Godwin Ekuma, Daniel B. Hier, Tayo Obafemi-Ajayi

Mathematics and Statistics Faculty Research & Creative Works

Cluster Analysis Has Been Applied To A Wide Range Of Problems As An Exploratory Tool To Enhance Knowledge Discovery. Clustering Aids Disease Subtyping, I.e. Identifying Homogeneous Patient Subgroups, In Medical Data. Missing Data Is A Common Problem In Medical Research And Could Bias Clustering Results If Not Properly Handled. Yet, Multiple Imputation Has Been Under-Utilized To Address Missingness, When Clustering Medical Data. Its Limited Integration In Clustering Of Medical Data, Despite The Known Advantages And Benefits Of Multiple Imputation, Could Be Attributed To Many Factors. This Includes Methodological Complexity, Difficulties In Pooling Results To Obtain A Consensus Clustering, Uncertainty Regarding …