Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Databases and Information Systems

Institution
Keyword
Publication Year
Publication
Publication Type
File Type

Articles 2311 - 2340 of 6720

Full-Text Articles in Physical Sciences and Mathematics

Modular Scheduling System For Westside School District, Tyler Bienhoff Apr 2018

Modular Scheduling System For Westside School District, Tyler Bienhoff

Honors Theses

Westside School district offers a modular scheduling system for their high school that is more similar to a college schedule than the typical high school system. Due to the complexity of their master schedule each semester, there are no commercially available products that can assist in creating a schedule. Hence, this thesis discusses a scheduling algorithm and management system that was built specifically for Westside High School with the potential to be expanded for use by other interested schools. The first part of the paper is focused on gathering input from students and faculty for which courses and how many …


User-Centric Privacy Preservation In Mobile And Location-Aware Applications, Mingming Guo Apr 2018

User-Centric Privacy Preservation In Mobile And Location-Aware Applications, Mingming Guo

FIU Electronic Theses and Dissertations

The mobile and wireless community has brought a significant growth of location-aware devices including smart phones, connected vehicles and IoT devices. The combination of location-aware sensing, data processing and wireless communication in these devices leads to the rapid development of mobile and location-aware applications. Meanwhile, user privacy is becoming an indispensable concern. These mobile and location-aware applications, which collect data from mobile sensors carried by users or vehicles, return valuable data collection services (e.g., health condition monitoring, traffic monitoring, and natural disaster forecasting) in real time. The sequential spatial-temporal data queries sent by users provide their location trajectory information. The …


Supporting User Evaluation Of Messaging Interactions With Potential Romantic Partners Discovered Online, Douglas Zytko Apr 2018

Supporting User Evaluation Of Messaging Interactions With Potential Romantic Partners Discovered Online, Douglas Zytko

Dissertations

Online dating systems have transformed the way people pursue romance. To arrive at a decision to meet for a face-to-face date, users gather information about each other online pertinent to romantic attraction. Yet sometimes they discover on the date that they made the wrong choice. One aspect of online dating system-use that may be a contributing factor, but is largely overlooked in the literature, is interaction through text-based messaging interfaces. This dissertation explores how messaging interactions inform face-to-face meeting decisions through two qualitative studies, and explores through a mixed methods field study how innovative messaging interfaces that embody theory from …


Cross-Platform Openhds Web Application, Nick Littlefield Apr 2018

Cross-Platform Openhds Web Application, Nick Littlefield

Thinking Matters Symposium Archive

A proof of concept cross platform data collection application for Demographic and Health studies has been developed. This application allows for the collection of information about locations, individuals, births, and deaths within a region. This application can be run on both mobile devices, laptops, and desktops.


Surveying Digital Collections Stewardship In Nebraska [Original Survey Form], Jennifer L. Thoegersen, Blake Graham Apr 2018

Surveying Digital Collections Stewardship In Nebraska [Original Survey Form], Jennifer L. Thoegersen, Blake Graham

University of Nebraska-Lincoln Data Repository

No abstract provided.


A Feasibility Study On Crowdsourcing To Monitor Municipal Resources In Smart Cities, Thivya Kandappu, Archan Misra, Ming Hui, Desmond (Xu Minghui) Koh, Randy Tandriansyah Daratan, Nikita Jaiman Apr 2018

A Feasibility Study On Crowdsourcing To Monitor Municipal Resources In Smart Cities, Thivya Kandappu, Archan Misra, Ming Hui, Desmond (Xu Minghui) Koh, Randy Tandriansyah Daratan, Nikita Jaiman

Research Collection School Of Computing and Information Systems

Active citizenry, whereby citizens actively participate inreporting and addressing challenges in urban service delivery is a strategic goalof smart cities such as Singapore. In spite of the promise, we believe that thesuccess of such large-scale nation-wide crowdsourcing deployments depend on thereal-word user preferences and behavioral characteristics of citizens. In thispaper, we first present our findings on behavioral preferences and key concernsof citizens regarding smart-city services via an opinion survey conducted with 1300participants. We then propose a “citizen-controlled” urban services reportingplatform where citizens actively report on the status of various municipalresources. We advocate the importance of matching user mobility patternsagainst task …


Environmental Restoration Database, Joao Nascimento Apr 2018

Environmental Restoration Database, Joao Nascimento

Graduate Theses & Non-Theses

Environmental restoration projects face many challenges. Public awareness, funding constraints, unpredictable weather, unknown biological/chemical factors and the uncertainty about how the targeted ecosystem will develop work against the planned and ideal restoration.

One way the projects’ efficiency can be improved is by using software tools for data and quality management systems, in order to share information, make field practice follow rules, keep track of maintenance tasks, measure results and, therefore, increase the rate of success by the amount of resources invested.

Since the conception of every project, all resources involved need to be focused and coherent to the final restoration …


Pccf: Periodic And Continual Temporal Co-Factorization For Recommender Systems, Guibing Guo, Feida Zhu, Shilin Qu, Xingwei Wang Apr 2018

Pccf: Periodic And Continual Temporal Co-Factorization For Recommender Systems, Guibing Guo, Feida Zhu, Shilin Qu, Xingwei Wang

Research Collection School Of Computing and Information Systems

Rating-only collaborative filtering has been extensively studied for decades with great improvements achieved in predicting a user’s preference on a target item at a particular time point. Yet, it remains a research challenge on how to capture users’ rating patterns which may drift over time. In this article, we propose a time-aware matrix co-factorization model, called PCCF, which considers two types of temporal effects, i.e., periodic and continual. Specifically, periodic effects refer to the impact of discrete periodic time slices with which users’ preferences may be associated, and continual effects refer to the impact of continuous gradual time over which …


The Role Of Urban Mobility In Retail Business Survival, Krittika D'Silva, Kasthuri Jayarajah, Anastasios Noulas, Cecilia Mascolo, Archan Misra Apr 2018

The Role Of Urban Mobility In Retail Business Survival, Krittika D'Silva, Kasthuri Jayarajah, Anastasios Noulas, Cecilia Mascolo, Archan Misra

Research Collection School Of Computing and Information Systems

Economic and urban planning agencies have strong interest in tackling the hard problem of predicting the odds of survival of individual retail businesses. In this work, we tap urban mobility data available both from a location-based intelligence platform, Foursquare, and from public transportation agencies, and investigate whether mobility-derived features can help foretell the failure of such retail businesses, over a 6 month horizon, across 10 distinct cities spanning the globe. We hypothesise that the survival of such a retail outlet is correlated with not only venue-specific characteristics but also broader neighbourhood-level effects. Through careful statistical analysis of Foursquare and taxi …


Social Network Monitoring For Bursty Cascade Detection, Wei Xie, Feida Zhu, Jing Xiao, Jianzong Wang Apr 2018

Social Network Monitoring For Bursty Cascade Detection, Wei Xie, Feida Zhu, Jing Xiao, Jianzong Wang

Research Collection School Of Computing and Information Systems

Social network services have become important and efficient platforms for users to share all kinds of information. The capability to monitor user-generated information and detect bursts from information diffusions in these social networks brings value to a wide range of real-life applications, such as viral marketing. However, in reality, as a third party, there is always a cost for gathering information from each user or so-called social network sensor. The question then arises how to select a budgeted set of social network sensors to form the data stream for burst detection without compromising the detection performance. In this article, we …


Detect Rumor And Stance Jointly By Neural Multi-Task Learning, Jing Ma, Wei Gao, Kam-Fai Wong Apr 2018

Detect Rumor And Stance Jointly By Neural Multi-Task Learning, Jing Ma, Wei Gao, Kam-Fai Wong

Research Collection School Of Computing and Information Systems

In recent years, an unhealthy phenomenon characterized as the massive spread of fake news or unverified information (i.e., rumors) has become increasingly a daunting issue in human society. The rumors commonly originate from social media outlets, primarily microblogging platforms, being viral afterwards by the wild, willful propagation via a large number of participants. It is observed that rumorous posts often trigger versatile, mostly controversial stances among participating users. Thus, determining the stances on the posts in question can be pertinent to the successful detection of rumors, and vice versa. Existing studies, however, mainly regard rumor detection and stance classification as …


Distributed Multi-Task Classification: A Decentralized Online Learning Approach, Chi Zhang, Peilin Zhao, Shuji Hao, Yeng Chai Soh, Bu Sung Lee, Chunyan Miao, Steven C. H. Hoi Apr 2018

Distributed Multi-Task Classification: A Decentralized Online Learning Approach, Chi Zhang, Peilin Zhao, Shuji Hao, Yeng Chai Soh, Bu Sung Lee, Chunyan Miao, Steven C. H. Hoi

Research Collection School Of Computing and Information Systems

Although dispersing one single task to distributed learning nodes has been intensively studied by the previous research, multi-task learning on distributed networks is still an area that has not been fully exploited, especially under decentralized settings. The challenge lies in the fact that different tasks may have different optimal learning weights while communication through the distributed network forces all tasks to converge to an unique classifier. In this paper, we present a novel algorithm to overcome this challenge and enable learning multiple tasks simultaneously on a decentralized distributed network. Specifically, the learning framework can be separated into two phases: (i) …


Every Step You Take, I’Ll Be Watching You: Practical Stepauth-Entication Of Rfid Paths, Kai Bu, Yingjiu Li Apr 2018

Every Step You Take, I’Ll Be Watching You: Practical Stepauth-Entication Of Rfid Paths, Kai Bu, Yingjiu Li

Research Collection School Of Computing and Information Systems

Path authentication thwarts counterfeits in RFID-based supply chains. Its motivation is that tagged products taking invalid paths are likely faked and injected by adversaries at certain supply chain partners/steps. Existing solutions are path-grained in that they simply regard a product as genuine if it takes any valid path. Furthermore, they enforce distributed authentication by offloading the sets of valid paths to some or all steps from a centralized issuer. This not only imposes network and storage overhead but also leaks transaction privacy. We present StepAuth, the first step-grained path authentication protocol that is practically efficient for authenticating products with strict …


Does Journaling Encourage Healthier Choices? Analyzing Healthy Eating Behaviors Of Food Journalers, Palakorn Achananuparp, Ee Peng Lim, Vibhanshu Abhishek Apr 2018

Does Journaling Encourage Healthier Choices? Analyzing Healthy Eating Behaviors Of Food Journalers, Palakorn Achananuparp, Ee Peng Lim, Vibhanshu Abhishek

Research Collection School Of Computing and Information Systems

Past research has shown the benefits of food journaling in promoting mindful eating and healthier food choices. However, the links between journaling and healthy eating have not been thoroughly examined. Beyond caloric restriction, do journalers consistently and sufficiently consume healthful diets? How different are their eating habits compared to those of average consumers who tend to be less conscious about health? In this study, we analyze the healthy eating behaviors of active food journalers using data from MyFitnessPal. Surprisingly, our findings show that food journalers do not eat as healthily as they should despite their proclivity to health eating and …


Exploiting User And Venue Characteristics For Fine-Grained Tweet Geolocation, Wen Haw Chong, Ee Peng Lim Apr 2018

Exploiting User And Venue Characteristics For Fine-Grained Tweet Geolocation, Wen Haw Chong, Ee Peng Lim

Research Collection School Of Computing and Information Systems

Which venue is a tweet posted from? We call this a fine-grained geolocation problem. Given an observed tweet, the task is to infer its discrete posting venue, e.g., a specific restaurant. This recovers the venue context and differs from prior work, which geolocats tweets to location coordinates or cities/neighborhoods. First, we conduct empirical analysis to uncover venue and user characteristics for improving geolocation. For venues, we observe spatial homophily, in which venues near each other have more similar tweet content (i.e., text representations) compared to venues further apart. For users, we observe that they are spatially focused and more likely …


The Impact Of Rapid Release Cycles On The Integration Delay Of Fixed Issues, Daniel Alencar Da Costa, Shane Mcintosh, Christoph Treude, Uirá Kulesza, Ahmed E. Hassan Apr 2018

The Impact Of Rapid Release Cycles On The Integration Delay Of Fixed Issues, Daniel Alencar Da Costa, Shane Mcintosh, Christoph Treude, Uirá Kulesza, Ahmed E. Hassan

Research Collection School Of Computing and Information Systems

The release frequency of software projects has increased in recent years. Adopters of so-called rapid releases—short release cycles, often on the order of weeks, days, or even hours—claim that they can deliver fixed issues (i.e., implemented bug fixes and new features) to users more quickly. However, there is little empirical evidence to support these claims. In fact, our prior work shows that code integration phases may introduce delays for rapidly releasing projects—98% of the fixed issues in the rapidly releasing Firefox project had their integration delayed by at least one release. To better understand the impact that rapid release cycles …


Foundations Of Health Information Technology (Undergraduate) Course Materials, Chi Zhang Apr 2018

Foundations Of Health Information Technology (Undergraduate) Course Materials, Chi Zhang

Computer Science and Information Technology Ancillary Materials

This is a collection of all materials used in Health Information Technology by Dr. Chi Zhang at Kennesaw State University, including lecture slides, assignments, and assessments, including a question bank.

Topics covered include:

  • Clinical Financial Records
  • Evidence-Based Medicine
  • e-Prescribing
  • Patient Bedside Systems
  • Telemedicine
  • Health Information Networks
  • Cryptography
  • Accreditation
  • HIPAA Privacy and Security


A Sliding-Window Framework For Representative Subset Selection, Yanhao Wang, Yuchen Li, Kian-Lee Tan Apr 2018

A Sliding-Window Framework For Representative Subset Selection, Yanhao Wang, Yuchen Li, Kian-Lee Tan

Research Collection School Of Computing and Information Systems

Representative subset selection (RSS) is an important tool for users to draw insights from massive datasets. A common approach is to model RSS as the submodular maximization problem because the utility of extracted representatives often satisfies the "diminishing returns" property. To capture the data recency issue and support different types of constraints in real-world problems, we formulate RSS as maximizing a submodular function subject to a d-knapsack constraint (SMDK) over sliding windows. Then, we propose a novel KnapWindow framework for SMDK. Theoretically, KnapWindow is 1-ε/1+d - approximate for SMDK and achieves sublinear complexity. Finally, we evaluate the efficiency and effectiveness …


A Novel Representation And Compression For Queries On Trajectories In Road Networks, Xiaochun Yang, Bin Wang, Kai Yang, Chengfei Liu, Baihua Zheng Apr 2018

A Novel Representation And Compression For Queries On Trajectories In Road Networks, Xiaochun Yang, Bin Wang, Kai Yang, Chengfei Liu, Baihua Zheng

Research Collection School Of Computing and Information Systems

Recording and querying time-stamped trajectories incurs high cost of data storage and computing. In this paper, we explore several characteristics of the trajectories in road mbox{networks}, which have motivated the idea of coding trajectories by associating timestamps with relative spatial path and locations. Such a representation contains large number of duplicate information to achieve a lower entropy compared with the existing representations, thereby drastically cutting the storage cost. We propose several techniques to compress spatial path and locations separately, which can support fast positioning and achieve better compression ratio. For locations, we propose two novel encoding schemes such that the …


Combination Forecasting Reversion Strategy For Online Portfolio Selection, Dingjiang Huang, Shunchang Yu, Bin Li, Steven C. H. Hoi, Shuigeng G. Zhou Apr 2018

Combination Forecasting Reversion Strategy For Online Portfolio Selection, Dingjiang Huang, Shunchang Yu, Bin Li, Steven C. H. Hoi, Shuigeng G. Zhou

Research Collection School Of Computing and Information Systems

Machine learning and artificial intelligence techniques have been applied to construct online portfolio selection strategies recently. A popular and state-of-the-art family of strategies is to explore the reversion phenomenon through online learning algorithms and statistical prediction models. Despite gaining promising results on some benchmark datasets, these strategies often adopt a single model based on a selection criterion (e.g., breakdown point) for predicting future price. However, such model selection is often unstable and may cause unnecessarily high variability in the final estimation, leading to poor prediction performance in real datasets and thus non-optimal portfolios. To overcome the drawbacks, in this article, …


'Is More Better?': Impact Of Multiple Photos On Perception Of Persona Profiles, Joni Salminen, Lene Nielsen, Soon-Gyo Jung, Jisun An, Haewoon Kwak, Bernard J. Jansen Apr 2018

'Is More Better?': Impact Of Multiple Photos On Perception Of Persona Profiles, Joni Salminen, Lene Nielsen, Soon-Gyo Jung, Jisun An, Haewoon Kwak, Bernard J. Jansen

Research Collection School Of Computing and Information Systems

In this research, we investigate if and how more photos than a single headshot can heighten the level of information provided by persona profiles. We conduct eye-tracking experiments and qualitative interviews with variations in the photos: a single headshot, a headshot and images of the persona in different contexts, and a headshot with pictures of different people representing key persona attributes. The results show that more contextual photos significantly improve the information end users derive from a persona profile; however, showing images of different people creates confusion and lowers the informativeness. Moreover, we discover that choice of pictures results in …


Continuous Top-K Monitoring On Document Streams (Extended Abstract), Leong Hou U, Junjie Zhang, Kyriakos Mouratidis, Ye Li Apr 2018

Continuous Top-K Monitoring On Document Streams (Extended Abstract), Leong Hou U, Junjie Zhang, Kyriakos Mouratidis, Ye Li

Research Collection School Of Computing and Information Systems

The efficient processing of document streams plays an important role in many information filtering systems. Emerging applications, such as news update filtering and social network notifications, demand presenting end-users with the most relevant content to their preferences. In this work, user preferences are indicated by a set of keywords. A central server monitors the document stream and continuously reports to each user the top-k documents that are most relevant to her keywords. The objective is to support large numbers of users and high stream rates, while refreshing the topk results almost instantaneously. Our solution abandons the traditional frequency-ordered indexing approach, …


Domain-Specific Cross-Language Relevant Question Retrieval, Bowen Xu, Zhenchang Xing, Xin Xia, David Lo, Shanping Li Apr 2018

Domain-Specific Cross-Language Relevant Question Retrieval, Bowen Xu, Zhenchang Xing, Xin Xia, David Lo, Shanping Li

Research Collection School Of Computing and Information Systems

Chinese developers often cannot effectively search questions in English, because they may have difficulties in translating technical words from Chinese to English and formulating proper English queries. For the purpose of helping Chinese developers take advantage of the rich knowledge base of Stack Overflow and simplify the question retrieval process, we propose an automated cross-language relevant question retrieval (CLRQR) system to retrieve relevant English questions for a given Chinese question. CLRQR first extracts essential information (both Chinese and English) from the title and description of the input Chinese question, then performs domain-specific translation of the essential Chinese information into English, …


Feature Detection In Medical Images Using Deep Learning, Anthony Pasquarelli Apr 2018

Feature Detection In Medical Images Using Deep Learning, Anthony Pasquarelli

Honors Projects in Information Systems and Analytics

This project explores the use of deep learning to predict age based on pediatric hand X-Rays. Data from the Radiological Society of North America’s pediatric bone age challenge were used to train and evaluate a convolutional neural network. The project used InceptionV3, a CNN developed by Google, that was pre-trained on ImageNet, a popular online image dataset. Our fine-tuned version of InceptionV3 yielded an average error of less than 10 months between predicted and actual age. This project shows the effectiveness of deep learning in analyzing medical images and the potential for even greater improvements in the future. In addition …


Eat & Tell: A Randomized Trial Of Random-Loss Incentive To Increase Dietary Self-Tracking Compliance, Palakorn Achananuparp, Ee Peng Lim, Vibhanshu Abhishek, Tianjiao Yun Apr 2018

Eat & Tell: A Randomized Trial Of Random-Loss Incentive To Increase Dietary Self-Tracking Compliance, Palakorn Achananuparp, Ee Peng Lim, Vibhanshu Abhishek, Tianjiao Yun

Research Collection School Of Computing and Information Systems

A growing body of evidence has shown that incorporating behavioral economics principles into the design of financial incentive programs helps improve their cost-effectiveness, promote individuals' short-term engagement, and increase compliance in health behavior interventions. Yet, their effects on long-term engagement have not been fully examined. In study designs where repeated administration of incentives is required to ensure the regularity of behaviors, the effectiveness of subsequent incentives may decrease as a result of the law of diminishing marginal utility. In this paper, we introduce random-loss incentive-a new financial incentive based on loss aversion and unpredictability principles-to address the problem of individuals' …


A Data-Driven Analysis Of Workers' Earnings On Amazon Mechanical Turk, Kotaro Hara, Abigail Adams, Kristy Milland, Saiph Savage, Chris Callison-Burch, Jeffrey P. Bigham Apr 2018

A Data-Driven Analysis Of Workers' Earnings On Amazon Mechanical Turk, Kotaro Hara, Abigail Adams, Kristy Milland, Saiph Savage, Chris Callison-Burch, Jeffrey P. Bigham

Research Collection School Of Computing and Information Systems

A growing number of people are working as part of on-line crowd work. Crowd work is often thought to be low wage work. However, we know little about the wage distribution in practice and what causes low/high earnings in this setting. We recorded 2,676 workers performing 3.8 million tasks on Amazon Mechanical Turk. Our task-level analysis revealed that workers earned a median hourly wage of only ~$2/h, and only 4% earned more than $7.25/h. While the average requester pays more than $11/h, lower-paying requesters post much more work. Our wage calculations are influenced by how unpaid work is accounted for, …


Octopus: An Online Topic-Aware Influence Analysis System For Social Networks, Ju Fan, Jiarong Qiu, Yuchen Li, Qingfei Meng, Dongxiang Zhang, Guoliang Li, Kian-Lee Tan, Xiaoyong Du Apr 2018

Octopus: An Online Topic-Aware Influence Analysis System For Social Networks, Ju Fan, Jiarong Qiu, Yuchen Li, Qingfei Meng, Dongxiang Zhang, Guoliang Li, Kian-Lee Tan, Xiaoyong Du

Research Collection School Of Computing and Information Systems

The wide adoption of social networks has brought a new demand on influence analysis. This paper presents OCTOPUS that offers social network users and analysts valuable insights through topic-aware social influence analysis services. OCTOPUS has the following novel features. First, OCTOPUS provides a user-friendly interface that allows users to employ simple and easy-to-use keywords to perform influence analysis. Second, OCTOPUS provides three powerful keyword-based topic-aware influence analysis tools: keyword-based influential user discovery, personalized influential keywords suggestion, and interactive influential paths exploration. These tools can not only discover influential users, but also provide insights on how the users influence the network. …


Location-Aware Influence Maximization Over Dynamic Social Streams, Yanhao Wang, Yuchen Li, Ju Fan, Kianlee Tan Apr 2018

Location-Aware Influence Maximization Over Dynamic Social Streams, Yanhao Wang, Yuchen Li, Ju Fan, Kianlee Tan

Research Collection School Of Computing and Information Systems

Influence maximization (IM), which selects a set of k seed users (a.k.a., a seed set) to maximize the influence spread over a social network, is a fundamental problem in a wide range of applications. However, most existing IM algorithms are static and location-unaware. They fail to provide high-quality seed sets efficiently when the social network evolves rapidly and IM queries are location-aware. In this article, we first define two IM queries, namely Stream Influence Maximization (SIM) and Location-aware SIM (LSIM), to track influential users over social streams. Technically, SIM adopts the sliding window model and maintains a seed set with …


Persona Perception Scale: Developing And Validating An Instrument For Human-Like Representations Of Data, Salminen Joni, Haewoon Kwak, João Santos, Soon-Gyo Jung, Jisun An, Bernard J. Jansen Apr 2018

Persona Perception Scale: Developing And Validating An Instrument For Human-Like Representations Of Data, Salminen Joni, Haewoon Kwak, João Santos, Soon-Gyo Jung, Jisun An, Bernard J. Jansen

Research Collection School Of Computing and Information Systems

Personas are widely used in software development, system design, and HCI studies. Yet, their evaluation is difficult, and there are no recognized and validated measurement scales to date. To improve this condition, this research develops a persona perception scale based on reviewing relevant literature. We validate the scale through a pilot study with 19 participants, each evaluating three personas (57 evaluations in total). This is the first reported effort to systematically develop and validate an instrument for persona perception measurement. We find the constructs and items of the scale perform well, with factor loadings ranging between 0.60 and 0.95. Reliability, …


The Application Of Text Mining And Data Visualization Techniques To Textual Corpus Exploration, Jeffrey R. Smith Jr. Mar 2018

The Application Of Text Mining And Data Visualization Techniques To Textual Corpus Exploration, Jeffrey R. Smith Jr.

Theses and Dissertations

Unstructured data in the digital universe is growing rapidly and shows no evidence of slowing anytime soon. With the acceleration of growth in digital data being generated and stored on the World Wide Web, the prospect of information overload is much more prevalent now than it has been in the past. As a preemptive analytic measure, organizations across many industries have begun implementing text mining techniques to analyze such large sources of unstructured data. Utilizing various text mining techniques such as n -gram analysis, document and term frequency analysis, correlation analysis, and topic modeling methodologies, this research seeks to develop …