Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Databases and Information Systems

Articles 6121 - 6150 of 6718

Full-Text Articles in Physical Sciences and Mathematics

Context-Aware Semantic Association Ranking, Boanerges Aleman-Meza, Chris Halaschek, I. Budak Arpinar, Amit P. Sheth Aug 2003

Kno.e.sis Publications

Discovering complex and meaningful relationships, which we call Semantic Associations, is an important challenge. Just as ranking of documents is a critical component of today's search engines, ranking of relationships will be essential in tomorrow's semantic search engines that would support discovery and mining of the Semantic Web. Building upon our recent work on specifying types of Semantic Associations in RDF graphs, which are possible to create through semantic metadata extraction and annotation, we discuss a framework where ranking techniques can be used to identify more interesting and more relevant Semantic Associations. Our techniques utilize alternative ways of specifying the …


Learning Mixture Models With The Latent Maximum Entropy Principle, Shaojun Wang, Dale Schuurmans, Fuchun Peng, Yunxin Zhao Aug 2003

Kno.e.sis Publications

We present a new approach to estimating mixture models based on a new inference principle we have proposed: the latent maximum entropy principle (LME). LME is different both from Jaynes’ maximum entropy principle and from standard maximum likelihood estimation. We demonstrate the LME principle by deriving new algorithms for mixture model estimation, and show how robust new variants of the EM algorithm can be developed. Our experiments show that estimation based on LME generally yields better results than maximum likelihood estimation, particularly when inferring latent variable models from small amounts of data.


Enemy At The Gate: Threats To Information Security, Michael E. Whitman Aug 2003

Faculty Articles

A firm can build more effective security strategies by identifying and ranking the severity of potential threats to its IS efforts.


Boltzmann Machine Learning With The Latent Maximum Entropy Principle, Shaojun Wang, Dale Schuurmans, Fuchun Peng, Yunxin Zhao Aug 2003

Kno.e.sis Publications

We present a new statistical learning paradigm for Boltzmann machines based on a new inference principle we have proposed: the latent maximum entropy principle (LME). LME is different both from Jaynes’ maximum entropy principle and from standard maximum likelihood estimation. We demonstrate the LME principle by deriving new algorithms for Boltzmann machine parameter estimation, and show how a robust and fast new variant of the EM algorithm can be developed. Our experiments show that estimation based on LME generally yields better results than maximum likelihood estimation, particularly when inferring hidden units from small amounts of data.


Public Commons For Geospatial Data: A Conceptual Model, Chakravarthy Namindi Sharad Aug 2003

Electronic Theses and Dissertations

A wide variety of spatial data collection efforts are ongoing throughout local, state and federal agencies, private firms and non-profit organizations. Each effort is established for a different purpose but organizations and individuals often collect and maintain the same or similar information. The United States federal government has undertaken many initiatives such as the National Spatial Data Infrastructure, the National Map and Geospatial One-Stop to reduce duplicative spatial data collection and promote the coordinated use, sharing, and dissemination of spatial data nationwide. A key premise in most of these initiatives is that no national government will be able to gather …


Towards A Role-Based Metadata Scheme For Educational Digital Libraries: A Case Study In Singapore, Dian Melati Md Ismail, Ming Yin, Yin-Leng Theng, Dion Hoe-Lian Goh, Ee Peng Lim Aug 2003

Research Collection School Of Computing and Information Systems

In this paper, we describe the development of an appropriate metadata scheme for GeogDL, a Web-based digital library application containing past-year examination resources for students taking a Singapore national examination in geography. The new metadata scheme was developed from established metadata schemes on education and e-learning. Initial evaluation showed that a role-based approach would be more viable, adapting to the different roles of teachers/educators and librarians contributing geography resources to GeogDL. The paper concludes with a concrete implementation of the role-based metadata schema for GeogDL.


A Visual Framework Invites Human Into The Clustering Process, Keke Chen, Ling Liu Jul 2003

Kno.e.sis Publications

Clustering is a technique commonly used in scientific research. The task of clustering inevitably involves human participation: the clustering is not finished when the computer/algorithm finishes, but only once the user has evaluated, understood and accepted the patterns. This defines a human-involved "clustering-analysis/evaluation" iteration. Instead of neglecting this human involvement, we provide a visual framework (VISTA) with all the power of algorithmic approaches (since their results can be visualized), and in addition we allow the user to steer/monitor/refine the clustering process with domain knowledge. The visual-rendering result also provides a precise pattern for fast post-processing.


Toward A Comprehensive Supplement For Language Courses, Krishnaprasad Thirunarayan, Stephen P. Carl Jul 2003

Kno.e.sis Publications

No abstract provided.


Event Based Retrieval From Digital Libraries Containing Data Streams, Mohamed Hamed Kholief Jul 2003

Computer Science Theses & Dissertations

The objective of this research is to study the issues involved in building a digital library that contains data streams and allows event-based retrieval. “Digital Libraries are storehouses of information available through the Internet that provide ways to collect, store, and organize data and make it accessible for search, retrieval, and processing” [29]. Data streams are sources of information for applications such as news-on-demand, weather services, and scientific research, to name a few. A data stream is a sequence of data units produced over a period of time. Examples of data streams are video streams, audio streams, and sensor readings. …


Adding Semantics To Web Services Standards, Kaarthik Sivashanmugam, Kunal Verma, Amit P. Sheth, John Miller Jun 2003

Kno.e.sis Publications

With the increasing growth in popularity of Web services, discovery of relevant Web services becomes a significant challenge. One approach is to develop semantic Web services whereby the Web services are annotated based on shared ontologies, and use these annotations for semantics-based discovery of relevant Web services. We discuss one such approach that involves adding semantics to WSDL using DAML+OIL ontologies. Our approach also uses UDDI to store these semantic annotations and search for Web services based on them. We compare our approach with another initiative to add semantics to support Web service discovery, and show how our approach …


Ladar-Based Detection And Tracking Of Moving Objects From A Ground Vehicle At High Speeds, Chieh-Chih Wang, Charles Thorpe, Arne Suppe Jun 2003

Research Collection School Of Computing and Information Systems

Detection and tracking of moving objects (DATMO) in crowded urban areas from a ground vehicle at high speeds is difficult because of a wide variety of targets and uncertain pose estimation from odometry and GPS/DGPS. In this paper we present a solution of the simultaneous localization and mapping (SLAM) with DATMO problem to accomplish this task using ladar sensors and odometry. With a precise pose estimate and a surrounding map from SLAM, moving objects are detected without a priori knowledge of the targets. The interacting multiple model (IMM) estimation algorithm is used for modeling the motion of a moving object …


Using Support Vector Machines For Terrorism Information Extraction, Aixin Sun, Myo-Myo Naing, Ee Peng Lim, Wai Lam Jun 2003

Research Collection School Of Computing and Information Systems

Information extraction (IE) is of great importance in many applications including web intelligence, search engines, text understanding, etc. To extract information from text documents, most IE systems rely on a set of extraction patterns. Each extraction pattern is defined based on the syntactic and/or semantic constraints on the positions of desired entities within natural language sentences. The IE systems also provide a set of pattern templates that determines the kind of syntactic and semantic constraints to be considered. In this paper, we argue that such pattern templates restrict the kind of extraction patterns that can be learned by IE systems. …


Adaptive Filters For Continuous Queries Over Distributed Data Stream, Chris Olston, Jing Jiang, Jennifer Widom Jun 2003

Research Collection School Of Computing and Information Systems

We consider an environment where distributed data sources continuously stream updates to a centralized processor that monitors continuous queries over the distributed data. Significant communication overhead is incurred in the presence of rapid update streams, and we propose a new technique for reducing the overhead. Users register continuous queries with precision requirements at the central stream processor, which installs filters at remote data sources. The filters adapt to changing conditions to minimize stream rates while guaranteeing that all continuous queries still receive the updates necessary to provide answers of adequate precision at all times. Our approach enables applications to trade …


Semantic Web Process Lifecycle: Role Of Semantics In Annotation, Discovery, Composition And Orchestration, Amit P. Sheth May 2003

Kno.e.sis Publications

No abstract provided.


Healthcare Enterprise Process Development And Integration, Kemafor Anyanwu, Amit P. Sheth, Jorge Cardoso, John A. Miller, Krzysztof J. Kochut May 2003

Kno.e.sis Publications

Healthcare enterprises involve complex processes that span diverse groups and organisations. These processes involve clinical and administrative tasks, large volumes of data, and large numbers of patients and personnel. The tasks can be performed either by humans or by automated systems. In the latter case, the tasks are supported by a variety of software applications and information systems which are very often heterogeneous, autonomous, and distributed. The development of systems to manage and automate these processes has increasingly played an important role in improving the efficiency of healthcare enterprises. In this paper we look at four healthcare and medical applications …


Exception Handling For Conflict Resolution In Cross-Organizational Workflows, Zongwei Luo, Amit P. Sheth, Krzysztof Kochut, I. Budak Arpinar May 2003

Kno.e.sis Publications

Workflow management systems (WfMSs) are being increasingly deployed to deliver e-business transactions across organizational boundaries. To ensure a high service quality in such transactions, exception-handling schemes for conflict resolution are needed. The conflicts primarily arise when a task fails during workflow execution because of failures in the underlying application or the controlling WfMS component, or because of insufficient user input. So far, little progress has been reported in addressing conflict resolution in cross-organizational business processes, though its importance has been recognized. In this paper, we identify the exception handling techniques that support conflict resolution in cross-organizational settings. In particular, we propose a novel, …


Ρ-Queries: Enabling Querying For Semantic Associations On The Semantic Web, Kemafor Anyanwu, Amit P. Sheth May 2003

Kno.e.sis Publications

This paper presents the notion of Semantic Associations as complex relationships between resource entities. These relationships capture both the connectivity of entities and the similarity of entities based on a specific notion of similarity called ρ-isomorphism. It formalizes these notions for the RDF data model, by introducing a notion of a Property Sequence as a type. In the context of a graph model such as that for RDF, Semantic Associations amount to certain graph signatures. Specifically, they refer to sequences (i.e. directed paths) here called Property Sequences, between entities, networks of Property Sequences (i.e. undirected paths), or subgraphs …


Genescene: Biomedical Text And Data Mining, Gondy Leroy, Hsinchun Chen, Jesse D. Martinez, Shauna Eggers, Ryan R. Falsey, Kerri L. Kislin, Zan Huang, Jiexun Li, Jie Xu, Daniel M. Mcdonald, Gavin Ng May 2003

CGU Faculty Publications and Research

To access the content of digital texts efficiently, it is necessary to provide more sophisticated access than keyword based searching. GeneScene provides biomedical researchers with research findings and background relations automatically extracted from text and experimental data. These provide a more detailed overview of the information available. The extracted relations were evaluated by qualified researchers and are precise. A qualitative ongoing evaluation of the current online interface indicates that this method to search the literature is more useful and efficient than keyword based searching.


Planning Your Way To A More Usable Web Site, Pamela Gore, Sandra Hirsh May 2003

Faculty Publications

Planning for long-term periodic usability assessment is therefore as important as regularly adding fresh content and tracking usage. Fortunately, usability assessments need not be time consuming or expensive, unless your site is large and complex and you want to test it thoroughly each time. In a practical sense, usability assessment can reveal problems in the design, navigation, layout, or labeling that prevent users from finding what they need quickly. After analyzing your environment and setting the stage for ongoing usability assessment, it is time to develop the usability assessment plan, which will serve as the blueprint for usability assessment activities …


The Integration Of Cadastral Base Mapping With Cadastral Parcel Attribution, Kurt B. Wurm May 2003

Electronic Theses and Dissertations

A cadastre is a parcel-based, up-to-date land information system containing a record of interests in land. Creation and maintenance of a cadastre usually involves coordination between different public and private organizations that are responsible for the various data. The U.S. Bureau of Land Management (BLM) has built a Geographic Coordinate Data Base (GCDB) that currently provides cadastral base map data for more than 38,000 townships across the country, with many of the western states nearly complete. The GCDB strategy is that the coordinates can and do change as more recent and accurate information becomes available. The locational reliability of the …


Guest Editorial: Text And Web Mining, Ah-Hwee Tan, Philip S. Yu May 2003

Research Collection School Of Computing and Information Systems

Text mining and web mining are two interrelated fields that have received a lot of attention in recent years. Text mining [1, 2] is concerned with the analysis of very large document collections and the extraction of hidden knowledge from text-based data. Web mining [3] refers to the analysis and mining of all web-related data, including web content, hyperlink structure, and web access statistics.


On Querying Geospatial And Georeferenced Metadata Resources In Gportal, Zehua Liu, Ee Peng Lim, Wee-Keong Ng, Dion Hoe-Lian Goh May 2003

Research Collection School Of Computing and Information Systems

G-Portal is a web portal system providing a range of digital library services to access geospatial and georeferenced resources on the Web. Among them are the storage and query subsystems that provide a central repository of metadata resources organized under different projects. In G-Portal, all metadata resources are represented in XML (Extensible Markup Language) and they are compliant with some resource schemas defined by their creators. The resource schemas are extended versions of a basic resource schema making it easy to accommodate all kinds of metadata resources while maintaining the portability of resource data. To support queries over the geospatial …


On Machine Learning Methods For Chinese Document Classification, Ji He, Ah-Hwee Tan, Chew-Lim Tan May 2003

Research Collection School Of Computing and Information Systems

This paper reports our comparative evaluation of three machine learning methods, namely k Nearest Neighbor (kNN), Support Vector Machines (SVM), and Adaptive Resonance Associative Map (ARAM) for Chinese document categorization. Based on two Chinese corpora, a series of controlled experiments evaluated their learning capabilities and efficiency in mining text classification knowledge. Benchmark experiments showed that their predictive performance was roughly comparable, especially on clean and well organized data sets. While kNN and ARAM yielded better performance than SVM on small and clean data sets, SVM and ARAM significantly outperformed kNN on noisy data. Comparing efficiency, kNN was notably more costly …


Ua8 Ssn Protection Committee Recommendations, Wku Information Technology Apr 2003

WKU Archives Records

Recommendations of the Social Security Number Protection Committee.


Search And Recovery Of The Space Shuttle Columbia: A Geospatial 1st Responder Perspective, Jeffrey M. Williams Apr 2003

Faculty Publications

A first-person account of the Texas geospatial volunteers and their efforts to recover the remains of the Space Shuttle Columbia and her crew, lost over eastern Texas and western Louisiana on February 1st, 2003.


Identifying Patterns In Dna Change, Jason R. Gilder, Dan E. Krane, Travis E. Doom, Michael L. Raymer Apr 2003

Kno.e.sis Publications

Now that a draft sequence of the human genome is nearly complete, questions regarding both the information contained within our genetic blueprints as well as the manner in which that information content changes over time can be addressed in ways that had not previously been possible. By their very nature, some of the nucleotide sequences present within our genome allow detailed examination of the mode and pattern of evolution that has shaped our genetic instructions over time spans of tens of millions of years. Alu repeats are one example. Using these relatively short, ubiquitous DNA sequences we explore the problem …


Efficient Native Xml Storage System (Enaxs), Khin-Myo Win, Wee-Keong Ng, Ee Peng Lim Apr 2003

Research Collection School Of Computing and Information Systems

XML is a self-describing meta-language and is fast emerging as a dominant standard for Web data exchange among various applications. With the tremendous growth of XML documents, an efficient storage system is required to manage them. The conventional databases, which require all data to adhere to an explicitly specified rigid schema, are unable to provide efficient storage for tree-structured XML documents. A new data model that is specifically designed for XML documents is required. In this paper, we propose a new storage system, named Efficient Native XML Storage System (ENAXS), for large and complex XML documents. ENAXS stores all XML …


Parallel Implementation Of A Face Recognition [Sic] System Based On Modular Pca Approach, Rajkiran Gottumukkal Apr 2003

Electrical & Computer Engineering Theses & Dissertations

This thesis describes research in automated methods for the recognition of human faces. The research is driven by the need to design a method that would ensure high accuracy under the conditions of facial expression, illumination and pose variations. The resulting method is able to cope with the uncontrolled nature of facial expression, illumination and head rotations. The main novelty of this work is the idea that some of the local facial features do not vary even when the facial expression, illumination and pose vary. This idea is applied to the existing principal component analysis (PCA) method to arrive at a …


Ontology Driven Information Systems In Action (Capturing And Applying Existing Knowledge To Semantic Applications), Amit P. Sheth Mar 2003

Kno.e.sis Publications

No abstract provided.


Cio Lateral Influence Behaviors: Gaining Peers' Commitment To Strategic Information Systems, Harvey Enns, Sid L. Huff, Christopher A. Higgins Mar 2003

MIS/OM/DS Faculty Publications

In order to develop and bring to fruition strategic information systems (SIS) projects, chief information officers (CIOs) must be able to effectively influence their peers. This research examines the relationship between CIO influence behaviors and the successfulness of influence outcomes, utilizing a revised model initially developed by Yukl (1994). Focused interviews were first conducted with CIOs and their peers to gain insights into the phenomenon. A survey instrument was then developed and distributed to a sample of CIO and peer executive pairs to gather data with which to test a research model. A total of 69 pairs of surveys were …