Querying semantically heterogeneous data sources using ontologies

dc.contributor.authorBreed, Aditi
dc.date.accessioned2008-12-19T14:34:13Z
dc.date.available2008-12-19T14:34:13Z
dc.date.graduationmonthDecemberen
dc.date.issued2008-12-19T14:34:13Z
dc.date.published2008en
dc.description.abstractIn recent years, we have witnessed a significant increase in the number, size and diversity of the available data sources in many application domains. Data sources in a particular domain are autonomously created and maintained, and therefore distributed and semantically heterogeneous. In this thesis, we focused on the problem of querying such semantically heterogeneous data sources from a user's perspective. We approach this problem by using the concepts of ontologies and mappings between ontologies. A system for answering queries in a transparent way to the user has been designed and implemented. The main components of this system are an ontology mapping algorithm that maps user ontologies to data source ontologies, and a query processing engine that maps user queries to queries that can be answered by the data sources in the system. We have shown that machine learning algorithms can also be incorporated in the system, thus making it possible to learn machine learning classifiers (in particular, generative models such as Naïve Bayes) from distributed, semantically heterogeneous data sources. Because many data sources today are relational in nature, in this work we have dealt specifically with relational data sources, as opposed to flat files, XML or object oriented data sources. However, our system can be easily extended to other types of data sources.en
dc.description.advisorDoina Carageaen
dc.description.degreeMaster of Scienceen
dc.description.departmentDepartment of Computing and Information Sciencesen
dc.description.levelMastersen
dc.identifier.urihttp://hdl.handle.net/2097/1089
dc.language.isoen_USen
dc.publisherKansas State Universityen
dc.subjectOntologiesen
dc.subjectQueryingen
dc.subjectSemantically Heterogeneousen
dc.subjectRelational Data Sourcesen
dc.subjectProtegeen
dc.subjectOracleen
dc.subject.umiComputer Science (0984)en
dc.titleQuerying semantically heterogeneous data sources using ontologiesen
dc.typeThesisen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
AditiBreed2008.pdf
Size:
640.97 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: