Automatic detection of significant features and event timeline construction from temporally tagged data
dc.contributor.author | Erande, Abhijit | |
dc.date.accessioned | 2009-08-14T19:34:41Z | |
dc.date.available | 2009-08-14T19:34:41Z | |
dc.date.graduationmonth | August | |
dc.date.issued | 2009-08-14T19:34:41Z | |
dc.date.published | 2009 | |
dc.description.abstract | The goal of my project is to summarize large volumes of data and help users visualize how events have unfolded over time. I address the problem of extracting overview terms from a time-tagged corpus of data and discuss some previous work conducted in this area. I use a statistical approach to automatically extract key terms, form groupings of related terms, and display the resultant groups on a timeline. I use a static corpus composed of news stories, as opposed to an on-line setting where continual additions to the corpus are being made. Terms are extracted using a Named Entity Recognizer, and the importance of a term is determined using the χ² measure. My approach does not address the problem of associating time and date stamps with data, and is restricted to corpora that have been explicitly tagged. The quality of the results is gauged both subjectively and objectively, by measuring the degree to which events known to exist in the corpus were identified by the system. | |
dc.description.advisor | William H. Hsu | |
dc.description.degree | Master of Science | |
dc.description.department | Department of Computing and Information Sciences | |
dc.description.level | Masters | |
dc.identifier.uri | http://hdl.handle.net/2097/1675 | |
dc.language.iso | en_US | |
dc.publisher | Kansas State University | |
dc.rights | © the author. This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | Timeline generation | |
dc.subject | Feature extraction | |
dc.subject.umi | Computer Science (0984) | |
dc.title | Automatic detection of significant features and event timeline construction from temporally tagged data | |
dc.type | Report |