Paper
12 March 2002 Context-sensitive keyword selection using text data mining
Sai-Ming Li, Sanjeev Seereeram, Raman K. Mehra, Chris Miles
Author Affiliations +
Abstract
Most information retrieval systems rely on the user to provide a set of keywords that the retrieved documents should contain. However, when the objective is to search for documents that is similar to a given document, the system has to choose the keywords from that document first. Automatic selection of keywords is not a trivial task as one word may be a keyword in one context but a very common word in others, and require significant domain specific knowledge. In this paper we describe a method for choosing keywords from a document within a given corpus automatically using text data-mining technique. The key idea is to score the words within the document based on the clustering result of the entire corpus. We applied the scheme to a Software Trouble Report (STR) corpus and obtained highly relevant keywords and search result.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Sai-Ming Li, Sanjeev Seereeram, Raman K. Mehra, and Chris Miles "Context-sensitive keyword selection using text data mining", Proc. SPIE 4730, Data Mining and Knowledge Discovery: Theory, Tools, and Technology IV, (12 March 2002); https://doi.org/10.1117/12.460238
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Switches

Vector spaces

Microchannel plates

Data mining

Spherical lenses

Software engineering

Spectral resolution

RELATED CONTENT

Presence management in next generation networks
Proceedings of SPIE (July 25 2001)
Coarse-to-fine texture images retrieval method
Proceedings of SPIE (January 01 2001)
Angle Tree a new index structure for high dimensional...
Proceedings of SPIE (December 19 2001)
Quasi-moments under the projected rotation group
Proceedings of SPIE (April 21 1995)
Terrain change detection and updating with image pyramid
Proceedings of SPIE (November 14 2007)
A GA based clustering algorithm for large data sets with...
Proceedings of SPIE (September 25 2003)

Back to Top