Key Phrase Indexing With Controlled Vocabularies

Google TechTalks
June 21, 2006

Olena Medelyan is a graduate student who has just started a Google-funded PhD scholarship. Her research looks at keyphrase extraction using lexical and linguistic techniques.

ABSTRACT
Keyphrases are widely used in information retrieval as a brief but precise summary of documents. They are usually selected by professional human indexers. The more consistent the indexers are with each other, the higher the retrieval efficiency. 1. We describe an experiment where six professionals assigned keyphrases from a controlled vocabulary to the same documents, and evaluate their indexing consistency. Interesting patterns discovered in this experiment helped in developing an automatic…
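
As a rough illustration of what evaluating indexing consistency can involve, the sketch below scores the overlap between keyphrase sets assigned to one document by different indexers. It assumes Rolling's measure, 2C / (A + B), where C is the number of phrases two indexers share and A and B are the sizes of their sets. The abstract does not say which measure the experiment used, so this choice is an assumption, and the function names and sample keyphrases are hypothetical.

```python
from itertools import combinations


def rolling_consistency(a, b):
    """Rolling's consistency between two keyphrase sets: 2C / (A + B).

    Assumed measure for illustration; the abstract does not name one.
    """
    a, b = set(a), set(b)
    if not a and not b:
        # Two empty assignments are treated as fully consistent.
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


def mean_pairwise_consistency(assignments):
    """Average Rolling consistency over every pair of indexers."""
    pairs = list(combinations(assignments, 2))
    if not pairs:
        raise ValueError("need at least two indexers")
    return sum(rolling_consistency(a, b) for a, b in pairs) / len(pairs)


# Hypothetical keyphrases assigned to one document by three indexers.
indexers = [
    {"forestry", "soil erosion", "land use"},
    {"forestry", "land use", "deforestation"},
    {"forestry", "soil erosion"},
]

score = mean_pairwise_consistency(indexers)
print(f"mean pairwise consistency: {score:.3f}")
```

With a controlled vocabulary, each phrase is a fixed vocabulary term, so exact set intersection is a reasonable way to count matches. Free-text keyphrases would need normalization (case, stemming) before comparison.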