Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

136

Voted

RIAO
2007

123views Information Technology» more RIAO 2007»

Document frequency and term specificity

15 years 8 months ago

Document frequency and term specificity

Download dis.shef.ac.uk

Document frequency is used in various applications in Information Retrieval and other related fields. An assumption frequently made is that the document frequency represents a level of the term’s specificity. However, empirical results to support this assumption are limited. Therefore, a large-scale experiment was carried out, using multiple corpora, to gain further insight into the relationship between the document frequency and terms specificity. The results show that the assumption holds only at the very specific levels that cover the majority of vocabulary. The results also show that a larger corpus is more accurate at estimating the specificity. However, the co-occurrence information is shown to be effective for improving the accuracy when only a small corpus is available.

Hideo Joho, Mark Sanderson

Real-time Traffic

Document Frequency | Information Technology | RIAO 2007 | Terms Specificity | Term’s Specificity |

claim paper

Related Content

» An Intelligent TopicSpecific Crawler Using Degree of Relevance

» Correlation of Term Count and Document Frequency for Google NGrams

» A Comparison of Document Sentence and Term Event Spaces

» Evaluating the effectiveness of term frequency histograms for supporting interactive web s...

» Semantically Enhanced Term Frequency

» Lowerbounding term frequency normalization

» Using term informativeness for named entity detection

» Extending Weighting Models with a Term Quality Measure

» Experiments in Term Weighting and Keyword Extraction in Document Clustering

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	RIAO
Authors	Hideo Joho, Mark Sanderson

Comments (0)