Sciweavers

73 search results - page 11 / 15
» Compression-based document length prior for language models
Sort
View
DOCENG
2003
ACM
14 years 26 days ago
Methods for the semantic analysis of document markup
We present an approach on how to investigate what kind of semantic information is regularly associated with the structural markup of scientific articles. This approach addresses ...
Petra Saskia Bayerl, Harald Lüngen, Daniela G...
CIKM
2011
Springer
12 years 7 months ago
Lower-bounding term frequency normalization
In this paper, we reveal a common deficiency of the current retrieval models: the component of term frequency (TF) normalization by document length is not lower-bounded properly;...
Yuanhua Lv, ChengXiang Zhai
BIBE
2007
IEEE
169views Bioinformatics» more  BIBE 2007»
14 years 1 months ago
Combining Semantics, Context, and Statistical Evidence in Genomics Literature Search
—We present an information retrieval model for combining evidence from concept-based semantics, term statistics, and context for improving search precision of genomics literature...
Jay Urbain, Nazli Goharian, Ophir Frieder
SIGDOC
2006
ACM
14 years 1 months ago
Handling objects: a scenario based approach
We report on the development of a tool which supports software programmers in programming to an interface by providing just in time (JIT) solutions. The development of the tool wa...
Thomas George Kannampallil, John M. Daughtry III
ICDE
2007
IEEE
211views Database» more  ICDE 2007»
14 years 1 months ago
Document Representation and Dimension Reduction for Text Clustering
Increasingly large text datasets and the high dimensionality associated with natural language create a great challenge in text mining. In this research, a systematic study is cond...
M. Mahdi Shafiei, Singer Wang, Roger Zhang, Evange...