Sciweavers

65 search results - page 9 / 13
» Text - Image Separation in Devanagari Documents
Sort
View
IPM
2007
143views more  IPM 2007»
13 years 6 months ago
QCS: A system for querying, clustering and summarizing documents
Information retrieval systems consist of many complicated components. Research and development of such systems is often hampered by the difficulty in evaluating how each particula...
Daniel M. Dunlavy, Dianne P. O'Leary, John M. Conr...
WWW
2001
ACM
14 years 7 months ago
Algorithms and programming models for efficient representation of XML for Internet applications
XML is poised to take the World-Wide-Web to the next level of innovation. XML data, large or small, with or without associated schema, will be exchanged between increasing number ...
Neel Sundaresan, Reshad Moussa
NIPS
2007
13 years 8 months ago
Supervised Topic Models
We introduce supervised latent Dirichlet allocation (sLDA), a statistical model of labelled documents. The model accommodates a variety of response types. We derive a maximum-like...
David M. Blei, Jon D. McAuliffe
ACL
2003
13 years 8 months ago
Automatic Acquisition of Named Entity Tagged Corpus from World Wide Web
In this paper, we present a method that automatically constructs a Named Entity (NE) tagged corpus from the web to be used for learning of Named Entity Recognition systems. We use...
Joohui An, Seungwoo Lee, Gary Geunbae Lee
CIKM
2011
Springer
12 years 6 months ago
Joint inference for cross-document information extraction
Previous information extraction (IE) systems are typically organized as a pipeline architecture of separated stages which make independent local decisions. When the data grows bey...
Qi Li, Sam Anzaroot, Wen-Pin Lin, Xiang Li, Heng J...