Sciweavers

417 search results - page 43 / 84
» Document Classification Using a Finite Mixture Model
Sort
View
ECIR
2007
Springer
13 years 9 months ago
Probabilistic Models for Expert Finding
A common task in many applications is to find persons who are knowledgeable about a given topic (i.e., expert finding). In this paper, we propose and develop a general probabilis...
Hui Fang, ChengXiang Zhai
ICAIL
2007
ACM
13 years 11 months ago
The Legal-RDF Ontology. A Generic Model for Legal Documents
Legal-RDF.org1 publishes a practical ontology that models both the layout and content of a document and metadata about the document; these have been built using data models implici...
John McClure
DEXAW
1999
IEEE
187views Database» more  DEXAW 1999»
14 years 4 days ago
Optical Font Recognition for Multi-Font OCR and Document Processing
In this paper we present a Multi-font OCR system to be employed for document processing, which performs, at the same time, both the character recognition and the font-style detect...
Serena La Manna, Anna Maria Colla, Alessandro Sper...
KDD
2009
ACM
191views Data Mining» more  KDD 2009»
14 years 8 months ago
Efficient methods for topic model inference on streaming document collections
Topic models provide a powerful tool for analyzing large text collections by representing high dimensional data in a low dimensional subspace. Fitting a topic model given a set of...
Limin Yao, David M. Mimno, Andrew McCallum
CORR
2000
Springer
86views Education» more  CORR 2000»
13 years 7 months ago
Variable Word Rate N-grams
The rate of occurrence of words is not uniform but varies from document to document. Despite this observation, parameters for conventional n-gram language models are usually deriv...
Yoshihiko Gotoh, Steve Renals