Sciweavers

» Template-Based Information Mining from HTML Documents
KDD 1998, ACM. Data Mining.
Probabilistic Modeling for Information Retrieval with Unsupervised Training Data
We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...
Ernest P. Chan, Santiago Garcia, Salim Roukos
PKDD 1998, Springer. Data Mining.
TextVis: An Integrated Visual Environment for Text Mining
TextVis is a visual data mining system for document collections. Such a collection represents an application domain, and the primary goal of the system is to derive patterns that p...
David Landau, Ronen Feldman, Yonatan Aumann, Moshe...
COSIT 2005, Springer. GIS.
Landmark Extraction: A Web Mining Approach
Landmarks play crucial roles in human geographic knowledge. There has been much work focusing on the extraction of landmarks from geographic information systems (GIS) or 3D city mo...
Taro Tezuka, Katsumi Tanaka
TREC 2004.
Indri at TREC 2004: Terabyte Track
This paper provides an overview of experiments carried out at the TREC 2004 Terabyte Track using the Indri search engine. Indri is an efficient, effective distributed search engin...
Donald Metzler, Trevor Strohman, Howard R. Turtle,...
ECIR 2010, Springer.
Extracting Multilingual Topics from Unaligned Comparable Corpora
Topic models have been studied extensively in the context of monolingual corpora. Though there are some attempts to mine topical structure from cross-lingual corpora, they require ...
Jagadeesh Jagarlamudi, Hal Daumé III