Sciweavers

» A Partial-Repeatability Approach to Data Mining
KDD
2007
ACM
Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus
We present a document routing and index partitioning scheme for scalable similarity-based search of documents in a large corpus. We consider the case when similarity-based search ...
Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra
KDD
2004
ACM
Column-generation boosting methods for mixture of kernels
We devise a boosting approach to classification and regression based on column generation using a mixture of kernels. Traditional kernel methods construct models based on a single...
Jinbo Bi, Tong Zhang, Kristin P. Bennett
WSDM
2010
ACM
Ranking with Query-Dependent Loss for Web Search
Queries describe the users' search intent and therefore they play an essential role in the context of ranking for information retrieval and Web search. However, most of exist...
Jiang Bian, Tie-Yan Liu, Tao Qin, Hongyuan Zha
SDM
2009
SIAM
Multi-topic Based Query-Oriented Summarization
Query-oriented summarization aims at extracting an informative summary from a document collection for a given query. It is very useful to help users grasp the main information rel...
Dewei Chen, Jie Tang, Limin Yao
ICDM
2009
IEEE
Semi-Supervised Sequence Labeling with Self-Learned Features
Typical information extraction (IE) systems can be seen as tasks assigning labels to words in a natural language sequence. The performance is restricted by the availability of l...
Yanjun Qi, Pavel Kuksa, Ronan Collobert, Kunihiko ...