Search Sciweavers | Sciweavers

385 search results - page 5 / 77

» Automatic Term Categorization by Extracting Knowledge from t...

257

click to vote

WSDM
2012
ACM

252views Data Mining» more WSDM 2012»

WebSets: extracting sets of entities from the web using unsupervised information extraction

14 years 2 months ago

Download www.cs.cmu.edu

We describe a open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...

Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...

claim paper

Read More »

194

click to vote

NLPRS
2001
Springer

205views Natural Language Processing» more NLPRS 2001»

Automatically Harvesting Katakana-English Term Pairs from Search Engine Query Logs

15 years 11 months ago

Download research.microsoft.com

This paper describes a method of extracting katakana words and phrases, along with their English counterparts from non-aligned monolingual web search engine query logs. The method...

Eric Brill, Gary Kacmarcik, Chris Brockett

claim paper

Read More »

225

click to vote

SIGIR
2003
ACM

147views Information Technology» more SIGIR 2003»

Text categorization by boosting automatically extracted concepts

16 years 18 days ago

Download www.cs.brown.edu

Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...

Lijuan Cai, Thomas Hofmann

claim paper

Read More »

211

click to vote

WWW
2005
ACM

153views Internet Technology» more WWW 2005»

METEOR: metadata and instance extraction from object referral lists on the web

16 years 8 months ago

Download www2005.org

The Web has established itself as the largest public data repository ever available. Even though the vast majority of information on the Web is formatted to be easily readable by ...

Hasan Davulcu, Srinivas Vadrevu, Saravanakumar Nag...

claim paper

Read More »

180

click to vote

SIGIR
2004
ACM

130views Information Technology» more SIGIR 2004»

Parameterized generation of labeled datasets for text categorization based on a hierarchical directory

16 years 23 days ago

Download www.cs.technion.ac.il

Although text categorization is a burgeoning area of IR research, readily available test collections in this ﬁeld are surprisingly scarce. We describe a methodology and system (...

Dmitry Davidov, Evgeniy Gabrilovich, Shaul Markovi...

claim paper

Read More »

« Prev « First page 5 / 77 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers