Search Sciweavers | Sciweavers

376 search results - page 5 / 76

» A Hybrid Machine Learning Approach for Information Extractio...

129

Voted

DEBU
2000

95views more DEBU 2000»

Accurately and Reliably Extracting Data from the Web: A Machine Learning Approach

15 years 3 months ago

Download www.isi.edu

A critical problem in developing information agents for the Web is accessing data that is formatted for human use. We have developed a set of tools for extracting data from web si...

Craig A. Knoblock, Kristina Lerman, Steven Minton,...

claim paper

Read More »

137

click to vote

IRAL
2003
ACM

158views Information Technology» more IRAL 2003»

Learning bilingual translations from comparable corpora to cross-language information retrieval: hybrid statistics-based and lin

15 years 9 months ago

Download acl.ldc.upenn.edu

Recent years saw an increased interest in the use and the construction of large corpora. With this increased interest and awareness has come an expansion in the application to kno...

Fatiha Sadat, Masatoshi Yoshikawa, Shunsuke Uemura

claim paper

Read More »

153

click to vote

ICMLA
2007

192views Machine Learning» more ICMLA 2007»

Semi-Supervised Active Learning for Modeling Medical Concepts from Free Text

15 years 5 months ago

Download people.csail.mit.edu

We apply a new active learning formulation to the problem of learning medical concepts from unstructured text. The new formulation is based on maximizing the mutual information th...

Rómer Rosales, Praveen Krishnamurthy, R. Bh...

claim paper

Read More »

148

Voted

KDD
2009
ACM

266views Data Mining» more KDD 2009»

OpinionMiner: a novel machine learning system for web opinion mining and extraction

15 years 10 months ago

Download www-ai.cs.uni-dortmund.de

Merchants selling products on the Web often ask their customers to share their opinions and hands-on experiences on products they have purchased. Unfortunately, reading through al...

Wei Jin, Hung Hay Ho, Rohini K. Srihari

claim paper

Read More »

159

Voted

SIGIR
2003
ACM

147views Information Technology» more SIGIR 2003»

Text categorization by boosting automatically extracted concepts

15 years 9 months ago

Download www.cs.brown.edu

Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...

Lijuan Cai, Thomas Hofmann

claim paper

Read More »

« Prev « First page 5 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers