Search Sciweavers | Sciweavers

1541 search results - page 77 / 309

» Extracting Web Data Using Instance-Based Learning

WWW 2010, ACM | Internet Technology | 220 views

Not so creepy crawler: easy crawler generation with standard XML queries

16 years 2 months ago

Download www2.pms.ifi.lmu.de

Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...

Franziska von dem Bussche, Klara A. Weiand, Benedi...

KDD 2008, ACM | Data Mining | 153 views

Information extraction from Wikipedia: moving down the long tail

16 years 7 months ago

Download www.cs.washington.edu

Not only is Wikipedia a comprehensive source of quality information, it has several kinds of internal structure (e.g., relational summaries known as infoboxes), which enable self-...

Fei Wu, Raphael Hoffmann, Daniel S. Weld

ESWA 2008 | 140 views

Web taxonomy integration with hierarchical shrinkage algorithm and fine-grained relations

15 years 7 months ago

Download www.iis.sinica.edu.tw

We address the problem of integrating web taxonomies from different real Internet applications. Integrating web taxonomies is to transfer instances from a source to target taxonom...

Chia-Wei Wu, Richard Tzong-Han Tsai, Cheng-Wei Lee...

ICDM 2008, IEEE | Data Mining | 142 views

Unsupervised Face Annotation by Mining the Web

16 years 1 month ago

Download satoh-lab.ex.nii.ac.jp

Searching for images of people is an essential task for image and video search engines. However, current search engines have limited capabilities for this task since they rely on ...

Duy-Dinh Le, Shin'ichi Satoh

KDD 2009, ACM | Data Mining | 190 views

Named entity mining from click-through data using weakly supervised latent Dirichlet allocation

16 years 7 months ago

Download research.microsoft.com

This paper addresses Named Entity Mining (NEM), in which we mine knowledge about named entities such as movies, games, and books from a huge amount of data. NEM is potentially use...

Gu Xu, Shuang-Hong Yang, Hang Li
