Search Sciweavers | Sciweavers

684 search results - page 3 / 137

Elimination of Redundant Information for Web Data Mining

KDD
2002
ACM

Discovering informative content blocks from Web documents

14 years 8 months ago

Download www.cs.ualberta.ca

In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...

Shian-Hua Lin, Jan-Ming Ho

IJCNLP
2005
Springer

Web-Based Terminology Translation Mining

14 years 1 months ago

Download www.aclweb.org

Mining terminology translation from a large amount of Web data can be applied in many fields such as reading/writing assistant, machine translation and cross-language information r...

Gaolin Fang, Hao Yu, Fumihito Nishino

DAWAK
2003
Springer

Recent Developments in Web Usage Mining Research

14 years 22 days ago

Download webspace.elet.polimi.it

Web Usage Mining is that area of Web Mining which deals with the extraction of interesting knowledge from logging information produced by web servers. In this paper, we present a s...

Federico Michele Facca, Pier Luca Lanzi

WWW
2007
ACM

Efficient search in large textual collections with redundancy

14 years 8 months ago

Download www2007.org

Current web search engines focus on searching only the most recent snapshot of the web. In some cases, however, it would be desirable to search over collections that include many ...

Jiangong Zhang, Torsten Suel

ICDM
2008
IEEE

xCrawl: A High-Recall Crawling Method for Web Mining

14 years 2 months ago

Download ls13-www.cs.uni-dortmund.de

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
