Search Sciweavers | Sciweavers

98 search results - page 7 / 20

» Web Page Classification with an Ant Colony Algorithm

198

click to vote

WWW
2008
ACM

179views Internet Technology» more WWW 2008»

Can chinese web pages be classified with english data source?

16 years 7 months ago

Download www2008.org

As the World Wide Web in China grows rapidly, mining knowledge in Chinese Web pages becomes more and more important. Mining Web information usually relies on the machine learning ...

Xiao Ling, Gui-Rong Xue, Wenyuan Dai, Yun Jiang, Q...

claim paper

Read More »

180

click to vote

LREC
2008

160views Education» more LREC 2008»

Automatic Extraction of Textual Elements from News Web Pages

15 years 8 months ago

Download www.lrec-conf.org

In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...

Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany

claim paper

Read More »

189

click to vote

CIKM
2006
Springer

158views Information Technology» more CIKM 2006»

A comparative study on classifying the functions of web page blocks

15 years 10 months ago

Download www.cs.ust.hk

In this paper, we study the problem of learning block classification models to estimate block functions. We distinguish general models, which are learned across multiple sites, an...

Xiangye Xiao, Qiong Luo, Xing Xie, Wei-Ying Ma

claim paper

Read More »

187

Voted

WWW
2005
ACM

99views Internet Technology» more WWW 2005»

The volume and evolution of web page templates

16 years 7 months ago

Download research.yahoo.com

Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...

David Gibson, Kunal Punera, Andrew Tomkins

claim paper

Read More »

179

Voted

WWW
2006
ACM

179views Internet Technology» more WWW 2006»

Detecting spam web pages through content analysis

16 years 7 months ago

Download research.microsoft.com

In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...

Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...

claim paper

Read More »

« Prev « First page 7 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers