Search Sciweavers | Sciweavers

125

HIS
2003

131views Information Technology» more HIS 2003»

Evolving Better Stoplists for Document Clustering and Web Intelligence

15 years 3 months ago

: Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...

Mark P. Sinka, David Corne

claim paper

Read More »

104

Voted

CIKM
2009
Springer

127views Information Technology» more CIKM 2009»

Vetting the links of the web

15 years 9 months ago

Download www.cse.lehigh.edu

Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...

Na Dai, Brian D. Davison

claim paper

Read More »

114

click to vote

CICLING
2009
Springer

335views Natural Language Processing» more CICLING 2009»

Language Identification on the Web: Extending the Dictionary Method

15 years 6 months ago

Download www.fi.muni.cz

Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...

Radim Rehurek, Milan Kolkus

claim paper

Read More »

118

Voted

KDD
2002
ACM

293views Data Mining» more KDD 2002»

Automatic Categorization of Web Pages and User Clustering with Mixtures of Hidden Markov Models

16 years 2 months ago

Download www.snn.ru.nl

We propose mixtures of hidden Markov models for modelling clickstreams of web surfers. Hence, the page categorization is learned from the data without the need for a (possibly cumb...

Alexander Ypma, Tom Heskes

claim paper

Read More »

110

Voted

PKDD
2007
Springer

120views Data Mining» more PKDD 2007»

Site-Independent Template-Block Detection

15 years 8 months ago

Download research.microsoft.com

Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...

Aleksander Kolcz, Wen-tau Yih

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers