Search Sciweavers | Sciweavers

34 search results - page 4 / 7

» Mining the Web to Create Minority Language Corpora


ESWS
2010
Springer

181 views, Internet Technology

The Semantic Gap of Formalized Meaning

15 years 4 months ago

Download svn.aksw.org

Recent work in Ontology learning and Text mining has mainly focused on engineering methods to solve practical problems. In this thesis, we investigate methods that can substantially...

Sebastian Hellmann


ITCC
2005
IEEE

105 views, Information Technology

Elimination of Redundant Information for Web Data Mining

15 years 11 months ago

Download eprints.utas.edu.au

These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...

Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang


WWW
2007
ACM

144 views, Internet Technology

Towards domain-independent information extraction from web tables

16 years 6 months ago

Download www2007.org

Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...

Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...


CICLING
2009
Springer

335 views, Natural Language Processing

Language Identification on the Web: Extending the Dictionary Method

15 years 10 months ago

Download www.fi.muni.cz

Automated language identification of written text is a well-established research domain that has received considerable attention in the past. By now, efficient and effecti...

Radim Rehurek, Milan Kolkus


LREC
2010

150 views, Education

A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus

15 years 7 months ago

Download www.lrec-conf.org

In this paper, we present the multilingual Sense Folder Corpus. After the analysis of different corpora, we describe the requirements that have to be satisfied for evaluating sema...

Ernesto William De Luca
