Sciweavers

2553 search results - page 362 / 511
» How-To Web Pages
Sort
View
122
Voted
AUSAI
2003
Springer
15 years 7 months ago
Information Extraction via Path Merging
Abstract. In this paper, we describe a new approach to information extraction that neatly integrates top-down hypothesis driven information with bottom-up data driven information. ...
Robert Dale, Cécile Paris, Marc Tilbrook
97
Voted
IDMS
1998
Springer
76views Multimedia» more  IDMS 1998»
15 years 6 months ago
Exploiting User Behaviour in Prefetching WWW Documents
As the popularity of the World Wide Web increases, the amount of traffic results in major congestion problems for the retrieval of data over wide distances. To react to this, user...
Abdulmotaleb El-Saddik, Carsten Griwodz, Ralf Stei...
91
Voted
LREC
2010
216views Education» more  LREC 2010»
15 years 3 months ago
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...
Georgios Petasis, Dimitrios Petasis
LREC
2010
172views Education» more  LREC 2010»
15 years 3 months ago
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9
CzEng 0.9 is the third release of a large parallel corpus of Czech and English. For the current release, CzEng was extended by significant amount of texts from various types of so...
Ondrej Bojar, Adam Liska, Zdenek Zabokrtský
138
Voted
ECIR
2008
Springer
15 years 3 months ago
Exploiting Locality of Wikipedia Links in Entity Ranking
Abstract. Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research field...
Jovan Pehcevski, Anne-Marie Vercoustre, James A. T...