Search Sciweavers | Sciweavers

945 search results - page 18 / 189

» Information Extraction from HTML: Application of a General M...

WWW
2005
ACM

154 views, Internet Technology

Thresher: automating the unwrapping of semantic content from the World Wide Web

16 years 3 months ago

Download www2005.org

We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...

Andrew Hogue, David R. Karger

IR
2000

166 views, Natural Language Processing

Automating the Construction of Internet Portals with Machine Learning

15 years 2 months ago

Download www.kamalnigam.com

Domain-specific internet portals are growing in popularity because they gather content from the Web and organize it for easy access, retrieval and search. For example, www.campsear...

Andrew McCallum, Kamal Nigam, Jason Rennie, Kristi...

CORIA
2011

289 views, Information Technology

Mining the Web for lists of Named Entities

14 years 6 months ago

Download ftp.irit.fr

Named entities play an important role in Information Extraction. They represent unitary namable information within text. In this work, we focus on groups of named entities of the s...

Arlind Kopliku, Mohand Boughanem, Karen Pinel-Sauv...

ECAI
2004
Springer

153 views, Artificial Intelligence

Automatic Recognition of Famous Artists by Machine

15 years 8 months ago

Download www.ofai.at

The paper addresses the question whether it is possible for a machine to learn to distinguish and recognise famous musicians (concert pianists), based on their style of playing. We...

Gerhard Widmer, Patrick Zanon

CIKM
2005
Springer

125 views, Information Technology

Learning to summarise XML documents using content and structure

15 years 8 months ago

Download eprints.pascal-network.org

Documents formatted in eXtensible Markup Language (XML) are becoming increasingly available in collections of various document types. In this paper, we present an approach for the...

Massih-Reza Amini, Anastasios Tombros, Nicolas Usu...
