Sciweavers

543 search results - page 18 / 109
» Exploiting content redundancy for web information extraction
Sort
View
WWW
2005
ACM
14 years 8 months ago
Thresher: automating the unwrapping of semantic content from the World Wide Web
We describe Thresher, a system that lets non-technical users teach their browsers how to extract semantic web content from HTML documents on the World Wide Web. Users specify exam...
Andrew Hogue, David R. Karger
ICDM
2003
IEEE
225views Data Mining» more  ICDM 2003»
14 years 26 days ago
Combining the web content and usage mining to understand the visitor behavior in a web site
A web site is a semi structured collection of different kinds of data, whose motivation is show relevant information to visitor and by this way capture her/his attention. Understa...
Juan D. Velásquez, Hiroshi Yasuda, Terumasa...
ISCIS
2003
Springer
14 years 23 days ago
A Cooperative Paradigm for Fighting Information Overload
The Web is mainly processed by humans. The role of the machines is just to transmit and display the contents of the documents, barely being able to do something else. Nowadays ther...
Daniel Gayo-Avello, Darío Álvarez Gu...
ICMCS
2009
IEEE
173views Multimedia» more  ICMCS 2009»
13 years 5 months ago
Linking video ADS with product or service information by web search
With the proliferation of online media services, video ads are pervasive across various platforms involving internet services and interactive TV services. Existing research effort...
Jinqiao Wang, Ling-Yu Duan, Bo Wang, Shi Chen, Yi ...
LREC
2008
172views Education» more  LREC 2008»
13 years 9 months ago
CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and
Being the client's first interface, call centres worldwide contain a huge amount of information of all kind under the form of conversational speech. If accessible, this infor...
Martine Garnier-Rizet, Gilles Adda, Frederik Caill...