Sciweavers

2677 search results - page 149 / 536
» Extracting Structured Data from Web Pages
Sort
View
WWW
2006
ACM
16 years 3 months ago
Interactive wrapper generation with minimal user effort
While much of the data on the web is unstructured in nature, there is also a significant amount of embedded structured data, such as product information on e-commerce sites or sto...
Utku Irmak, Torsten Suel
126
Voted
AAAI
2008
15 years 5 months ago
An Unsupervised Approach for Product Record Normalization across Different Web Sites
An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...
Tak-Lam Wong, Tik-Shun Wong, Wai Lam
105
Voted
SIGIR
2009
ACM
15 years 9 months ago
Using anchor texts with their hyperlink structure for web search
As a good complement to page content, anchor texts have been extensively used, and proven to be useful, in commercial search engines. However, anchor texts have been assumed to be...
Zhicheng Dou, Ruihua Song, Jian-Yun Nie, Ji-Rong W...
125
Voted
DL
2000
Springer
210views Digital Library» more  DL 2000»
15 years 7 months ago
Extracting and visualizing semantic structures in retrieval results for browsing
The paper introduces an approach that organizes retrieval results semantically and displays them spatially for browsing. Latent Semantic Analysis as well as cluster techniques are...
Katy Börner
135
Voted
LREC
2008
172views Education» more  LREC 2008»
15 years 4 months ago
CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and
Being the client's first interface, call centres worldwide contain a huge amount of information of all kind under the form of conversational speech. If accessible, this infor...
Martine Garnier-Rizet, Gilles Adda, Frederik Caill...