Search Sciweavers | Sciweavers

232 search results - page 22 / 47

» Query-related data extraction of hidden web documents

105

click to vote

WWW
2007
ACM

144views Internet Technology» more WWW 2007»

Combining classifiers to identify online databases

16 years 4 months ago

Download www2007.org

We address the problem of identifying the domain of online databases. More precisely, given a set F of Web forms automatically gathered by a focused crawler and an online database...

Luciano Barbosa, Juliana Freire

claim paper

Read More »

135

Voted

SIGIR
2009
ACM

174views Information Technology» more SIGIR 2009»

Smoothing clickthrough data for web search ranking

15 years 10 months ago

Download research.microsoft.com

Incorporating features extracted from clickthrough data (called clickthrough features) has been demonstrated to significantly improve the performance of ranking models for Web sea...

Jianfeng Gao, Wei Yuan, Xiao Li, Kefeng Deng, Jian...

claim paper

Read More »

132

Voted

AAAI
1997

162views Intelligent Agents» more AAAI 1997»

Template-Based Information Mining from HTML Documents

15 years 4 months ago

Download research.microsoft.com

Tools for mining information from data can create added value for the Internet. As the majority of electronic documents available over the network are in unstructured textual form...

Jane Yung-jen Hsu, Wen-tau Yih

claim paper

Read More »

111

Voted

CIKM
2005
Springer

126views Information Technology» more CIKM 2005»

Structure-based query-specific document summarization

15 years 9 months ago

Download users.cis.fiu.edu

Summarization of text documents is increasingly important with the amount of data available on the Internet. The large majority of current approaches view documents as linear sequ...

Ramakrishna Varadarajan, Vagelis Hristidis

claim paper

Read More »

126

Voted

PAKDD
2010
ACM

167views Data Mining» more PAKDD 2010»

Resource-Bounded Information Extraction: Acquiring Missing Feature Values on Demand

15 years 7 months ago

Download www.cs.umass.edu

We present a general framework for the task of extracting speciﬁc information “on demand” from a large corpus such as the Web under resource-constraints. Given a database wit...

Pallika Kanani, Andrew McCallum, Shaohan Hu

claim paper

Read More »

« Prev « First page 22 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers