Sciweavers

1541 search results - page 23 / 309
» Extracting Web Data Using Instance-Based Learning
Sort
View
IJMSO
2008
149views more  IJMSO 2008»
13 years 8 months ago
Categorisation of web documents using extraction ontologies
: Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...
Li Xu, David W. Embley
WWW
2007
ACM
14 years 9 months ago
Integrating web directories by learning their structures
Documents in the Web are often organized using category trees by information providers (e.g. CNN, BBC) or search engines (e.g. Google, Yahoo!). Such category trees are commonly kn...
Christopher C. Yang, Jianfeng Lin
ICML
2005
IEEE
14 years 9 months ago
2D Conditional Random Fields for Web information extraction
The Web contains an abundance of useful semistructured information about real world objects, and our empirical study shows that strong sequence characteristics exist for Web infor...
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Y...
AAAI
2006
13 years 10 months ago
Automatically Labeling the Inputs and Outputs of Web Services
Information integration systems combine data from multiple heterogeneous Web services to answer complex user queries, provided a user has semantically modeled the service first. T...
Kristina Lerman, Anon Plangprasopchok, Craig A. Kn...
PRIS
2004
13 years 10 months ago
Learning Text Extraction Rules, without Ignoring Stop Words
Information Extraction (IE) from text /web documents has become an important application area of AI. As the number of web sites and documents has grown dramatically, the users need...
João Cordeiro, Pavel Brazdil