Sciweavers

265 search results - page 9 / 53
» Learning Logic Wrappers for Information Extraction from the ...
Sort
View
DEEC
2006
IEEE
14 years 2 months ago
Maintaining Web Navigation Flows for Wrappers
A substantial subset of the web data follows some kind of underlying structure. In order to let software programs gain full benefit from these “semistructured” web sources, wra...
Juan Raposo, Manuel Álvarez, José Lo...
WWW
2004
ACM
14 years 9 months ago
Testbed for information extraction from deep web
Search results generated by searchable databases are served dynamically and far larger than the static documents on the Web. These results pages have been referred to as the Deep ...
Yasuhiro Yamada, Nick Craswell, Tetsuya Nakatoh, S...
COOPIS
1998
IEEE
14 years 6 days ago
Wrapper Generation for Web Accessible Data Sources
There is an increase in the number of data sources that can be queried across the WWW. Such sources typically support HTML forms-based interfaces and search engines query collecti...
Jean-Robert Gruser, Louiqa Raschid, Maria-Esther V...
WSDM
2012
ACM
252views Data Mining» more  WSDM 2012»
12 years 4 months ago
WebSets: extracting sets of entities from the web using unsupervised information extraction
We describe a open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...
Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...
DEXAW
2004
IEEE
104views Database» more  DEXAW 2004»
14 years 9 days ago
Multilingual and Multimedia Information Retrieval from Web Documents
Web documents present new challenges to conventional Information Retrieval (IR) technologies. This paper describes how these challenges are faced in FameIR, a multilingual multime...
Marta Gatius, Manuel Bertrán, Horacio Rodr&...