Sciweavers

» Extracting Sequences from the Web
IJCAI
2003
13 years 10 months ago
Information Extraction from Web Documents Based on Local Unranked Tree Automaton Inference
Information extraction (IE) aims at extracting specific information from a collection of documents. A lot of previous work on IE from semi-structured documents (in XML or HTML) us...
Raymond Kosala, Maurice Bruynooghe, Jan Van den Bu...
ACL
2003
13 years 10 months ago
Extracting Key Semantic Terms from Chinese Speech Query for Web Searches
This paper discusses the challenges and proposes a solution to performing information retrieval on the Web using Chinese natural language speech query. The main contribution of th...
Gang Wang, Tat-Seng Chua, Yongcheng Wang
WWW
2009
ACM
14 years 9 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
COOPIS
1998
IEEE
14 years 1 month ago
Jedi: Extracting and Synthesizing Information from the Web
Jedi (Java based Extraction and Dissemination of Information) is a lightweight tool for the creation of wrappers and mediators to extract, combine, and reconcile information from ...
Gerald Huck, Peter Fankhauser, Karl Aberer, Erich ...
ICDM
2007
IEEE
14 years 3 months ago
Extracting Author Meta-Data from Web Using Visual Features
Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...
Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles