Sciweavers

563 search results - page 39 / 113
» Assessing the Quality of Natural Language Text Data
Sort
View
ICDE
2008
IEEE
127views Database» more  ICDE 2008»
14 years 10 months ago
Optimizing SQL Queries over Text Databases
Text documents often embed data that is structured in nature, and we can expose this structured data using information extraction technology. By processing a text database with inf...
Alpa Jain, AnHai Doan, Luis Gravano
EMNLP
2009
13 years 6 months ago
Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment
Traditionally, machine learning approaches for information extraction require human annotated data that can be costly and time-consuming to produce. However, in many cases, there ...
Kedar Bellare, Andrew McCallum
WWW
2005
ACM
14 years 9 months ago
Web-assisted annotation, semantic indexing and search of television and radio news
The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recogni...
Mike Dowman, Valentin Tablan, Hamish Cunningham, B...
CIKM
2009
Springer
14 years 1 months ago
Data extraction from the web using wild card queries
This paper presents an overview of our framework for searching and retrieving facts and relationships within natural language text sources. In this framework, an extraction task o...
Davood Rafiei, Haobin Li
ACL
2010
13 years 6 months ago
Learning 5000 Relational Extractors
Many researchers are trying to use information extraction (IE) to create large-scale knowledge bases from natural language text on the Web. However, the primary approach (supervis...
Raphael Hoffmann, Congle Zhang, Daniel S. Weld