Sciweavers

SIGMOD
2008
ACM
85views Database» more  SIGMOD 2008»
14 years 11 months ago
Oracle real application testing
Benoît Dageville, Graham Wood, Hailing Yu, J...
SIGMOD
2008
ACM
92views Database» more  SIGMOD 2008»
14 years 11 months ago
Information extraction challenges in managing unstructured data
Over the past few years, we have been trying to build an end-to-end system at Wisconsin to manage unstructured data, using extraction, integration, and user interaction. This pape...
AnHai Doan, Jeffrey F. Naughton, Raghu Ramakrishna...
SIGMOD
2008
ACM
116views Database» more  SIGMOD 2008»
14 years 11 months ago
Using Wikipedia to bootstrap open information extraction
Daniel S. Weld, Raphael Hoffmann, Fei Wu
SIGMOD
2008
ACM
119views Database» more  SIGMOD 2008»
14 years 11 months ago
Modeling and querying probabilistic XML data
Benny Kimelfeld, Yehoshua Sagiv
SIGMOD
2008
ACM
134views Database» more  SIGMOD 2008»
14 years 11 months ago
SystemT: a system for declarative information extraction
As applications within and outside the enterprise encounter increasing volumes of unstructured data, there has been renewed interest in the area of information extraction (IE) ? t...
Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghava...
SIGMOD
2008
ACM
73views Database» more  SIGMOD 2008»
14 years 11 months ago
Purple SOX extraction management system
Philip Bohannon, Srujana Merugu, Cong Yu, Vipul Ag...
SIGMOD
2008
ACM
122views Database» more  SIGMOD 2008»
14 years 11 months ago
Building query optimizers for information extraction: the SQoUT project
Text documents often embed data that is structured in nature. This structured data is increasingly exposed using information extraction systems, which generate structured relation...
Alpa Jain, Panagiotis G. Ipeirotis, Luis Gravano
SIGMOD
2008
ACM
119views Database» more  SIGMOD 2008»
14 years 11 months ago
Webpage understanding: beyond page-level search
In this paper we introduce the webpage understanding problem which consists of three subtasks: webpage segmentation, webpage structure labeling, and webpage text segmentation and ...
Zaiqing Nie, Ji-Rong Wen, Wei-Ying Ma
SIGMOD
2008
ACM
131views Database» more  SIGMOD 2008»
14 years 11 months ago
Domain adaptation of information extraction models
Domain adaptation refers to the process of adapting an extraction model trained in one domain to another related domain with only unlabeled data. We present a brief survey of exis...
Rahul Gupta, Sunita Sarawagi
SIGMOD
2008
ACM
159views Database» more  SIGMOD 2008»
14 years 11 months ago
Web-scale extraction of structured data
A long-standing goal of Web research has been to construct a unified Web knowledge base. Information extraction techniques have shown good results on Web inputs, but even most dom...
Michael J. Cafarella, Jayant Madhavan, Alon Y. Hal...