Sciweavers

288 search results - page 15 / 58
» Extracting compound terms from domain corpora
Sort
View
WWW
2008
ACM
14 years 8 months ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs) ? groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
ACL
2010
13 years 5 months ago
Automatically Generating Term Frequency Induced Taxonomies
We propose a novel method to automatically acquire a term-frequency-based taxonomy from a corpus using an unsupervised method. A term-frequency-based taxonomy is useful for applic...
Karin Murthy, Tanveer A. Faruquie, L. Venkata Subr...
WWW
2007
ACM
14 years 8 months ago
Towards domain-independent information extraction from web tables
Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...
Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...
ECAI
2000
Springer
14 years 1 days ago
Using Description Logics for Ontology Extraction
The paper presents a prototype of a system for querying the Web in natural language (French) for a limited domain. The domain knowledge, represented in description logics (DL), is ...
Amalia Todirascu, François de Bertrand de B...
CIKM
2001
Springer
14 years 6 days ago
A Domain Independent Environment for Creating Information Extraction Modules
Text-Mining is a growing area of interest within the field of Data Mining and Knowledge Discovery. Given a collection of text documents, most approaches to Text Mining perform kno...
Ronen Feldman, Yonatan Aumann, Yair Liberzon, Kfir...