Sciweavers

242 search results - page 20 / 49
» Japanese Named Entity Extraction Evaluation - Analysis of Re...
Sort
View
APWEB
2010
Springer
13 years 6 months ago
ECON: An Approach to Extract Content from Web News Page
Abstract--This paper provides a simple but effective approach, named ECON, to fully-automatically extract content from Web news page. ECON uses a DOM tree to represent the Web news...
Yan Guo, Huifeng Tang, Linhai Song, Yu Wang 0009, ...
DEXA
2009
Springer
173views Database» more  DEXA 2009»
14 years 3 months ago
Incremental Ontology-Based Extraction and Alignment in Semi-structured Documents
SHIRI 1 is an ontology-based system for integration of semistructured documents related to a specific domain. The system’s purpose is to allow users to access to relevant parts ...
Mouhamadou Thiam, Nacéra Bennacer, Nathalie...
ICASSP
2009
IEEE
14 years 3 months ago
Automatic named identification of speakers using diarization and ASR systems
In this paper, we consider the extraction of speaker identity from audio records of broadcast news without a priori acoustic information about speakers. Using an automatic speech ...
Vincent Jousse, Simon Petit-Renaud, Sylvain Meigni...
RIAO
2007
13 years 9 months ago
Extracting Useful Information from the Full Text of Fiction
In this paper, we describe some experiments in large-scale Information Extraction (IE) focusing on book texts. We investigate the scalability of IE techniques to full-sized books,...
Sharon Givon, Maria Milosavljevic
DGO
2007
192views Education» more  DGO 2007»
13 years 10 months ago
D-HOTM: distributed higher order text mining
We present D-HOTM, a framework for Distributed Higher Order Text Mining based on named entities extracted from textual data that are stored in distributed relational databases. Unl...
William M. Pottenger