Sciweavers

323 search results - page 27 / 65
» An Information Extraction Model for Unconstrained Handwritte...
Sort
View
SIGMOD
2009
ACM
140views Database» more  SIGMOD 2009»
14 years 3 months ago
Robust web extraction: an approach based on a probabilistic tree-edit model
On script-generated web sites, many documents share common HTML tree structure, allowing wrappers to effectively extract information of interest. Of course, the scripts and thus ...
Nilesh N. Dalvi, Philip Bohannon, Fei Sha
INTERSPEECH
2010
13 years 3 months ago
Using dependency parsing and machine learning for factoid question answering on spoken documents
This paper presents our experiments in question answering for speech corpora. These experiments focus on improving the answer extraction step of the QA process. We present two app...
Pere Comas, Jordi Turmo, Lluís Màrqu...
KDD
2007
ACM
237views Data Mining» more  KDD 2007»
14 years 9 months ago
Knowledge discovery of multiple-topic document using parametric mixture model with dirichlet prior
Documents, such as those seen on Wikipedia and Folksonomy, have tended to be assigned with multiple topics as a meta-data. Therefore, it is more and more important to analyze a re...
Issei Sato, Hiroshi Nakagawa
EMNLP
2010
13 years 7 months ago
Positional Language Models for Clinical Information Retrieval
The PECO framework is a knowledge representation for formulating clinical questions. Queries are decomposed into four aspects, which are Patient-Problem (P), Exposure (E), Compari...
Florian Boudin, Jian-Yun Nie, Martin Dawes
AND
2010
13 years 7 months ago
Document: a useful level for facing noisy data
In this paper we will present a set of experiments using large digitalized collections of books to show that logical structures can be extracted with good quality when working at ...
Hervé Déjean, Jean-Luc Meunier