Sciweavers

2189 search results - page 331 / 438
» Webbed documents
Sort
View
108
Voted
IIWAS
2008
15 years 4 months ago
Combining content extraction heuristics: the CombinE system
The main text content of an HTML document on the WWW is typically surrounded by additional contents, such as navigation menus, advertisements, link lists or design elements. Conte...
Thomas Gottron
116
Voted
TREC
2003
15 years 4 months ago
QUALIFIER In TREC-12 QA Main Task
This paper describes a question answering system and its various modules to solve definition, factoid and list questions defined in the TREC12 Main task. In particular, we tackle ...
Hui Yang, Hang Cui, Mstislav Maslennikov, Long Qiu...
120
Voted
IPM
2007
69views more  IPM 2007»
15 years 2 months ago
Investigating sentence weighting components for automatic summarisation
The work described here initially formed part of a triangulation exercise to establish the effectiveness of the Query Term Order algorithm. The methodology produced subsequently p...
Shao Fen Liang, Siobhan Devlin, John Tait
129
Voted
AIIA
2007
Springer
15 years 8 months ago
Harvesting Relational and Structured Knowledge for Ontology Building in the WPro Architecture
We present two algorithms for supporting semi-automatic ontology building, integrated in WPro, a new architecture for ontology learning from Web documents. The first algorithm auto...
Daniele Bagni, Marco Cappella, Maria Teresa Pazien...
129
Voted
IJCNLP
2005
Springer
15 years 8 months ago
Heuristic Methods for Reducing Errors of Geographic Named Entities Learned by Bootstrapping
Abstract. One of issues in the bootstrapping for named entity recognition is how to control annotation errors introduced at every iteration. In this paper, we present several heuri...
Seungwoo Lee, Gary Geunbae Lee