Sciweavers

311 search results - page 27 / 63
» XTRACT: A System for Extracting Document Type Descriptors fr...
Sort
View
TREC
2004
13 years 9 months ago
THUIR at TREC 2004: QA
In this paper, we describe ideas and related experiments of Tsinghua University IR group in TREC 2004 QA track. In this track, our system consists three components: Question analy...
Wei Tan 0002, Qunxiu Chen, Shaoping Ma
WWW
2005
ACM
14 years 8 months ago
Automatically learning document taxonomies for hierarchical classification
While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...
Kunal Punera, Suju Rajan, Joydeep Ghosh
FCS
2006
13 years 9 months ago
Formal Representation and Transformation of DTDs to Sem-ODM Semantic Schemas
Abstract. Many projects have investigated the issue of storing XML in traditional database systems and exporting data in traditional databases as XML documents. However, they paid ...
Li Yang, Naphtali Rishe
CIKM
2010
Springer
13 years 6 months ago
Clickthrough-based translation models for web search: from word models to phrase models
Web search is challenging partly due to the fact that search queries and Web documents use different language styles and vocabularies. This paper provides a quantitative analysis ...
Jianfeng Gao, Xiaodong He, Jian-Yun Nie
PLDI
2010
ACM
14 years 5 months ago
A Context-free Markup Language for Semi-structured Text
An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...
Qian Xi, David Walker