Sciweavers

296 search results - page 11 / 60
» Classifying XML Documents by Using Genre Features
ECIR
2007
Springer
13 years 9 months ago
Feature- and Query-Based Table of Contents Generation for XML Documents
The availability of a document’s logical structure in XML retrieval allows retrieval systems to return document portions (elements) instead of whole documents. This helps searche...
Zoltán Szlávik, Anastasios Tombros, ...
ML
2006
ACM
13 years 7 months ago
XRules: An effective algorithm for structural classification of XML data
Abstract. XML documents have recently become ubiquitous because of their varied applicability in a number of applications. Classification is an important problem in the data mining ...
Mohammed Javeed Zaki, Charu C. Aggarwal
WEBDB
1999
Springer
13 years 12 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
JODL
2007
13 years 7 months ago
Examining topic shifts in content-oriented XML retrieval
Abstract. Content-oriented XML retrieval systems support access to XML repositories by retrieving, in response to user queries, XML document components (XML elements) instead of wh...
Elham Ashoori, Mounia Lalmas, Theodora Tsikrika
CICLING
2004
Springer
14 years 1 months ago
Automatic Learning Features Using Bootstrapping for Text Categorization
When text categorization is applied to complex tasks, it is tedious and expensive to hand-label the large amounts of training data necessary for good performance. In this paper, we...
Wenliang Chen, Jingbo Zhu, Honglin Wu, Tianshun Ya...