Sciweavers

1071 search results - page 175 / 215
» A kernel-based approach to document retrieval
Sort
View
ICDE
2009
IEEE
156views Database» more  ICDE 2009»
14 years 11 months ago
Distributed Structural Relaxation of XPath Queries
Due to the structural heterogeneity of XML, queries are often interpreted approximately. This is achieved by relaxing the query and ranking the results based on their relevance to ...
Georgia Koloniari, Evaggelia Pitoura
WWW
2010
ACM
14 years 4 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
ISI
2007
Springer
14 years 4 months ago
DOTS: Detection of Off-Topic Search via Result Clustering
— Often document dissemination is limited to a “need to know” basis so as to better maintain organizational trade secrets. Retrieving documents that are off-topic to a user...
Nazli Goharian, Alana Platt
DL
1997
Springer
117views Digital Library» more  DL 1997»
14 years 2 months ago
Evaluating the Cost of Boolean Query Mapping
Non-uniform query languages make searching over heterogeneous information sources difficult. Our approach is to allow a user to compose Boolean queries in one rich front-end lang...
Kevin Chen-Chuan Chang, Hector Garcia-Molina
JCB
2007
106views more  JCB 2007»
13 years 9 months ago
Clustered Sequence Representation for Fast Homology Search
We present a novel approach to managing redundancy in sequence databanks such as GenBank. We store clusters of near-identical sequences as a representative union-sequence and a se...
Michael Cameron, Yaniv Bernstein, Hugh E. Williams