Sciweavers

459 search results - page 73 / 92
» Random sampling from a search engine's index
Sort
View
BMCBI
2007
166views more  BMCBI 2007»
13 years 8 months ago
Data handling strategies for high throughput pyrosequencers
Background: New high throughput pyrosequencers such as the 454 Life Sciences GS 20 are capable of massively parallelizing DNA sequencing providing an unprecedented rate of output ...
Gabriele A. Trombetti, Raoul J. P. Bonnal, Ermanno...
VLDB
2004
ACM
95views Database» more  VLDB 2004»
14 years 1 months ago
Combating Web Spam with TrustRank
Web spam pages use various techniques to achieve higher-than-deserved rankings in a search engine’s results. While human experts can identify spam, it is too expensive to manual...
Zoltán Gyöngyi, Hector Garcia-Molina, ...
XSYM
2004
Springer
115views Database» more  XSYM 2004»
14 years 1 months ago
XQuery Processing with Relevance Ranking
Abstract. We are presenting a coherent framework for XQuery processing that incorporates IR-style approximate matching and allows the ordering of results by their relevance score. ...
Leonidas Fegaras
ECML
2001
Springer
14 years 28 days ago
Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL
Abstract. This paper presents a simple unsupervised learning algorithm for recognizing synonyms, based on statistical data acquired by querying a Web search engine. The algorithm, ...
Peter D. Turney
COOPIS
1998
IEEE
14 years 1 days ago
Wrapper Generation for Web Accessible Data Sources
There is an increase in the number of data sources that can be queried across the WWW. Such sources typically support HTML forms-based interfaces and search engines query collecti...
Jean-Robert Gruser, Louiqa Raschid, Maria-Esther V...