Sciweavers

1565 search results
» Topical locality in the Web
WWW
2005
ACM
14 years 9 months ago
User-centric Web crawling
Search engines are the primary gateways of information access on the Web today. Behind the scenes, search engines crawl the Web to populate a local indexed repository of Web pages...
Sandeep Pandey, Christopher Olston
USENIX
2000
13 years 10 months ago
Integrating a Command Shell into a Web Browser
The transition from command-line interfaces to graphical interfaces has resulted in programs that are easier to learn and use, but harder to automate and reuse. Another transition...
Robert C. Miller, Brad A. Myers
ICDM
2008
IEEE
14 years 3 months ago
Collective Latent Dirichlet Allocation
In this paper, we propose a new variant of Latent Dirichlet Allocation (LDA): Collective LDA (C-LDA), for multiple corpora modeling. C-LDA combines multiple corpora during learning...
Zhiyong Shen, Jun Sun, Yi-Dong Shen
ACL
2009
13 years 6 months ago
Automatically Generating Wikipedia Articles: A Structure-Aware Approach
In this paper, we investigate an approach for creating a comprehensive textual overview of a subject composed of information drawn from the Internet. We use the high-level structu...
Christina Sauper, Regina Barzilay
ICASSP
2008
IEEE
14 years 3 months ago
On-demand new word learning using World Wide Web
Most of the Web-based methods for lexicon augmenting consist in capturing global semantic features of the targeted domain in order to collect relevant documents from the Web. We s...
Stanislas Oger, Georges Linares, Frédé...