Sciweavers

708 search results - page 46 / 142
» Identifying Content Blocks from Web Documents
Sort
View
ESWS
2007
Springer
14 years 2 months ago
Putting Business Intelligence into Documents
Business processes are often statically implemented and may not be established ad-hoc. For the realization of dynamic process configurations that demand for changes in these imple...
Tobias Bürger
COLING
2010
13 years 3 months ago
Large Scale Parallel Document Mining for Machine Translation
A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...
Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...
ASSETS
2008
ACM
13 years 10 months ago
What's new?: making web page updates accessible
Web applications facilitated by technologies such as JavaScript, DHTML, AJAX, and Flash use a considerable amount of dynamic web content that is either inaccessible or unusable by...
Yevgen Borodin, Jeffrey P. Bigham, Rohit Raman, I....
SIGIR
1998
ACM
14 years 27 days ago
Improved Algorithms for Topic Distillation in a Hyperlinked Environment
This paper addresses the problem of topic distillation on the World Wide Web, namely, given a typical user query to find quality documents related to the query topic. Connectivity...
Krishna Bharat, Monika Rauch Henzinger
CHI
2009
ACM
14 years 9 months ago
Resonance on the web: web dynamics and revisitation patterns
The Web is a dynamic, ever-changing collection of information accessed in a dynamic way. This paper explores the relationship between Web page content change (obtained from an hou...
Eytan Adar, Jaime Teevan, Susan T. Dumais