Sciweavers

240 search results - page 25 / 48
» Learning to Extract Content from News Webpages
IUI
2010
ACM
14 years 5 months ago
Tell me more, not just "more of the same"
The Web makes it possible for news readers to learn more about virtually any story that interests them. Media outlets and search engines typically augment their information with l...
Francisco Iacobelli, Larry Birnbaum, Kristian J. H...
SIGIR
2003
ACM
14 years 1 month ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann
AI
2000
Springer
13 years 8 months ago
Learning to construct knowledge bases from the World Wide Web
The World Wide Web is a vast source of information accessible to computers, but understandable only to humans. The goal of the research described here is to automatically create a...
Mark Craven, Dan DiPasquo, Dayne Freitag, Andrew M...
EMNLP
2007
13 years 10 months ago
Enhancing Single-Document Summarization by Combining RankNet and Third-Party Sources
We present a new approach to automatic summarization based on neural nets, called NetSum. We extract a set of features from each sentence that helps identify its importance in the...
Krysta Marie Svore, Lucy Vanderwende, Christopher ...
ICIP
2001
IEEE
14 years 10 months ago
Image data mining from financial documents based on wavelet features
In this paper, we present a framework for clustering and classifying cheque images according to their payee-line content. The features used in the clustering and classification pro...
Ossama El Badawy, Mahmoud R. El-Sakka, Khaled Hass...