Sciweavers

1506 search results - page 192 / 302
» Developing an Open Architecture for Performance Data Mining
Sort
View
ICDAR
2011
IEEE
12 years 10 months ago
Extending Page Segmentation Algorithms for Mixed-Layout Document Processing
—The goal of this work is to add the capability to segment documents containing text, graphics, and pictures in the open source OCR engine OCRopus. To achieve this goal, OCRopusâ...
Amy Winder, Tim L. Andersen, Elisa H. Barney Smith
KDD
2000
ACM
115views Data Mining» more  KDD 2000»
14 years 2 months ago
A framework for specifying explicit bias for revision of approximate information extraction rules
Information extraction is one of the most important techniques used in Text Mining. One of the main problems in building information extraction (IE) systems is that the knowledge ...
Ronen Feldman, Yair Liberzon, Binyamin Rosenfeld, ...
KDD
2007
ACM
153views Data Mining» more  KDD 2007»
14 years 11 months ago
Exploiting duality in summarization with deterministic guarantees
Summarization is an important task in data mining. A major challenge over the past years has been the efficient construction of fixed-space synopses that provide a deterministic q...
Panagiotis Karras, Dimitris Sacharidis, Nikos Mamo...
KDD
2002
ACM
147views Data Mining» more  KDD 2002»
14 years 11 months ago
Sequential cost-sensitive decision making with reinforcement learning
Recently, there has been increasing interest in the issues of cost-sensitive learning and decision making in a variety of applications of data mining. A number of approaches have ...
Edwin P. D. Pednault, Naoki Abe, Bianca Zadrozny
KDD
2010
ACM
188views Data Mining» more  KDD 2010»
14 years 21 days ago
Inferring networks of diffusion and influence
Information diffusion and virus propagation are fundamental processes talking place in networks. While it is often possible to directly observe when nodes become infected, observi...
Manuel Gomez-Rodriguez, Jure Leskovec, Andreas Kra...