Sciweavers

368 search results - page 21 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
SIGMOD
2000
ACM
85views Database» more  SIGMOD 2000»
13 years 11 months ago
Finding Replicated Web Collections
Many web documents (such as JAVA FAQs) are being replicated on the Internet. Often entire document collections (such as hyperlinked Linux manuals) are being replicated many times....
Junghoo Cho, Narayanan Shivakumar, Hector Garcia-M...
EPIA
2003
Springer
14 years 18 days ago
Automatic Selection of Table Areas in Documents for Information Extraction
: information contained in companies’ financial statements is valuable for decision making at various levels. Much of the relevant information in such documents is contained in t...
Ana Costa e Silva, Alípio Jorge, Luí...
DMIN
2006
146views Data Mining» more  DMIN 2006»
13 years 8 months ago
A Comparison of Two Document Clustering Approaches for Clustering Medical Documents
Medical data is often presented as free text in the form of medical reports. Such documents contain important information about patients, disease progression and management, but ar...
Fathi H. Saad, Beatriz de la Iglesia, Duncan G. Be...
CG
2007
Springer
13 years 7 months ago
Visual text mining using association rules
In many situations, individuals or groups of individuals are faced with the need to examine sets of documents to achieve understanding of their structure and to locate relevant in...
Alneu de Andrade Lopes, Roberto Pinho, Fernando Vi...
TMM
2010
199views Management» more  TMM 2010»
13 years 2 months ago
Video Annotation Through Search and Graph Reinforcement Mining
Abstract--Unlimited vocabulary annotation of multimedia documents remains elusive despite progress solving the problem in the case of a small, fixed lexicon. Taking advantage of th...
Emily Moxley, Tao Mei, Bangalore S. Manjunath