Sciweavers

2827 search results - page 43 / 566
» Marking Text Documents
ICDAR
2011
IEEE
12 years 7 months ago
Identification of Indic Scripts on Torn-Documents
Questioned Document Examination processes often encompass analysis of torn documents. To aid a forensic expert, automatic classification of content type in torn documents might ...
Sukalpa Chanda, Katrin Franke, Umapada Pal
CIKM
2001
Springer
14 years 14 days ago
A Domain Independent Environment for Creating Information Extraction Modules
Text-Mining is a growing area of interest within the field of Data Mining and Knowledge Discovery. Given a collection of text documents, most approaches to Text Mining perform kno...
Ronen Feldman, Yonatan Aumann, Yair Liberzon, Kfir...
CIT
2005
Springer
13 years 7 months ago
Simple Classification into Large Topic Ontology of Web Documents
The paper presents an approach to classifying Web documents into a large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology an...
Marko Grobelnik, Dunja Mladenic
AAAI
1998
13 years 9 months ago
Learning to Classify Text from Labeled and Unlabeled Documents
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
DOCENG
2010
ACM
13 years 6 months ago
Semantics-enriched document exchange
In e-business development, semantics-oriented document exchange is becoming important, because it can support cross-domain user connection, business transaction and collaboration. ...
Jingzhi Guo, Ming Sang Ho