Search Sciweavers | Sciweavers

235

JIIS
2002

168views more JIIS 2002»

Hidden Markov Models for Text Categorization in Multi-Page Documents

15 years 7 months ago

In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...

Paolo Frasconi, Giovanni Soda, Alessandro Vullo

claim paper

Read More »

222

click to vote

MICAI
2010
Springer

271views Artificial Intelligence» more MICAI 2010»

Towards Document Plagiarism Detection Based on the Relevance and Fragmentation of the Reused Text

15 years 5 months ago

Download users.dsic.upv.es

Traditionally, External Plagiarism Detection has been carried out by determining and measuring the similar sections between a given pair of documents, known as source and suspiciou...

Fernando Sánchez-Vega, Luis Villaseñ...

claim paper

Read More »

226

click to vote

ML
2000
ACM

124views Machine Learning» more ML 2000»

Text Classification from Labeled and Unlabeled Documents using EM

15 years 7 months ago

Download www.kamalnigam.com

This paper shows that the accuracy of learned text classifiers can be improved by augmenting a small number of labeled training documents with a large pool of unlabeled documents. ...

Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...

claim paper

Read More »

209

click to vote

CIKM
2010
Springer

298views Information Technology» more CIKM 2010»

Automatically suggesting topics for augmenting text documents

15 years 5 months ago

Download www.cs.mcgill.ca

We present a method for automated topic suggestion. Given a plain-text input document, our algorithm produces a ranking of novel topics that could enrich the input document in a m...

Robert West, Doina Precup, Joelle Pineau

claim paper

Read More »

224

click to vote

ICASSP
2009
IEEE

137views Signal Processing» more ICASSP 2009»

Data hiding in hard-copy text documents robust to print, scan and photocopy operations

16 years 2 months ago

Download www.merl.com

This paper describes a method for hiding data inside printed text documents that is resilient to print/scan and photocopying operations. Using the principle of channel coding with...

Avinash L. Varna, Shantanu Rane, Anthony Vetro

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers