Sciweavers

188 search results - page 16 / 38
» A Proposal for the Automatic Generation of Instances from Un...
Sort
View
ECIR
2009
Springer
14 years 4 months ago
Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation
Algorithms that enable the process of automatically mining distinct topics in document collections have become increasingly important due to their applications in many fields and ...
Levent Bolelli, Seyda Ertekin, C. Lee Giles
JIIS
2002
168views more  JIIS 2002»
13 years 7 months ago
Hidden Markov Models for Text Categorization in Multi-Page Documents
In the traditional setting, text categorization is formulated as a concept learning problem where each instance is a single isolated document. However, this perspective is not appr...
Paolo Frasconi, Giovanni Soda, Alessandro Vullo
DOCENG
2009
ACM
14 years 2 months ago
From rhetorical structures to document structure: shallow pragmatic analysis for document engineering
In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric ...
Gersende Georg, Hugo Hernault, Marc Cavazza, Helmu...
CVPR
2009
IEEE
15 years 2 months ago
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework
Given an image, we propose a hierarchical generative model that classifies the overall scene, recognizes and segments each object component, as well as annotates the image with ...
Fei-Fei Li 0002, Li-Jia Li, Richard Socher
COLING
2002
13 years 7 months ago
Extracting Important Sentences with Support Vector Machines
Extracting sentences that contain important information from a document is a form of text summarization. The technique is the key to the automatic generation of summaries similar ...
Tsutomu Hirao, Hideki Isozaki, Eisaku Maeda, Yuji ...