Sciweavers

ICML
1998
IEEE
14 years 8 months ago
Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus
Cross-language latent semantic indexing is a method that learns useful language-independent vector representations of terms through a statistical analysis of a document-aligned text...
Michael L. Littman, Fan Jiang, Greg A. Keim
WWW
2010
ACM
14 years 2 months ago
A scalable machine-learning approach for semi-structured named entity recognition
Named entity recognition studies the problem of locating and classifying parts of free text into a set of predefined categories. Although extensive research has focused on the de...
Utku Irmak, Reiner Kraft
CLEF
2010
Springer
13 years 8 months ago
Wikipedia Vandalism Detection Through Machine Learning: Feature Review and New Proposals - Lab Report for PAN at CLEF 2010
Wikipedia is an online encyclopedia that anyone can edit. In this open model, some people edit with the intent of harming the integrity of Wikipedia. This is known as vandalism. W...
Santiago Moisés Mola-Velasco
ICASSP
2010
IEEE
13 years 7 months ago
Summarization- and learning-based approaches to information distillation
Information distillation is the task that aims to extract relevant passages of text from massive volumes of textual and audio sources, given a query. In this paper, we investigate...
Boriska Toth, Dilek Hakkani-Tür, Sibel Yaman
ECML
2007
Springer
13 years 11 months ago
User Oriented Hierarchical Information Organization and Retrieval
In order to organize huge document collections, labeled hierarchical structures are frequently used. Users are most efficient in navigating such hierarchies if they refl...
Korinna Bade, Marcel Hermkes, Andreas Nürnber...