Sciweavers

317 search results - page 21 / 64
» Style-independent document labeling: design and performance ...
Sort
View
SIGIR
2008
ACM
13 years 7 months ago
Enhancing text clustering by leveraging Wikipedia semantics
Most traditional text clustering methods are based on "bag of words" (BOW) representation based on frequency statistics in a set of documents. BOW, however, ignores the ...
Jian Hu, Lujun Fang, Yang Cao, Hua-Jun Zeng, Hua L...
KDD
2012
ACM
205views Data Mining» more  KDD 2012»
11 years 10 months ago
Rank-loss support instance machines for MIML instance annotation
Multi-instance multi-label learning (MIML) is a framework for supervised classification where the objects to be classified are bags of instances associated with multiple labels....
Forrest Briggs, Xiaoli Z. Fern, Raviv Raich
EMNLP
2007
13 years 9 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder
EDBT
2006
ACM
135views Database» more  EDBT 2006»
14 years 7 months ago
A New Design for a Native XML Storage and Indexing Manager
This paper describes the design and implementation of an XML storage manager for fast and interactive XPath expressions evaluation. This storage manager has two main parts: the XML...
Jihad Boulos, Shant Karakashian
ACL
2010
13 years 5 months ago
"Ask Not What Textual Entailment Can Do for You..."
We challenge the NLP community to participate in a large-scale, distributed effort to design and build resources for developing and evaluating solutions to new and existing NLP ta...
Mark Sammons, V. G. Vinod Vydiswaran, Dan Roth