unlabeled data | Sciweavers

173

NIPS
2008

215views Information Technology» more NIPS 2008»

Unlabeled data: Now it helps, now it doesn't

15 years 8 months ago

Empirical evidence shows that in favorable situations semi-supervised learning (SSL) algorithms can capitalize on the abundance of unlabeled training data to improve the performan...

Aarti Singh, Robert D. Nowak, Xiaojin Zhu

claim paper

Read More »

206

click to vote

NIPS
2008

143views Information Technology» more NIPS 2008»

Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization

15 years 8 months ago

Download www.cs.cmu.edu

The cluster assumption is exploited by most semi-supervised learning (SSL) methods. However, if the unlabeled data is merely weakly related to the target classes, it becomes quest...

Liu Yang, Rong Jin, Rahul Sukthankar

claim paper

Read More »

180

click to vote

IJCAI
2007

215views Artificial Intelligence» more IJCAI 2007»

Detecting Changes in Unlabeled Data Streams Using Martingale

15 years 8 months ago

Download www.cs.gmu.edu

The martingale framework for detecting changes in data stream, currently only applicable to labeled data, is extended here to unlabeled data using clustering concept. The one-pass...

Shen-Shyang Ho, Harry Wechsler

claim paper

Read More »

172

click to vote

IJCAI
2007

172views Artificial Intelligence» more IJCAI 2007»

Optimistic Active-Learning Using Mutual Information

15 years 8 months ago

Download www.ijcai.org

An “active learning system” will sequentially decide which unlabeled instance to label, with the goal of efﬁciently gathering the information necessary to produce a good cla...

Yuhong Guo, Russell Greiner

claim paper

Read More »

203

click to vote

ICMLA
2007

192views Machine Learning» more ICMLA 2007»

Semi-Supervised Active Learning for Modeling Medical Concepts from Free Text

15 years 8 months ago

Download people.csail.mit.edu

We apply a new active learning formulation to the problem of learning medical concepts from unstructured text. The new formulation is based on maximizing the mutual information th...

Rómer Rosales, Praveen Krishnamurthy, R. Bh...

claim paper

Read More »

188

click to vote

SDM
2010
SIAM

226views Data Mining» more SDM 2010»

Two-View Transductive Support Vector Machines

15 years 8 months ago

Download www.cais.ntu.edu.sg

Obtaining high-quality and up-to-date labeled data can be difficult in many real-world machine learning applications, especially for Internet classification tasks like review spam...

Guangxia Li, Steven C. H. Hoi, Kuiyu Chang

claim paper

Read More »

188

click to vote

EMNLP
2007

250views Natural Language Processing» more EMNLP 2007»

Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach

15 years 8 months ago

Download acl.ldc.upenn.edu

This paper proposes a framework for semi-supervised structured output learning (SOL), speciﬁcally for sequence labeling, based on a hybrid generative and discriminative approach...

Jun Suzuki, Akinori Fujino, Hideki Isozaki

claim paper

Read More »

193

click to vote

COLING
2008

159views Computational Linguistics» more COLING 2008»

Homotopy-Based Semi-Supervised Hidden Markov Models for Sequence Labeling

15 years 8 months ago

Download www.cs.sfu.ca

This paper explores the use of the homotopy method for training a semi-supervised Hidden Markov Model (HMM) used for sequence labeling. We provide a novel polynomial-time algorith...

Gholamreza Haffari, Anoop Sarkar

claim paper

Read More »

165

click to vote

COLING
2008

130views Computational Linguistics» more COLING 2008»

Re-estimation of Lexical Parameters for Treebank PCFGs

15 years 8 months ago

Download aclweb.org

We present procedures which pool lexical information estimated from unlabeled data via the Inside-Outside algorithm, with lexical information from a treebank PCFG. The procedures ...

Tejaswini Deoskar

claim paper

Read More »

149

click to vote

COLING
2008

130views Computational Linguistics» more COLING 2008»

Learning Reliable Information for Dependency Parsing Adaptation

15 years 8 months ago

Download www.aclweb.org

In this paper, we focus on the adaptation problem that has a large labeled data in the source domain and a large but unlabeled data in the target domain. Our aim is to learn relia...

Wenliang Chen, Youzheng Wu, Hitoshi Isahara

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers