Search Sciweavers | Sciweavers

95 search results - page 10 / 19

» A cross-collection mixture model for comparative text mining

click to vote

KDD
2006
ACM

113views Data Mining» more KDD 2006»

A new efficient probabilistic model for mining labeled ordered trees

14 years 8 months ago

Download delivery.acm.org

Mining frequent patterns is a general and important issue in data mining. Complex and unstructured (or semi-structured) datasets have appeared in major data mining applications, i...

Kosuke Hashimoto, Kiyoko F. Aoki-Kinoshita, Nobuhi...

claim paper

Read More »

click to vote

WSDM
2009
ACM

172views Data Mining» more WSDM 2009»

Clustering the tagged web

14 years 2 months ago

Download www.stanford.edu

Automatically clustering web pages into semantic groups promises improved search and browsing on the web. In this paper, we demonstrate how user-generated tags from largescale soc...

Daniel Ramage, Paul Heymann, Christopher D. Mannin...

claim paper

Read More »

click to vote

GFKL
2005
Springer

93views Data Mining» more GFKL 2005»

A Hybrid Machine Learning Approach for Information Extraction from Free Text

14 years 1 months ago

Download www.dfki.de

Abstract. We present a hybrid machine learning approach for information extraction from unstructured documents by integrating a learned classiﬁer based on the Maximum Entropy Mod...

Günter Neumann

claim paper

Read More »

click to vote

WSDM
2010
ACM

215views Data Mining» more WSDM 2010»

Boilerplate Detection using Shallow Text Features

14 years 5 months ago

Download www.wsdm-conference.org

In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...

Christian Kohlschütter, Peter Fankhauser, Wol...

claim paper

Read More »

click to vote

ICDM
2005
IEEE

163views Data Mining» more ICDM 2005»

Efficient Text Classification by Weighted Proximal SVM

14 years 1 months ago

Download www.cs.ust.hk

In this paper, we present an algorithm that can classify large-scale text data with high classification quality and fast training speed. Our method is based on a novel extension o...

Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zhe...

claim paper

Read More »

« Prev « First page 10 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers