Sciweavers

290 search results - page 36 / 58
» Document normalization revisited
Sort
View
ICML
2010
IEEE
13 years 7 months ago
Spherical Topic Models
We introduce the Spherical Admixture Model (SAM), a Bayesian topic model for arbitrary 2 normalized data. SAM maintains the same hierarchical structure as Latent Dirichlet Allocat...
Joseph Reisinger, Austin Waters, Bryan Silverthorn...
CI
2007
124views more  CI 2007»
13 years 7 months ago
Searching for Explanatory Web Pages Using Automatic Query Expansion
: When one tries to use the Web as a dictionary or encyclopedia, entering some single term into a search engine, the highly-ranked pages in the result can include irrelevant or use...
Manabu Tauchi, Nigel Ward
ICDAR
2009
IEEE
13 years 5 months ago
A Character-Structure-Guided Approach to Estimating Possible Orientations of a Rotated Isolated Online Handwritten Chinese Chara
This paper presents a character-structure-guided approach to estimating possible orientations of a rotated isolated online handwritten Chinese character. Using the estimated orien...
Tingting He, Qiang Huo
ICDAR
2009
IEEE
13 years 5 months ago
Online Handwritten Japanese Character String Recognition Using Conditional Random Fields
This paper describes an online handwritten Japanese character string recognition system based on conditional random fields, which integrates the information of character recogniti...
Xiang-Dong Zhou, Cheng-Lin Liu, Masaki Nakagawa
CLEF
2011
Springer
12 years 7 months ago
Intrinsic Plagiarism Detection Using Character Trigram Distance Scores - Notebook for PAN at CLEF 2011
Abstract In this paper, we describe a novel approach to intrinsic plagiarism detection. Each suspicious document is divided into a series of consecutive, potentially overlapping â€...
Mike Kestemont, Kim Luyckx, Walter Daelemans