Sciweavers

80 search results - page 11 / 16
» Extracting context to improve accuracy for HTML content extr...
Sort
View
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
14 years 6 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
MM
2005
ACM
187views Multimedia» more  MM 2005»
14 years 2 months ago
Augmented segmentation and visualization for presentation videos
We investigate methods of segmenting, visualizing, and indexing presentation videos by both audio and visual data. The audio track is segmented by speaker, and augmented with key ...
Alexander Haubold, John R. Kender
CICLING
2009
Springer
14 years 3 months ago
Semi-supervised Word Sense Disambiguation Using the Web as Corpus
Abstract. As any other classification task, Word Sense Disambiguation requires a large number of training examples. These examples, which are easily obtained for most of the tasks,...
Rafael Guzmán-Cabrera, Paolo Rosso, Manuel ...
NAACL
2004
13 years 10 months ago
Predicting Emotion in Spoken Dialogue from Multiple Knowledge Sources
We examine the utility of multiple types of turn-level and contextual linguistic features for automatically predicting student emotions in human-human spoken tutoring dialogues. W...
Katherine Forbes-Riley, Diane J. Litman
NIPS
2007
13 years 10 months ago
Mining Internet-Scale Software Repositories
Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated c...
Erik Linstead, Paul Rigor, Sushil Krishna Bajracha...