Sciweavers

1192 search results - page 89 / 239
» Subject Index
Sort
View
186
Voted
SIGIR
2009
ACM
15 years 11 months ago
Evaluating effects of machine translation accuracy on cross-lingual patent retrieval
We organized a machine translation (MT) task at the Seventh NTCIR Workshop. Participating groups were requested to machine translate sentences in patent documents and also search ...
Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, Take...
DOCENG
2007
ACM
15 years 10 months ago
Structure and content analysis for html medical articles: a hidden markov model approach
We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we...
Jie Zou, Daniel X. Le, George R. Thoma
WIKIS
2010
ACM
15 years 10 months ago
WikiPics: multilingual image search based on Wiki-mining
This demonstration introduces WikiPics, a language-independent image search engine for Wikimedia Commons. Based on the multilingual thesaurus provided by WikiWord, WikiPics allows...
Daniel Kinzler
AIRWEB
2008
Springer
15 years 8 months ago
Exploring linguistic features for web spam detection: a preliminary study
We study the usability of linguistic features in the Web spam classification task. The features were computed on two Web spam corpora: Webspam-Uk2006 and Webspam-Uk2007, we make t...
Jakub Piskorski, Marcin Sydow, Dawid Weiss
CIKM
2008
Springer
15 years 8 months ago
Using structured text for large-scale attribute extraction
We propose a weakly-supervised approach for extracting class attributes from structured text available within Web documents. The overall precision of the extracted attributes is a...
Sujith Ravi, Marius Pasca