Search Sciweavers | Sciweavers

186

Voted

SIGIR
2009
ACM

130views Information Technology» more SIGIR 2009»

Evaluating effects of machine translation accuracy on cross-lingual patent retrieval

15 years 11 months ago

We organized a machine translation (MT) task at the Seventh NTCIR Workshop. Participating groups were requested to machine translate sentences in patent documents and also search ...

Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, Take...

claim paper

Read More »

137

click to vote

DOCENG
2007
ACM

121views Document Analysis» more DOCENG 2007»

Structure and content analysis for html medical articles: a hidden markov model approach

15 years 10 months ago

Download archive.nlm.nih.gov

We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we...

Jie Zou, Daniel X. Le, George R. Thoma

claim paper

Read More »

152

click to vote

WIKIS
2010
ACM

139views Internet Technology» more WIKIS 2010»

WikiPics: multilingual image search based on Wiki-mining

15 years 10 months ago

Download brightbyte.de

This demonstration introduces WikiPics, a language-independent image search engine for Wikimedia Commons. Based on the multilingual thesaurus provided by WikiWord, WikiPics allows...

Daniel Kinzler

claim paper

Read More »

195

click to vote

AIRWEB
2008
Springer

127views Internet Technology» more AIRWEB 2008»

Exploring linguistic features for web spam detection: a preliminary study

15 years 8 months ago

Download airweb.cse.lehigh.edu

We study the usability of linguistic features in the Web spam classification task. The features were computed on two Web spam corpora: Webspam-Uk2006 and Webspam-Uk2007, we make t...

Jakub Piskorski, Marcin Sydow, Dawid Weiss

claim paper

Read More »

215

click to vote

CIKM
2008
Springer

250views Information Technology» more CIKM 2008»

Using structured text for large-scale attribute extraction

15 years 8 months ago

Download www.isi.edu

We propose a weakly-supervised approach for extracting class attributes from structured text available within Web documents. The overall precision of the extracted attributes is a...

Sujith Ravi, Marius Pasca

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers