Search Sciweavers | Sciweavers

43 search results - page 5 / 9

» Creating a Persian-English Comparable Corpus

167

Voted

WWW
2006
ACM

107views Internet Technology» more WWW 2006»

Random sampling from a search engine's index

16 years 6 months ago

Download webee.technion.ac.il

We revisit a problem introduced by Bharat and Broder almost a decade ago: how to sample random pages from the corpus of documents indexed by a search engine, using only the search...

Ziv Bar-Yossef, Maxim Gurevich

claim paper

Read More »

206

click to vote

ACMSE
2009
ACM

195views Theoretical Computer Science» more ACMSE 2009»

Applying randomized projection to aid prediction algorithms in detecting high-dimensional rogue applications

16 years 18 days ago

Download www2.latech.edu

This paper describes a research effort to improve the use of the cosine similarity information retrieval technique to detect unknown, known or variances of known rogue software by...

Travis Atkison

claim paper

Read More »

140

click to vote

AI
2006
Springer

105views Artificial Intelligence» more AI 2006»

Unsupervised Named-Entity Recognition: Generating Gazetteers and Resolving Ambiguity

15 years 9 months ago

Download cogprints.org

In this paper, we propose a named-entity recognition (NER) system that addresses two major limitations frequently discussed in the field. First, the system requires no human interv...

David Nadeau, Peter D. Turney, Stan Matwin

claim paper

Read More »

180

click to vote

ACL
2010

137views Computational Linguistics» more ACL 2010»

Cross Lingual Adaptation: An Experiment on Sentiment Classifications

15 years 4 months ago

Download www.aclweb.org

In this paper, we study the problem of using an annotated corpus in English for the same natural language processing task in another language. While various machine translation sy...

Bin Wei, Christopher Pal

claim paper

Read More »

148

click to vote

ICDAR
2009
IEEE

148views Document Analysis» more ICDAR 2009»

Automated Ground Truth Data Generation for Newspaper Document Images

16 years 25 days ago

Download www.cvc.uab.es

In document image understanding, public datasets with ground-truth are an important part of scientiﬁc work. They are not only helpful for developing new methods, but also provid...

Thomas Strecker, Joost van Beusekom, Sahin Albayra...

claim paper

Read More »

« Prev « First page 5 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers