Sciweavers

» Language Independent Text Categorization
DIS
2007
Springer
Unsupervised Spam Detection Based on String Alienness Measures
We propose an unsupervised method for detecting spam documents from Web page data, based on equivalence relations on strings. We propose 3 measures for quantifying the alienness (...
Kazuyuki Narisawa, Hideo Bannai, Kohei Hatano, Mas...
ICDAR
2003
IEEE
Improved Nearest Neighbor Based Approach to Accurate Document Skew Estimation
The nearest-neighbor based document skew detection methods do not require the presence of a predominant text area, and are not subject to skew angle limitation. However, the accur...
Yue Lu, Chew Lim Tan
ECIR
2006
Springer
Lexical Entailment for Information Retrieval
Textual Entailment has recently been proposed as an application-independent task of recognising whether the meaning of one text may be inferred from another. This is pote...
Stéphane Clinchant, Cyril Goutte, Ér...
ACL
1998
Experiments with Learning Parsing Heuristics
Any large language processing software relies in its operation on heuristic decisions concerning the strategy of processing. These decisions are usually "hard-wired" int...
Sylvain Delisle, Sylvain Létourneau, Stan M...
DOCENG
2010
ACM
Semantics-enriched document exchange
In e-business development, semantics-oriented document exchange is becoming important, because it can support cross-domain user connection, business transaction and collaboration. ...
Jingzhi Guo, Ming Sang Ho