Sciweavers

308 search results - page 11 / 62
» Syntactic Similarity of Web Documents
Sort
View
AAAI
2006
13 years 8 months ago
Corpus-based and Knowledge-based Measures of Text Semantic Similarity
This paper presents a method for measuring the semantic similarity of texts, using corpus-based and knowledge-based measures of similarity. Previous work on this problem has focus...
Rada Mihalcea, Courtney Corley, Carlo Strapparava
ACSW
2004
13 years 8 months ago
Discovering Parallel Text from the World Wide Web
Parallel corpus is a rich linguistic resource for various multilingual text management tasks, including crosslingual text retrieval, multilingual computational linguistics and mul...
Jisong Chen, Rowena Chau, Chung-Hsing Yeh
ECIR
2008
Springer
13 years 8 months ago
Clustering Template Based Web Documents
More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...
Thomas Gottron
ICPR
2008
IEEE
14 years 8 months ago
Clustering of short commercial documents for the web
Document clustering techniques have been applied in several areas, with the web as one of the most recent and influent. Both general-purpose and text-oriented techniques exist and...
Elisabetta Binaghi, Ignazio Gallo, Moreno Carullo,...
WWW
2007
ACM
14 years 7 months ago
OntoWiki: A Tool for Social, Semantic Collaboration
Abstract We present OntoWiki, a tool providing support for agile, distributed knowledge engineering scenarios. OntoWiki facilitates the visual presentation of a knowledge base as a...
Jens Lehmann, Sören Auer, Sebastian Dietzold,...