Sciweavers

COLING
2010
13 years 5 months ago
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
In this paper, we propose a method for mediatory summarization, which is a novel technique for facilitating users' assessments of the credibility of information on the Web. A...
Hideyuki Shibuki, Takahiro Nagai, Masahiro Nakano,...
COLING
2010
13 years 5 months ago
Open Entity Extraction from Web Search Query Logs
In this paper we propose a completely unsupervised method for open-domain entity extraction and clustering over query logs. The underlying hypothesis is that classes defined by mi...
Alpa Jain, Marco Pennacchiotti
COLING
2010
13 years 5 months ago
Enhancing Cross Document Coreference of Web Documents with Context Similarity and Very Large Scale Text Categorization
Cross Document Coreference (CDC) is the task of constructing the coreference chain for mentions of a person across a set of documents. This work offers a holistic view of using do...
Jian Huang 0002, Pucktada Treeratpituk, Sarah M. T...
ACL
2010
13 years 8 months ago
Tools for Multilingual Grammar-Based Translation on the Web
This is a system demo for a set of tools for translating texts between multiple languages in real time with high quality. The translation works on restricted languages, and is bas...
Aarne Ranta, Krasimir Angelov, Thomas Hallgren
CORR
1999
Springer
107views Education» more  CORR 1999»
13 years 10 months ago
WebScript - A Scripting Language for the Web
WebScript is a scripting language for processing Web documents. Designed as an extension to Jacl, the Java implementation of Tcl, WebScript allows programmers to manipulate HTML i...
Yin Zhang
CN
1999
143views more  CN 1999»
13 years 10 months ago
Embedding Knowledge in Web Documents
The paper argues for the use of general and intuitive knowledge representation languages (and simpler notational variants, e.g. subsets of natural languages) for indexing the cont...
Philippe Martin, Peter W. Eklund
CIT
2005
Springer
13 years 10 months ago
Simple Classification into Large Topic Ontology of Web Documents
The paper presents an approach to classifying Web documents into large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology an...
Marko Grobelnik, Dunja Mladenic
CORR
2007
Springer
114views Education» more  CORR 2007»
13 years 10 months ago
SWI-Prolog and the Web
Prolog is an excellent tool for representing and manipulating data written in formal languages as well as natural language. Its safe semantics and automatic memory management make...
Jan Wielemaker, Zhisheng Huang, Lourens van der Me...
ENTCS
2006
116views more  ENTCS 2006»
13 years 10 months ago
How Recent is a Web Document?
One of the most important aspects of a Web document is its up-to-dateness or recency. Up-to-dateness is particularly relevant to Web documents because they usually contain content...
Bo Hu, Florian Lauck, Jan Scheffczyk
APWEB
2010
Springer
13 years 11 months ago
A Space and Time Efficient Algorithm for SimRank Computation
SimRank has been proposed to rank web documents based on a graph model on hyperlinks. The existing techniques for conducting SimRank computation adopt an iteration computation para...
Weiren Yu, Xuemin Lin, Jiajin Le