This paper describes the process of building a newspaper corpus annotated with events described in specific documents. The main difference to the corpora built as part of the TDT ...
In TREC 2007 Blog Track, we developed a three-step algorithm for the opinion retrieval task. An information retrieval step retrieves the query-relevant documents. A following opin...
—The choices made by user in processing a set of documents is related, in a broad sense, to the sum of influences coming from the documents in the user situation, which does not...
: Documents such as spreadsheets are easy to create, edit, and exchange. However, their use causes a set of well known problems such as poor data quality, lack of multi user suppor...
In order to embed a watermark into a binary document image, some subset of image pixels needs to be modified. This modification will cause a document image distortion. Careful sel...