Sciweavers

ACL
2010
13 years 9 months ago
Domain Adaptation of Maximum Entropy Language Models
We investigate a recently proposed Bayesian adaptation method for building style-adapted maximum entropy language models for speech recognition, given a large corpus of written la...
Tanel Alumäe, Mikko Kurimo
COLING
2002
13 years 11 months ago
Fertilization of Case Frame Dictionary for Robust Japanese Case Analysis
This paper proposes a method of fertilizing a Japanese case frame dictionary to handle complicated expressions: double nominative sentences, non-gapping relation of relative claus...
Daisuke Kawahara, Sadao Kurohashi
COLING
2000
14 years 27 days ago
Deletions and their reconstruction in tectogrammatical syntactic tagging of very large corpora
The procedure of reconstruction of the underlying structure of sentences (in the process of tagging a very large corpus of Czech) is described, with a special attention paid to th...
Eva Hajicová, Marketa Ceplova
EACL
2006
ACL Anthology
14 years 28 days ago
Why Are They Excited? Identifying and Explaining Spikes in Blog Mood Levels
We describe a method for discovering irregularities in temporal mood patterns appearing in a large corpus of blog posts, and labeling them with a natural language explanation. Sim...
Krisztian Balog, Gilad Mishne, Maarten de Rijke
LREC
2010
14 years 29 days ago
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
We developed a search tool for ngrams extracted from a very large corpus (the current system uses the entire Wikipedia, which has
Satoshi Sekine, Kapil Dalwani
CPM
2006
Springer
14 years 3 months ago
Identifying Co-referential Names Across Large Corpora
A single logical entity can be referred to by several different names over a large text corpus. We present our algorithm for finding all such co-reference sets in a large corpus. Ou...
Levon Lloyd, Andrew Mehler, Steven Skiena
WWW
2003
ACM
15 years 6 days ago
Modeling Web Knowledge for Answering Event-based Questions
For the TREC-style questions, the query terms we get from the original questions are either too brief or often do not contain much relevant information in the corpus. It will be v...
Hui Yang, Tat-Seng Chua, Shuguang Wang
ICPR
2002
IEEE
15 years 18 days ago
Preprocessing and Recognition of Characters in Container Codes
This paper describes the recognition of container code characters. The system has to deal with outdoor images which usually have damaged characters and obtain an answer in real ti...
Alberto J. Pérez Jiménez, Gabriela A...