Sciweavers

153 search results - page 1 / 31
» Towards noise-resilient document modeling
Sort
View
111
Voted
COLING
2010
14 years 9 months ago
Towards a Unified Approach to Simultaneous Single-Document and Multi-Document Summarizations
Single-document summarization and multidocument summarization are very closely related tasks and they have been widely investigated independently. This paper examines the mutual i...
Xiaojun Wan
119
Voted
CIKM
2011
Springer
14 years 2 months ago
Towards noise-resilient document modeling
We introduce a generative probabilistic document model based on latent Dirichlet allocation (LDA), to deal with textual errors in the document collection. Our model is inspired by...
Tao Yang, Dongwon Lee
144
Voted
DOCENG
2010
ACM
15 years 3 days ago
Diffing, patching and merging XML documents: toward a generic calculus of editing deltas
This work addresses what we believe to be a central issue in the field of XML diff and merge computation: the mathematical modeling o-called editing deltas and the study of their ...
Jean-Yves Vion-Dury
144
Voted
ICDM
2009
IEEE
162views Data Mining» more  ICDM 2009»
15 years 14 days ago
Towards a Universal Text Classifier: Transfer Learning Using Encyclopedic Knowledge
Document classification is a key task for many text mining applications. However, traditional text classification requires labeled data to construct reliable and accurate classifie...
Pu Wang, Carlotta Domeniconi
138
Voted
IAT
2010
IEEE
15 years 21 days ago
Information Retrieval eXperience (IRX): Towards a Human-Centered Personalized Model of Relevance
We approach Information Retrieval (IR) from a User eXperience (UX) perspective. Through introducing a model for Information Retrieval eXperience (IRX), this paper operationalizes a...
Frans van der Sluis, Egon L. van den Broek, Betsy ...