Sciweavers

3441 search results - page 23 / 689
» Intelligent Computation of Presentation Documents
Sort
View
132
Voted
HPDC
2010
IEEE
15 years 4 months ago
ParaText: scalable text modeling and analysis
Automated analysis of unstructured text documents (e.g., web pages, newswire articles, research publications, business reports) is a key capability for solving important problems ...
Daniel M. Dunlavy, Timothy M. Shead, Eric T. Stant...
135
Voted
AAAI
2000
15 years 5 months ago
A Mutually Beneficial Integration of Data Mining and Information Extraction
Text mining concerns applying data mining techniques to unstructured text. Information extraction (IE) is a form of shallow text understanding that locates specific pieces of data...
Un Yong Nahm, Raymond J. Mooney
157
Voted
ACL
2008
15 years 5 months ago
Pairwise Document Similarity in Large Collections with MapReduce
This paper presents a MapReduce algorithm for computing pairwise document similarity in large document collections. MapReduce is an attractive framework because it allows us to de...
Tamer Elsayed, Jimmy J. Lin, Douglas W. Oard
109
Voted
COLING
2010
14 years 10 months ago
Towards Automatic Building of Document Keywords
Document keywords are associated to documents as summarized versions of the documents' content. Considering that the number of documents is quickly growing every day, the ava...
Joaquim Silva, José Gabriel Lopes
133
Voted
NAACL
2003
15 years 5 months ago
Automating XML markup of text documents
We present a novel system for automatically marking up text documents into XML and discuss the benefits of XML markup for intelligent information retrieval. The system uses the Se...
Shazia Akhtar, Ronan G. Reilly, John Dunnion