Sciweavers

3441 search results - page 23 / 689
» Intelligent Computation of Presentation Documents
Sort
View
HPDC
2010
IEEE
15 years 6 months ago
ParaText: scalable text modeling and analysis
Automated analysis of unstructured text documents (e.g., web pages, newswire articles, research publications, business reports) is a key capability for solving important problems ...
Daniel M. Dunlavy, Timothy M. Shead, Eric T. Stant...
AAAI
2000
15 years 7 months ago
A Mutually Beneficial Integration of Data Mining and Information Extraction
Text mining concerns applying data mining techniques to unstructured text. Information extraction (IE) is a form of shallow text understanding that locates specific pieces of data...
Un Yong Nahm, Raymond J. Mooney
ACL
2008
15 years 7 months ago
Pairwise Document Similarity in Large Collections with MapReduce
This paper presents a MapReduce algorithm for computing pairwise document similarity in large document collections. MapReduce is an attractive framework because it allows us to de...
Tamer Elsayed, Jimmy J. Lin, Douglas W. Oard
COLING
2010
15 years 18 days ago
Towards Automatic Building of Document Keywords
Document keywords are associated to documents as summarized versions of the documents' content. Considering that the number of documents is quickly growing every day, the ava...
Joaquim Silva, José Gabriel Lopes
NAACL
2003
15 years 7 months ago
Automating XML markup of text documents
We present a novel system for automatically marking up text documents into XML and discuss the benefits of XML markup for intelligent information retrieval. The system uses the Se...
Shazia Akhtar, Ronan G. Reilly, John Dunnion