Sciweavers

523 search results - page 55 / 105
» Metric Learning for Text Documents
Sort
View
WWW
2009
ACM
16 years 4 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
157
Voted
WWW
2004
ACM
16 years 4 months ago
Incremental formalization of document annotations through ontology-based paraphrasing
For the manual semantic markup of documents to become widespread, users must be able to express annotations that conform to ontologies (or schemas) that have shared meaning. Howev...
Jim Blythe, Yolanda Gil
AND
2010
15 years 1 months ago
Reshaping automatic speech transcripts for robust high-level spoken document analysis
High-level spoken document analysis is required in many applications seeking access to the semantic content of audio data, such as information retrieval, machine translation or au...
Julien Fayolle, Fabienne Moreau, Christian Raymond...
109
Voted
TREC
2003
15 years 4 months ago
Active Feedback - UIUC TREC-2003 HARD Experiments
In this paper, we report our experiments on the HARD (High Accuracy Retrieval from Documents) Track in TREC 2003. We focus on active feedback, i.e., how to intelligently propose q...
Xuehua Shen, ChengXiang Zhai
144
Voted
ACL
2010
15 years 1 months ago
Using Document Level Cross-Event Inference to Improve Event Extraction
Event extraction is a particularly challenging type of information extraction (IE). Most current event extraction systems rely on local information at the phrase or sentence level...
Shasha Liao, Ralph Grishman