Sciweavers

523 search results - page 69 / 105
» Metric Learning for Text Documents
Sort
View
EMNLP
2009
14 years 9 months ago
Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora
A significant portion of the world's text is tagged by readers on social bookmarking websites. Credit attribution is an inherent problem in these corpora because most pages h...
Daniel Ramage, David Hall, Ramesh Nallapati, Chris...
TAL
2004
Springer
15 years 4 months ago
A Study of Chunk-Based and Keyword-Based Approaches for Generating Headlines
Abstract. This paper describes two procedures for generating very short summaries for documents from the DUC-2003 competition: a chunk extraction method based on syntactic dependen...
Enrique Alfonseca, José María Guirao...
BTW
2001
Springer
117views Database» more  BTW 2001»
15 years 3 months ago
XMach-1: A Benchmark for XML Data Management
Abstract. We propose a scaleable multi-user benchmark called XMach-1 (XML Data Management benchmark) for evaluating the performance of XML data management systems. It is based on a...
Timo Böhme, Erhard Rahm
PR
2007
146views more  PR 2007»
14 years 10 months ago
ML-KNN: A lazy learning approach to multi-label learning
Abstract: Multi-label learning originated from the investigation of text categorization problem, where each document may belong to several predefined topics simultaneously. In mul...
Min-Ling Zhang, Zhi-Hua Zhou
FOCS
2009
IEEE
15 years 3 months ago
Space-Efficient Framework for Top-k String Retrieval Problems
Given a set D = {d1, d2, ..., dD} of D strings of total length n, our task is to report the "most relevant" strings for a given query pattern P. This involves somewhat mo...
Wing-Kai Hon, Rahul Shah, Jeffrey Scott Vitter