Sciweavers

» Metric Learning for Text Documents
DEXAW
2007
IEEE
15 years 3 months ago
Generating a Topic Hierarchy from Dialect Texts
We built a system for the automatic creation of a text-based topic hierarchy, meant to be used in a geographically defined community. This poses two main problems. First, the appea...
Wim De Smet, Marie-Francine Moens
SWAP
2008
15 years 19 days ago
Combining Statistical Techniques and Lexico-syntactic Patterns for Semantic Relations Extraction from Text
We describe a methodology for combining two different techniques for Semantic Relation Extraction from texts. On the one hand, generic lexico-syntactic patterns are applied to the...
Emiliano Giovannetti, Simone Marchi, Simonetta Mon...
SIGMOD
2008
ACM
15 years 11 months ago
SchemaScope: a system for inferring and cleaning XML schemas
We present SchemaScope, a system to derive Document Type Definitions and XML Schemas from corpora of sample XML documents. Tools are provided to visualize, clean, and refine exist...
Geert Jan Bex, Frank Neven, Stijn Vansummeren
ICDM
2007
IEEE
15 years 5 months ago
Bayesian Folding-In with Dirichlet Kernels for PLSI
Probabilistic latent semantic indexing (PLSI) represents documents of a collection as mixture proportions of latent topics, which are learned from the collection by an expectation...
Alexander Hinneburg, Hans-Henning Gabriel, Andr...
TREC
2000
15 years 16 days ago
The PISAB Question Answering System
The PISAB Question Answering system is based on a combination of Information Extraction and Information Retrieval techniques. Knowledge extracted from documents is modeled as a se...
Giuseppe Attardi, Cristian Burrini