Sciweavers

268 search results - page 36 / 54
» Toward Structured Retrieval in Semi-structured Information S...
Sort
View
DEXAW
1999
IEEE
106views Database» more  DEXAW 1999»
15 years 4 months ago
Textual Similarities Based on a Distributional Approach
The design of efficient textual similarities is an important issue in the domain of textual data exploration. Textual similarities are for example central in document collection s...
Romaric Besançon, Martin Rajman, Jean-C&eac...
98
Voted
EWCBR
2006
Springer
15 years 3 months ago
Unsupervised Feature Selection for Text Data
Feature selection for unsupervised tasks is particularly challenging, especially when dealing with text data. The increase in online documents and email communication creates a nee...
Nirmalie Wiratunga, Robert Lothian, Stewart Massie
103
Voted
HT
2005
ACM
15 years 5 months ago
As we may perceive: inferring logical documents from hypertext
In recent years, many algorithms for the Web have been developed that work with information units distinct from individual web pages. These include segments of web pages or aggreg...
Pavel Dmitriev, Carl Lagoze, Boris Suchkov
87
Voted
CORR
2010
Springer
173views Education» more  CORR 2010»
14 years 11 months ago
CONCISE: Compressed 'n' Composable Integer Set
Bit arrays, or bitmaps, are used to significantly speed up set operations in several areas, such as data warehousing, information retrieval, and data mining, to cite a few. Howeve...
Alessandro Colantonio, Roberto Di Pietro
EMNLP
2009
14 years 9 months ago
Toward Completeness in Concept Extraction and Classification
Many algorithms extract terms from text together with some kind of taxonomic classification (is-a) link. However, the general approaches used today, and specifically the methods o...
Eduard H. Hovy, Zornitsa Kozareva, Ellen Riloff