Sciweavers

3090 search results - page 206 / 618
» Document Processing with LinkIT
Sort
View
GECCO
2006
Springer
186views Optimization» more  GECCO 2006»
15 years 8 months ago
Characterizing large text corpora using a maximum variation sampling genetic algorithm
An enormous amount of information available via the Internet exists. Much of this data is in the form of text-based documents. These documents cover a variety of topics that are v...
Robert M. Patton, Thomas E. Potok
EMNLP
2008
15 years 5 months ago
HTM: A Topic Model for Hypertexts
Previously topic models such as PLSI (Probabilistic Latent Semantic Indexing) and LDA (Latent Dirichlet Allocation) were developed for modeling the contents of plain texts. Recent...
Congkai Sun, Bin Gao, Zhenfu Cao, Hang Li
INEX
2005
Springer
15 years 10 months ago
INEX 2005 Multimedia Track
In this article the activities of the INEX 2005 Multimedia track are reported. We succesfully realized our objective, to provide an evaluation platform for the evaluation of retrie...
Roelof van Zwol, Gabriella Kazai, Mounia Lalmas
COMPSAC
2007
IEEE
15 years 10 months ago
Model Oriented Evolutionary Redocumentation
This paper discusses aspects of the redocumentation of legacy systems and proposes a model oriented approach to generating documentation, which is to produce models from existing ...
Feng Chen, Hongji Yang
SIGIR
2006
ACM
15 years 10 months ago
Stylistic text segmentation
This paper focuses on a method for the stylistic segmentation of text documents. Our technique involves mapping the change in a feature throughout a text. We use the linguistic fe...
Paul J. Chase, Shlomo Argamon