Sciweavers

602 search results - page 12 / 121
WEBDB
2004
Springer
170 views, Database
15 years 2 months ago
Content and Structure in Indexing and Ranking XML
Rooted in electronic publishing, XML is now widely used for modelling and storing structured text documents. Especially in the WWW, retrieval of XML documents is most useful in co...
Felix Weigel, Holger Meuss, Klaus U. Schulz, Fran...
ACL
2008
14 years 11 months ago
Automatic Editing in a Back-End Speech-to-Text System
Written documents created through dictation differ significantly from a true verbatim transcript of the recorded speech. This poses an obstacle in automatic dictation systems as s...
Maximilian Bisani, Paul Vozila, Olivier Divay, Jef...
ICDM
2007
IEEE
143 views, Data Mining
15 years 3 months ago
Bit Sequences and Biclustering of Text Documents
We propose a new technique for clustering of text documents that relies on a biclustering structure constructed on terms and documents. Our approach makes use of a greedy algorith...
Selim Mimaroglu, Kuniaki Uehara
NAACL
2004
14 years 10 months ago
Catching the Drift: Probabilistic Content Models, with Applications to Generation and Summarization
We consider the problem of modeling the content structure of texts within a specific domain, in terms of the topics the texts address and the order in which these topics appear. W...
Regina Barzilay, Lillian Lee
ADL
2000
Springer
373 views, Digital Library
15 years 1 months ago
BlueView: Virtual Document Servers for Digital Libraries
In the BlueView project, digital library services are developed and partially implemented based on the architecture of virtual document servers. Using standard tools like fulltext...
Andreas Heuer, Holger Meyer, Beate Porst, Patrick ...