Search Sciweavers | Sciweavers

938 search results - page 97 / 188

» Space-Efficient Algorithms for Document Retrieval

230

click to vote

SIGMOD
2008
ACM

123views Database» more SIGMOD 2008»

Query-based partitioning of documents and indexes for information lifecycle management

16 years 5 months ago

Download www.itr-rescue.org

Regulations require businesses to archive many electronic documents for extended periods of time. Given the sheer volume of documents and the response time requirements, documents...

Soumyadeb Mitra, Marianne Winslett, Windsor W. Hsu

claim paper

Read More »

152

click to vote

JCDL
2005
ACM

100views Education» more JCDL 2005»

What's there and what's not?: focused crawling for missing documents in digital libraries

15 years 10 months ago

Download clgiles.ist.psu.edu

Some large scale topical digital libraries, such as CiteSeer, harvest online academic documents by crawling open-access archives, university and author homepages, and authors’ s...

Ziming Zhuang, Rohit Wagle, C. Lee Giles

claim paper

Read More »

179

click to vote

ICDE
2002
IEEE

181views Database» more ICDE 2002»

YFilter: Efficient and Scalable Filtering of XML Documents

15 years 10 months ago

Download yfilter.cs.umass.edu

Soon, much of the data exchanged over the Internet will be encoded in XML, allowing for sophisticated filtering and content-based routing. We have built a filtering engine called ...

Yanlei Diao, Peter M. Fischer, Michael J. Franklin...

claim paper

Read More »

182

click to vote

CIKM
2008
Springer

138views Information Technology» more CIKM 2008»

Identifying table boundaries in digital documents via sparse line detection

15 years 7 months ago

Download chemxseer.ist.psu.edu

Most prior work on information extraction has focused on extracting information from text in digital documents. However, often, the most important information being reported in an...

Ying Liu, Prasenjit Mitra, C. Lee Giles

claim paper

Read More »

131

click to vote

SIGIR
2005
ACM

115views Information Technology» more SIGIR 2005»

Relation between PLSA and NMF and implications

15 years 10 months ago

Download eprints.pascal-network.org

Non-negative Matrix Factorization (NMF, [5]) and Probabilistic Latent Semantic Analysis (PLSA, [4]) have been successfully applied to a number of text analysis tasks such as docum...

Éric Gaussier, Cyril Goutte

claim paper

Read More »

« Prev « First page 97 / 188 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers