documents | Sciweavers

14

JAIR
2010

94views more JAIR 2010»

Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback

13 years 3 months ago

While traditional research on text clustering has largely focused on grouping documents by topic, it is conceivable that a user may want to cluster documents along other dimension...

Sajib Dasgupta, Vincent Ng

claim paper

Read More »

22

click to vote

IRCDL
2010

213views Digital Library» more IRCDL 2010»

A New Domain Independent Keyphrase Extraction System

13 years 3 months ago

Download users.dimi.uniud.it

In this paper we present a keyphrase extraction system that can extract potential phrases from a single document in an unsupervised, domain-independent way. We extract word n-grams...

Nirmala Pudota, Antonina Dattolo, Andrea Baruzzo, ...

claim paper

Read More »

18

click to vote

JTAER
2008

118views more JTAER 2008»

Service and Document Based Interoperability for European eCustoms Solutions

13 years 3 months ago

Download www.jtaer.com

Innovative eCustoms solutions play an important role in the pan-European eGovernment strategy. The underlying premise is interoperability postulating a common understanding of pro...

Tobias Vogel, Alexander Schmidt, Alexander Lemm, H...

claim paper

Read More »

18

click to vote

CIKM
2010
Springer

175views Information Technology» more CIKM 2010»

Improved index compression techniques for versioned document collections

13 years 3 months ago

Download cis.poly.edu

Current Information Retrieval systems use inverted index structures for eﬃcient query processing. Due to the extremely large size of many data sets, these index structures are u...

Jinru He, Junyuan Zeng, Torsten Suel

claim paper

Read More »

15

click to vote

CIKM
2010
Springer

135views Information Technology» more CIKM 2010»

Reverted indexing for feedback and expansion

13 years 3 months ago

Download www.fxpal.com

Traditional interactive information retrieval systems function by creating inverted lists, or term indexes. For every term in the vocabulary, a list is created that contains the d...

Jeremy Pickens, Matthew Cooper, Gene Golovchinsky

claim paper

Read More »

15

click to vote

CIKM
2010
Springer

171views Information Technology» more CIKM 2010»

Online learning for recency search ranking using real-time user feedback

13 years 3 months ago

Download www.research.rutgers.edu

Traditional machine-learned ranking algorithms for web search are trained in batch mode, which assume static relevance of documents for a given query. Although such a batch-learni...

Taesup Moon, Lihong Li, Wei Chu, Ciya Liao, Zhaohu...

claim paper

Read More »

15

click to vote

SCHOLARPEDIA
2008

109views more SCHOLARPEDIA 2008»

Latent semantic analysis

13 years 4 months ago

Download lsi.research.telcordia.com

A new method for automatic indexing and retrieval is described. The approach is to take advantage of implicit higher-order structure in the association of terms with documents (&q...

Thomas K. Landauer, Susan T. Dumais

claim paper

Read More »

16

click to vote

PVLDB
2008

101views more PVLDB 2008»

Multidimensional content eXploration

13 years 4 months ago

Download pages.cs.wisc.edu

Content Management Systems (CMS) store enterprise data such as insurance claims, insurance policies, legal documents, patent applications, or archival data like in the case of dig...

Alkis Simitsis, Akanksha Baid, Yannis Sismanis, Be...

claim paper

Read More »

13

click to vote

PVLDB
2008

85views more PVLDB 2008»

Scalable ad-hoc entity extraction from text collections

13 years 4 months ago

Download research.microsoft.com

Supporting entity extraction from large document collections is important for enabling a variety of important data analysis tasks. In this paper, we introduce the "ad-hoc&quo...

Sanjay Agrawal, Kaushik Chakrabarti, Surajit Chaud...

claim paper

Read More »

17

click to vote

PR
2007

100views more PR 2007»

Estimation of skew angles for scanned documents based on piecewise covering by parallelograms

13 years 4 months ago

Download ocrlnx03.iis.sinica.edu.tw

We propose a fast and robust skew estimation method for scanned documents that estimates skew angles based on piecewise covering of objects, such as textlines, ﬁgures, forms, or...

Chien-Hsing Chou, Shih-Yu Chu, Fu Chang

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers