Sciweavers

SIGIR
2008
ACM
13 years 4 months ago
A user browsing model to predict search engine click data from past observations
Search engine click logs provide an invaluable source of relevance information but this information is biased because we ignore which documents from the result list the users have...
Georges Dupret, Benjamin Piwowarski
SIGIR
2008
ACM
13 years 4 months ago
Predicting when browsing context is relevant to search
We investigate a representative case of sudden information need change of Web users. By analyzing search engine query logs, we show that the majority of queries submitted by users...
Mandar Rahurkar, Silviu Cucerzan
IJBRA
2007
13 years 4 months ago
Biomedical ontology improves biomedical literature clustering performance: a comparison study
Document clustering has been used for better document retrieval and text mining. In this paper, we investigate if a biomedical ontology improves biomedical literature clustering ...
Illhoi Yoo, Xiaohua Hu, Il-Yeol Song
PAA
2006
13 years 4 months ago
Automatic name extraction from degraded document images
The problem addressed in this paper is the automatic extraction of names from a document image. Our approach relies on the combination of two complementary analyses. First, the ima...
Laurence Likforman-Sulem, Pascal Vaillant, Aliette...
PR
2008
13 years 4 months ago
Retrieval of machine-printed Latin documents through Word Shape Coding
This paper reports a document retrieval technique that retrieves machine-printed Latin-based document images through word shape coding. Adopting the idea of image annotation, a wo...
Shijian Lu, Chew Lim Tan
KES
2006
Springer
13 years 4 months ago
Integrated Document Browsing and Data Acquisition for Building Large Ontologies
Named entities (e.g., "Kofi Annan", "Coca-Cola", "Second World War") are ubiquitous in web pages and other types of document and often provide a simpl...
Felix Weigel, Klaus U. Schulz, Levin Brunner, Edua...
JSA
2006
13 years 4 months ago
A flocking based algorithm for document clustering analysis
Social animals or insects in nature often exhibit a form of emergent collective behavior known as flocking. In this paper, we present a novel Flocking based approach for doc...
Xiaohui Cui, Jinzhu Gao, Thomas E. Potok
ENGL
2007
13 years 4 months ago
Skew Estimation Technique for Binary Document Images based on Thinning and Moments
When a document is fed to a scanner either mechanically or by a human operator for digitization, it suffers from some degree of skew or tilt. Skew angle detection is an important...
Aradhya V. N. Manjunath, G. Hemantha Kumar, P. Shi...
JUCS
2008
13 years 4 months ago
Stacked Dependency Networks for Layout Document Structuring
We address the problems of structuring and annotation of layout-oriented documents. We model the annotation problems as the collective classification on graph-like structures wit...
Boris Chidlovskii, Loïc Lecerf
JUCS
2008
13 years 4 months ago
A Generic Architecture for the Conversion of Document Collections into Semantically Annotated Digital Archives
Mass digitization of document collections with further processing and semantic annotation is an increasing activity among libraries and archives at large for preservation, browsi...
Josep Lladós, Dimosthenis Karatzas, Joan Ma...