Sciweavers

RIAO
2007
13 years 6 months ago
Effectiveness of Rich Document Representation in XML Retrieval
Information Retrieval (IR) systems are built with different goals in mind. Some IR systems target high precision that is to have more relevant documents on the first page of their...
Fahimeh Raja, Mostafa Keikha, Maseud Rahgozar, Far...
RIAO
2007
13 years 6 months ago
An Information Retrieval Driven by Ontology: from Query to Document Expansion
The paper proposes an approach to information retrieval based on the use of a structure (ontology) both for document (resp. query) indexing and query evaluating. The conceptual st...
Mustapha Baziz, Mohand Boughanem, Gabriella Pasi, ...
SERP
2008
13 years 6 months ago
Practically Relevant Quality Criteria for Requirements Documents
Abstract. This paper presents common weaknesses of requirements documents from commercial software projects that frequently cause problems in practice. Many documents contain exten...
Tobias Simon, Jonathan Streit, Markus Pizka
IJCAI
2007
13 years 6 months ago
Semantic Smoothing of Document Models for Agglomerative Clustering
In this paper, we argue that the agglomerative clustering with vector cosine similarity measure performs poorly due to two reasons. First, the nearest neighbors of a document belo...
Xiaohua Zhou, Xiaodan Zhang, Xiaohua Hu
IJCAI
2007
13 years 6 months ago
Pseudo-Aligned Multilingual Corpora
In machine translation, document alignment refers to finding correspondences between documents which are exact translations of each other. We define pseudo-alignment as the task...
Fernando Diaz, Donald Metzler
LREC
2008
99views Education» more  LREC 2008»
13 years 6 months ago
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian
We aim to characterize the comparability of corpora, we address this issue in the trilingual context through the distinction of expert and non expert documents. We work separately...
Lorraine Goeuriot, Natalia Grabar, Béatrice...
LREC
2008
70views Education» more  LREC 2008»
13 years 6 months ago
An Approach to Modeling Heterogeneous Resources for Information Extraction
In this paper, we describe an approach that aims to model heterogeneous resources for information extraction. Document is modeled in graph representation that enables better under...
Lei Xia, José Iria
GRAPHICSINTERFACE
2008
13 years 6 months ago
An empirical characterisation of electronic document navigation
To establish an empirical foundation for analysis and redesign of document navigation tools, we implemented a system that logs all user actions within Microsoft Word and Adobe Rea...
Jason Alexander, Andy Cockburn
EMNLP
2008
13 years 6 months ago
An Exploration of Document Impact on Graph-Based Multi-Document Summarization
The graph-based ranking algorithm has been recently exploited for multi-document summarization by making only use of the sentence-to-sentence relationships in the documents, under...
Xiaojun Wan
ECIR
2008
Springer
13 years 6 months ago
A Wikipedia-Based Multilingual Retrieval Model
This paper introduces CL-ESA, a new multilingual retrieval model for the analysis of cross-language similarity. The retrieval model exploits the multilingual alignment of Wikipedia...
Martin Potthast, Benno Stein, Maik Anderka