Sciweavers

2108 search results - page 292 / 422
» Visual Data Mining of Web Navigational Data
ACL
2009
15 years 1 month ago
MARS: Multilingual Access and Retrieval System with Enhanced Query Translation and Document Retrieval
In this paper, we introduce a multilingual access and retrieval system with enhanced query translation and multilingual document retrieval, by mining bilingual terminologies and a...
Lianhau Lee, AiTi Aw, Thuy Vu, Sharifah Aljunied M...
DAS
2006
Springer
15 years 6 months ago
XCDF: A Canonical and Structured Document Format
Accessing the structured content of PDF documents is a difficult task, requiring pre-processing and reverse engineering techniques. In this paper, we first present different methods...
Jean-Luc Bloechle, Maurizio Rigamonti, Karim Hadja...
MIE
2008
15 years 5 months ago
Using Knowledge for Indexing Health Web Resources in a Quality-Controlled Gateway
Objectives: The aim of this study is to provide indexers with the MeSH terms to be considered major ones in a list of terms automatically extracted from a document. Material and metho...
Michel Joubert, Stéfan Jacques Darmoni, Pau...
WSDM
2010
ACM
15 years 11 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
KDD
2008
ACM
16 years 4 months ago
Stream prediction using a generative model based on frequent episodes in event sequences
This paper presents a new algorithm for sequence prediction over long categorical event streams. The input to the algorithm is a set of target event types whose occurrences we wis...
Srivatsan Laxman, Vikram Tankasali, Ryen W. White