Sciweavers

» From Algorithmic to Subjective Randomness
KDD
2006
ACM
16 years 4 months ago
Simultaneous record detection and attribute labeling in web data extraction
Recent work has shown the feasibility and promise of template-independent Web data extraction. However, existing approaches use decoupled strategies, attempting to do data record ...
Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Y...
SIGMOD
2007
ACM
16 years 4 months ago
Cardinality estimation using sample views with quality assurance
Accurate cardinality estimation is critically important to high-quality query optimization. It is well known that conventional cardinality estimation based on histograms or simila...
Per-Åke Larson, Wolfgang Lehner, Jingren Zho...
WWW
2010
ACM
15 years 11 months ago
SourceRank: relevance and trust assessment for deep web sources based on inter-source agreement
We consider the problem of deep web source selection and argue that existing source selection methods are inadequate as they are based on local similarity assessment. Specificall...
Raju Balakrishnan, Subbarao Kambhampati
CIKM
2009
Springer
15 years 10 months ago
Automatic retrieval of similar content using search engine query interface
We consider the coverage testing problem where we are given a document and a corpus with a limited query interface and asked to find if the corpus contains a near-duplicate of th...
Ali Dasdan, Paolo D'Alberto, Santanu Kolay, Chris ...
SIGIR
2009
ACM
15 years 10 months ago
A latent topic model for linked documents
Documents in many corpora, such as digital libraries and webpages, contain both content and link information. To explicitly consider the document relations represented by links, i...
Zhen Guo, Shenghuo Zhu, Yun Chi, Zhongfei Zhang, Y...