Sciweavers

CIKM
2009
Springer
13 years 11 months ago
Combining labeled and unlabeled data with word-class distribution learning
We describe a novel simple and highly scalable semi-supervised method called Word-Class Distribution Learning (WCDL), and apply it the task of information extraction (IE) by utili...
Yanjun Qi, Ronan Collobert, Pavel Kuksa, Koray Kav...
CIKM
2009
Springer
13 years 11 months ago
Dynamic in-page logging for flash-aware B-tree index
This paper presents Dynamic IPL B+ -tree (d-IPL in short) as a B+ -tree index variant for flash-based storage systems. The d-IPL B+ -tree adopts a dynamic In-Page Logging (IPL) s...
Gap-Joo Na, Sang-Won Lee, Bongki Moon
CIKM
2009
Springer
13 years 11 months ago
Reducing the risk of query expansion via robust constrained optimization
We introduce a new theoretical derivation, evaluation methods, and extensive empirical analysis for an automatic query expansion framework in which model estimation is cast as a r...
Kevyn Collins-Thompson
CIKM
2009
Springer
13 years 11 months ago
Generating synopses for document-element search
Scientists often search for document-elements like tables, figures, or algorithm pseudo-codes. Domain scientists and researchers report important data, results and algorithms usi...
Sumit Bhatia, Shibamouli Lahiri, Prasenjit Mitra
CIKM
2009
Springer
13 years 11 months ago
Efficient record-level wrapper induction
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, C. Lee Gile...
CIKM
2009
Springer
13 years 11 months ago
ASIC: algebra-based structural index comparison
Structural indices play a significant role in improving the efficiency of XML query evaluation. Being able to compare various structural indexing techniques is critical for a DBM...
Yuqing Wu, Sofia Brenes, Tejas Totade, Shijin Josh...
CIKM
2009
Springer
13 years 11 months ago
Completing wikipedia's hyperlink structure through dimensionality reduction
Wikipedia is the largest monolithic repository of human knowledge. In addition to its sheer size, it represents a new encyclopedic paradigm by interconnecting articles through hyp...
Robert West, Doina Precup, Joelle Pineau
CIKM
2009
Springer
13 years 11 months ago
Feature selection for ranking using boosted trees
Modern search engines have to be fast to satisfy users, so there are hard back-end latency requirements. The set of features useful for search ranking functions, though, continues...
Feng Pan, Tim Converse, David Ahn, Franco Salvetti...
CIKM
2009
Springer
13 years 11 months ago
Adaptive relevance feedback in information retrieval
Relevance Feedback has proven very effective for improving retrieval accuracy. A difficult yet important problem in all relevance feedback methods is how to optimally balance the...
Yuanhua Lv, ChengXiang Zhai
CIKM
2009
Springer
13 years 11 months ago
Utilizing inter-passage and inter-document similarities for re-ranking search results
We present a novel language-model-based approach to reranking an initially retrieved list so as to improve precision at top ranks. Our model integrates whole-document information ...
Eyal Krikon, Oren Kurland, Michael Bendersky