Sciweavers

4124 search results - page 625 / 825
» Improved Algorithms for Data Migration
Sort
View
WSDM
2010
ACM
215views Data Mining» more  WSDM 2010»
16 years 2 months ago
Boilerplate Detection using Shallow Text Features
In addition to the actual content Web pages consist of navigational elements, templates, and advertisements. This boilerplate text typically is not related to the main content, ma...
Christian Kohlschütter, Peter Fankhauser, Wol...
ICML
2009
IEEE
15 years 11 months ago
Independent factor topic models
Topic models such as Latent Dirichlet Allocation (LDA) and Correlated Topic Model (CTM) have recently emerged as powerful statistical tools for text document modeling. In this pap...
Duangmanee Putthividhya, Hagai Thomas Attias, Srik...
WSDM
2009
ACM
113views Data Mining» more  WSDM 2009»
15 years 11 months ago
Time Will Tell: Leveraging Temporal Expressions in IR
Temporal expressions, such as between 1992 and 2000, are frequent across many kinds of documents. Text retrieval, though, treats them as common terms, thus ignoring their inherent...
Irem Arikan, Srikanta J. Bedathur, Klaus Berberich
WSDM
2009
ACM
148views Data Mining» more  WSDM 2009»
15 years 11 months ago
Information arbitrage across multi-lingual Wikipedia
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a r...
Eytan Adar, Michael Skinner, Daniel S. Weld
CEC
2009
IEEE
15 years 11 months ago
Coevolution of simulator proxies and sampling strategies for petroleum reservoir modeling
— Reservoir modeling is an on-going activity during the production life of a reservoir. One challenge to constructing accurate reservoir models is the time required to carry out ...
Tina Yu, Dave Wilkinson