Sciweavers

3371 search results - page 124 / 675
» Using parsimonious language models on web data
Sort
View
IPM
2008
102views more  IPM 2008»
14 years 10 months ago
Fast exact maximum likelihood estimation for mixture of language model
Language modeling is an effective and theoretically attractive probabilistic framework for text information retrieval. The basic idea of this approach is to estimate a language mo...
Yi Zhang 0001, Wei Xu
SEMWEB
2007
Springer
15 years 4 months ago
YARS2: A Federated Repository for Querying Graph Structured Data from the Web
We present the architecture of an end-to-end semantic search engine that uses a graph data model to enable interactive query answering over structured and interlinked data collecte...
Andreas Harth, Jürgen Umbrich, Aidan Hogan, S...
SIGIR
2009
ACM
15 years 4 months ago
Identifying the original contribution of a document via language modeling
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
IUI
2003
ACM
15 years 3 months ago
Dynamic web page authoring by example using ontology-based domain knowledge
Authoring dynamic web pages is an inherently difficult task. We present DESK, an interactive authoring tool that allows the customization of dynamic page generation procedures wit...
José Antonio Macías Iglesias, Pablo ...
ICDM
2007
IEEE
476views Data Mining» more  ICDM 2007»
15 years 4 months ago
FiVaTech: Page-Level Web Data Extraction from Template Pages
In this paper, we proposed a new approach, called FiVaTech for the problem of Web data extraction. FiVaTech is a page-level data extraction system which deduces the data schema an...
Mohammed Kayed, Chia-Hui Chang, Khaled F. Shaalan,...