Sciweavers

67 search results - page 9 / 14
» 2D Conditional Random Fields for Web information extraction
Sort
View
WIDM
2004
ACM
15 years 5 months ago
Probabilistic models for focused web crawling
A Focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen
75
Voted
EMNLP
2007
15 years 1 months ago
Extracting Data Records from Unstructured Biomedical Full Text
In this paper, we address the problem of extracting data records and their attributes from unstructured biomedical full text. There has been little effort reported on this in the ...
Donghui Feng, Gully Burns, Eduard H. Hovy
SIGMOD
2010
ACM
212views Database» more  SIGMOD 2010»
14 years 10 months ago
Understanding deep web search interfaces: a survey
This paper presents a survey on the major approaches to search interface understanding. The Deep Web consists of data that exist on the Web but are inaccessible via text search en...
Ritu Khare, Yuan An, Il-Yeol Song
SIGIR
2008
ACM
14 years 11 months ago
A unified and discriminative model for query refinement
This paper addresses the issue of query refinement, which involves reformulating ill-formed search queries in order to enhance relevance of search results. Query refinement typica...
Jiafeng Guo, Gu Xu, Hang Li, Xueqi Cheng
EMNLP
2009
14 years 9 months ago
Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment
Traditionally, machine learning approaches for information extraction require human annotated data that can be costly and time-consuming to produce. However, in many cases, there ...
Kedar Bellare, Andrew McCallum