Sciweavers

945 search results - page 2 / 189
» Information Extraction from HTML: Application of a General M...
Sort
View
WSDM
2012
ACM
252views Data Mining» more  WSDM 2012»
12 years 25 days ago
WebSets: extracting sets of entities from the web using unsupervised information extraction
We describe a open-domain information extraction method for extracting concept-instance pairs from an HTML corpus. Most earlier approaches to this problem rely on combining cluste...
Bhavana Bharat Dalvi, William W. Cohen, Jamie Call...
ICDM
2006
IEEE
164views Data Mining» more  ICDM 2006»
13 years 11 months ago
Unsupervised Learning of Tree Alignment Models for Information Extraction
We propose an algorithm for extracting fields from HTML search results. The output of the algorithm is a database table– a data structure that better lends itself to high-level...
Philip Zigoris, Damian Eads, Yi Zhang
CICLING
2005
Springer
13 years 10 months ago
A Machine Learning Approach to Information Extraction
Information extraction is concerned with applying natural language processing to automatically extract the essential details from text documents. A great disadvantage of current ap...
Alberto Téllez-Valero, Manuel Montes-y-G&oa...
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
13 years 11 months ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
IPM
2007
149views more  IPM 2007»
13 years 5 months ago
Web page title extraction and its application
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields...
Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shu...