Sciweavers

368 search results - page 8 / 74
» Template-Based Information Mining from HTML Documents
Sort
View
SAINT
2005
IEEE
15 years 5 months ago
Learning Logic Wrappers for Information Extraction from the Web
This paper discusses a methodology for applying general-purpose first-order inductive learning to extract information from Web documents structured as unranked ordered trees. The...
Costin Badica, Elvira Popescu, Amelia Badica
DKE
2006
126views more  DKE 2006»
14 years 11 months ago
FRACTURE mining: Mining frequently and concurrently mutating structures from historical XML documents
In the past few years, the fast proliferation of available XML documents has stimulated a great deal of interest in discovering hidden and nontrivial knowledge from XML repositori...
Ling Chen 0002, Sourav S. Bhowmick, Liang-Tien Chi...
112
Voted
IPPS
2008
IEEE
15 years 6 months ago
Multi-threaded data mining of EDGAR CIKs (Central Index Keys) from ticker symbols
This paper describes how use the Java Swing HTMLEditorKit to perform multi-threaded web data mining on the EDGAR system (Electronic DataGathering, Analysis, and Retrieval system)....
Dougal A. Lyon
JCIT
2008
154views more  JCIT 2008»
14 years 11 months ago
Multimodal Web Content Conversion for Mobile Services in a U-City
A ubiquitous city is where everything is interconnected with everything else, where information is instantaneously shared. In a U-city, people can access a variety of web data in ...
Soosun Cho, HeeSook Shin
CIKM
2001
Springer
15 years 4 months ago
A Domain Independent Environment for Creating Information Extraction Modules
Text-Mining is a growing area of interest within the field of Data Mining and Knowledge Discovery. Given a collection of text documents, most approaches to Text Mining perform kno...
Ronen Feldman, Yonatan Aumann, Yair Liberzon, Kfir...