Sciweavers

» From HTML documents to web tables and rules
HICSS
2002
IEEE
Fuzzy Rules for HTML Transcoding
With the increasing availability of Web-enabled mobile devices, we face the problem of effectively adapting Web content for those devices. For adaptation, Web page structures r...
Robbie Schaefer, Andreas Dangberg, Wolfgang Mü...
IJMSO
2008
Categorisation of web documents using extraction ontologies
Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...
Li Xu, David W. Embley
PVLDB
2008
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
COLING
2000
Mining Tables from Large Scale HTML Texts
Table is a very common presentation scheme, but few papers touch on table extraction in text data mining. This paper focuses on mining tables from large-scale HTML texts. Table fi...
Hsin-Hsi Chen, Shih-Chung Tsai, Jin-He Tsai
ITCC
2005
IEEE
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang