Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

17

ITCC
2005
IEEE

favoriteEmaildiscussreport

105views Information Technology» more ITCC 2005»

Elimination of Redundant Information for Web Data Mining

13 years 10 months ago

Elimination of Redundant Information for Web Data Mining

Download eprints.utas.edu.au

These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditional text-based documents. However, users usually focus on a particular section of the page that presents the most relevant information to their interest. Therefore, Web documents classification needs to group and filter the pages based on their contents and relevant information. Many researches on Web mining report on mining Web structure and extracting information from web contents. However, they have focused on detecting tables that convey specific data, not the tables that are used as a mechanism for structuring the layout of Web pages. Case modeling of tables can be ted based on structure abstraction. Furthermore, Ripple Down Rules (RDR) is used to implement knowledge organization and construction, because it supports a simple rule maintenance based on case and local validation.

Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang

Real-time Traffic

Information Technology | ITCC 2005 | Mining Web Structure | Relevant Information | Web Pages |

claim paper

Related Content

» Eliminating noisy information in Web pages for data mining

» EST2uni an open parallel tool for automated EST analysis and database creation with a data...

» Simplifying the Clickstream Retrieval Using Weblogger Tool

» Fuzzy Logic for Elimination of Redundant Information of Microarray Data

» Using the Web to Reduce Data Sparseness in PatternBased Information Extraction

» Web data mining exploring hyperlinks contents and usage data

» Adapting Information Extraction Knowledge For Unseen Web Sites

» Using Grammatical Inference to Automate Information Extraction from the Web

» Dimension reduction with redundant gene elimination for tumor classification

Post Info
More Details (n/a)

Added	25 Jun 2010
Updated	25 Jun 2010
Type	Conference
Year	2005
Where	ITCC
Authors	Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang

Comments (0)