Sciweavers

311 search results - page 4 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
IAT
2007
IEEE
14 years 17 days ago
An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns
This paper describes an intelligent agent to facilitate bitext mining from the Web via automatic discovery of URL pairing patterns (or keys) for retrieving parallel web pages. The...
Chunyu Kit, Jessica Yee Ha Ng
KDD
2008
ACM
195views Data Mining» more  KDD 2008»
14 years 6 months ago
Learning from multi-topic web documents for contextual advertisement
Contextual advertising on web pages has become very popular recently and it poses its own set of unique text mining challenges. Often advertisers wish to either target (or avoid) ...
Yi Zhang, Arun C. Surendran, John C. Platt, Mukund...
DEXA
2006
Springer
151views Database» more  DEXA 2006»
13 years 10 months ago
Personalized Detection of Fresh Content and Temporal Annotation for Improved Page Revisiting
Abstract. Page revisiting is a popular browsing activity in the Web. In this paper we describe a method for improving page revisiting by detecting and highlighting the information ...
Adam Jatowt, Yukiko Kawai, Katsumi Tanaka
WWW
2003
ACM
14 years 6 months ago
Mining topic-specific concepts and definitions on the web
Traditionally, when one wants to learn about a particular topic, one reads a book or a survey paper. With the rapid expansion of the Web, learning in-depth knowledge about a topic...
Bing Liu, Chee Wee Chin, Hwee Tou Ng
AIIA
2003
Springer
13 years 11 months ago
Preprocessing and Mining Web Log Data for Web Personalization
We describe the web usage mining activities of an on-going project, called ClickWorld3 , that aims at extracting models of the navigational behaviour of a web site users. The model...
Miriam Baglioni, U. Ferrara, Andrea Romei, Salvato...