Sciweavers

498 search results - page 38 / 100
» Robust web content extraction
Sort
View
DEXAW
2009
IEEE
160views Database» more  DEXAW 2009»
15 years 4 months ago
Automatic User Comment Detection in Flat Internet Fora
—Millions of people are using the World Wide Web and are publishing content online. This user generated content contains many information relevant not only to marketing but to co...
Mathias Bank, Michael Mattes
ITCC
2005
IEEE
15 years 3 months ago
Elimination of Redundant Information for Web Data Mining
These days, billions of Web pages are created with HTML or other markup languages. They only have a few uniform structures and contain various authoring styles compared to traditi...
Shakirah Mohd Taib, Soon-ja Yeom, Byeong Ho Kang
WWW
2001
ACM
15 years 10 months ago
Document Visualization on Small Displays
Limitation in display size and resolution on mobile devices is one of the main obstacles for wide-spread use of web applications in a wireless environment. Web pages are often too ...
Ka Kit Hoi, Dik Lun Lee, Jianliang Xu
ICASSP
2009
IEEE
15 years 4 months ago
Efficacy of a constantly adaptive language modeling technique for web-scale applications
In this paper, we describe CALM, a method for building statistical language models for the Web. CALM addresses several unique challenges dealing with the Web contents. First, CALM...
Kuansan Wang, Xiaolong Li
WWW
2006
ACM
15 years 10 months ago
CWS: a comparative web search system
In this paper, we define and study a novel search problem: Comparative Web Search (CWS). The task of CWS is to seek relevant and comparative information from the Web to help users...
Jian-Tao Sun, Xuanhui Wang, Dou Shen, Hua-Jun Zeng...