Sciweavers

311 search results - page 41 / 63
» Cleaning Web Pages for Effective Web Content Mining
Sort
View
WEBDB
2010
Springer
178views Database» more  WEBDB 2010»
15 years 2 months ago
Using Latent-Structure to Detect Objects on the Web
An important requirement for emerging applications which aim to locate and integrate content distributed over the Web is to identify pages that are relevant for a given domain or ...
Luciano Barbosa, Juliana Freire
94
Voted
KAIS
2010
129views more  KAIS 2010»
14 years 8 months ago
Sentiment-oriented contextual advertising
Web advertising (Online advertising), a form of advertising that uses the World Wide Web to attract customers, has become one of the world’s most important marketing channels. Th...
Teng-Kai Fan, Chia-Hui Chang
CIKM
2009
Springer
15 years 4 months ago
Vetting the links of the web
Many web links mislead human surfers and automated crawlers because they point to changed content, out-of-date information, or invalid URLs. It is a particular problem for large, ...
Na Dai, Brian D. Davison
CICLING
2009
Springer
15 years 1 months ago
Language Identification on the Web: Extending the Dictionary Method
Abstract. Automated language identification of written text is a wellestablished research domain that has received considerable attention in the past. By now, efficient and effecti...
Radim Rehurek, Milan Kolkus
WWW
2009
ACM
15 years 10 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...