Sciweavers

219 search results - page 8 / 44
» Web page language identification based on URLs
Sort
View
DEXA
2006
Springer
160views Database» more  DEXA 2006»
15 years 4 months ago
Clustering of Search Engine Keywords Using Access Logs
Abstract. It the becomes possible that users can get kinds of information by just inputting search keyword(s) representing the topic which users are interested in. But it is not al...
Shingo Otsuka, Masaru Kitsuregawa
IAT
2010
IEEE
14 years 10 months ago
Semantic Structure Content for Dynamic Web Pages
Representing web data into a machine understandable format is a curtail task for the next generation of the web. Most of current web pages are dynamic pages. A large percentage of...
Mamdouh Farouk, Mitsuru Ishizuka
IC
2009
14 years 10 months ago
Language Based Crawling: Crawling the Arabic Content of the Web
- Crawling web pages written in Arabic or any other language with limited content in the web may, at first, seem to parallel the process of crawling the English content. However, t...
Saad H. Alabbad, Sultan Alanazi
CIKM
1999
Springer
15 years 4 months ago
Word Segmentation and Recognition for Web Document Framework
It is observed that a better approach to Web information understanding is to base on its document framework, which is mainly consisted of (i) the title and the URL name of the pag...
Chi-Hung Chi, Chen Ding, Andrew Lim
ECAI
2008
Springer
15 years 2 months ago
Reinforcement Learning with Classifier Selection for Focused Crawling
Focused crawlers are programs that wander in the Web, using its graph structure, and gather pages that belong to a specific topic. The most critical task in Focused Crawling is the...
Ioannis Partalas, Georgios Paliouras, Ioannis P. V...