Sciweavers

141 search results - page 2 / 29
» URL Based Classification of Arabic Web Pages
Sort
View
LREC
2008
160views Education» more  LREC 2008»
13 years 6 months ago
Automatic Extraction of Textual Elements from News Web Pages
In this paper we present an algorithm for automatic extraction of textual elements, namely titles and full text, associated with news stories in news web pages. We propose a super...
Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany
TREC
2001
13 years 6 months ago
Retrieving Web Pages Using Content, Links, URLs and Anchors
For this year's web track, we concentrated on the entry page finding task. For the content-only runs, in both the ad-hoc task and the entry page finding task, we used an infor...
Thijs Westerveld, Wessel Kraaij, Djoerd Hiemstra
IEAAIE
2003
Springer
13 years 10 months ago
Applying Semantic Links for Classifying Web Pages
Automatic hypertext classification is an essential technique for organizing vast amount of Internet Web pages or HTML documents. One the of problems in classifying Web pages is tha...
Ben Choi, Qing Guo
CN
2000
109views more  CN 2000»
13 years 5 months ago
On near-uniform URL sampling
We consider the problem of sampling URLs uniformly at random from the Web. A tool for sampling URLs uniformly can be used to estimate various properties of Web pages, such as the ...
Monika Rauch Henzinger, Allan Heydon, Michael Mitz...
IC
2009
13 years 3 months ago
Language Based Crawling: Crawling the Arabic Content of the Web
- Crawling web pages written in Arabic or any other language with limited content in the web may, at first, seem to parallel the process of crawling the English content. However, t...
Saad H. Alabbad, Sultan Alanazi