Sciweavers

467 search results - page 53 / 94
» Pat-tree-based Keyword Extraction for Chinese Information Re...
Sort
View
PKDD
2007
Springer
120views Data Mining» more  PKDD 2007»
15 years 7 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
AGENTS
1998
Springer
15 years 5 months ago
WebMate: A Personal Agent for Browsing and Searching
The World-Wide Web is developing very fast. Currently, nding useful information on the Web is a time consuming process. In this paper, we present WebMate, an agent that helps user...
Liren Chen, Katia P. Sycara
WWW
2005
ACM
16 years 2 months ago
The volume and evolution of web page templates
Web pages contain a combination of unique content and template material, which is present across multiple pages and used primarily for formatting, navigation, and branding. We stu...
David Gibson, Kunal Punera, Andrew Tomkins
AIRWEB
2009
Springer
15 years 8 months ago
A study of link farm distribution and evolution using a time series of web snapshots
In this paper, we study the overall link-based spam structure and its evolution which would be helpful for the development of robust analysis tools and research for Web spamming a...
Young-joo Chung, Masashi Toyoda, Masaru Kitsuregaw...
CLEF
2005
Springer
15 years 7 months ago
Evaluating a Conceptual Indexing Method by Utilizing WordNet
This paper describes our participation to the English Girt Task of CLEF 2005 Campaign. A method for conceptual indexing based on WordNet is used. Both documents and queries are map...
Mustapha Baziz, Mohand Boughanem, Nathalie Aussena...