The Web and especially major Web search engines are essential tools in the quest to locate online information for many people. This paper reports results from research that examin...
It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Abstract Recent progress in mobile broadband communication and semantic web technology is enabling innovative internet services that provide advanced personalization and localizati...
The correct web site text content must be help to the visitors to find what they are looking for. However, the reality is quite different, many times the web page text content is a...
This work addresses the challenge of extracting structure in educational and training media based on the type of material that is presented during lectures and training sessions. ...