Sciweavers

Technology of Text Mining
WWW
2007
ACM
16 years 6 months ago
Towards domain-independent information extraction from web tables
Traditionally, information extraction from web tables has focused on small, more or less homogeneous corpora, often based on assumptions about the use of <table> tags. A mul...
Bernhard Krüpl, Bernhard Pollak, Marcus Herzo...
WWW
2007
ACM
16 years 6 months ago
Classifying web sites
In this paper, we present a novel method for the classification of Web sites. This method exploits both structure and content of Web sites in order to discern their functionality....
Christoph Lindemann, Lars Littig
WWW
2006
ACM
16 years 6 months ago
Detecting spam web pages through content analysis
In this paper, we continue our investigations of "web spam": the injection of artificially-created pages into the web in order to influence the results from search engin...
Alexandros Ntoulas, Marc Najork, Mark Manasse, Den...
WWW
2006
ACM
16 years 6 months ago
Toward tighter integration of web search with a geographic information system
Integration of Web search with geographic information has recently attracted much attention. There are a number of local Web search systems enabling users to find location-specific...
Taro Tezuka, Takeshi Kurashima, Katsumi Tanaka
WWW
2006
ACM
16 years 6 months ago
What's really new on the web?: identifying new pages from a series of unstable web snapshots
Identifying and tracking new information on the Web is important in sociology, marketing, and survey research, since new trends might be apparent in the new information. Such chan...
Masashi Toyoda, Masaru Kitsuregawa