Sciweavers

467 search results - page 30 / 94
» Pat-tree-based Keyword Extraction for Chinese Information Re...
Sort
View
WWW
2010
ACM
15 years 8 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
HT
2003
ACM
15 years 6 months ago
Extracting evolution of web communities from a series of web archives
Recent advances in storage technology make it possible to store a series of large Web archives. It is now an exciting challenge for us to observe evolution of the Web. In this pap...
Masashi Toyoda, Masaru Kitsuregawa
WWW
2009
ACM
16 years 2 months ago
Extracting community structure through relational hypergraphs
Social media websites promote diverse user interaction on media objects as well as user actions with respect to other users. The goal of this work is to discover community structu...
Yu-Ru Lin, Jimeng Sun, Paul Castro, Ravi B. Konuru...
SIGIR
2010
ACM
15 years 5 months ago
Short text classification in twitter to improve information filtering
In microblogging services such as Twitter, the users may become overwhelmed by the raw data. One solution to this problem is the classification of short text messages. As short te...
Bharath Sriram, Dave Fuhry, Engin Demir, Hakan Fer...
RIAO
1997
15 years 2 months ago
Towards Sophisticated Wrapping of Web-based information Repositories
Access to on-line information via the Web is exploding. Index and retrieval engines already start to integrate a huge variety of heterogeneous repositories. However, the heterogen...
Boris Chidlovskii, Uwe M. Borghoff, Pierre-Yves Ch...