Sciweavers

ICDT
1999
ACM

The Data Warehouse of Newsgroups

13 years 9 months ago
The Data Warehouse of Newsgroups
Electronic newsgroups are one of the primary means for the dissemination, exchange and sharing of information. We argue that the current newsgroup model is unsatisfactory, especially when posted articles are relevant to multiple newsgroups. We demonstrate that considerable additional exibility can be achieved by managing newsgroups in a data warehouse, where each article is a tuple of attribute-value pairs, and each newsgroup is a view on the set of all posted articles. Supporting this paradigm for a large set of newsgroups makes it imperative to e ciently support a very large number of views: this is the key di erence between newsgroup data warehouses and conventional data warehouses. We identify two complementary problems concerning the design of such a newsgroup data warehouse. An important design decision that the system needs to make is which newsgroup views to eagerly maintain (i.e., materialize). We demonstrate the intractability ofthe general newsgroupselection problem, conside...
Himanshu Gupta, Divesh Srivastava
Added 04 Aug 2010
Updated 04 Aug 2010
Type Conference
Year 1999
Where ICDT
Authors Himanshu Gupta, Divesh Srivastava
Comments (0)