Sciweavers

DASFAA
2005
IEEE

Real Datasets for File-Sharing Peer-to-Peer Systems

13 years 10 months ago
Real Datasets for File-Sharing Peer-to-Peer Systems
The fundamental drawback of unstructured peer-to-peer (P2P) networks is the flooding-based query processing protocol that seriously limits their scalability. As a result, a significant amount of research work has focused on designing efficient search protocols that reduce the overall communication cost. What is lacking, however, is the availability of real data, regarding the exact content of users’ libraries and the queries that these users ask. Using trace-driven simulations will clearly generate more meaningful results and further illustrate the efficiency of a generic query processing protocol under a real-life scenario. Motivated by this fact, we developed a Gnutella-style probe and collected detailed data over a period of two months. They involve around 4,500 users and contain the exact files shared by each user, together with any available metadata (e.g., artist for songs) and information about the nodes (e.g., connection speed). We also collected the queries initiated by t...
Shen-Tat Goh, Panos Kalnis, Spiridon Bakiras, Kian
Added 24 Jun 2010
Updated 24 Jun 2010
Type Conference
Year 2005
Where DASFAA
Authors Shen-Tat Goh, Panos Kalnis, Spiridon Bakiras, Kian-Lee Tan
Comments (0)