Welcome to Journal of University of Chinese Academy of Sciences,Today is

›› 2007, Vol. 24 ›› Issue (2): 241-247.DOI: 10.7523/j.issn.2095-6134.2007.2.016

• 论文 • Previous Articles     Next Articles

Content-based clustered P2P search model depending on set distance

WANG Jing, ZHANG Huan-Jie, YANG Shou-Bao, GAO Ying   

  1. Department of Computer Science and Technology, University of Science and Technology of China, Hefei, 230026
  • Received:1900-01-01 Revised:1900-01-01 Online:2007-03-15

Abstract: In content-based unstructured P2P search system, the main issues that affect query efficiency and searching cost are the complexity of computing document similarity brought by high dimensions and the great deal of redundant messages coming with flooding. Content-based cluster P2P search model depending on set distance is proposed in this paper to reduce the query time and redundant messages. This model defines document similarity by set distance to restrain the complexity of computing the document similarity in linear time. Also, clustering peers based on content depending on set distance reduces the query time and decreases the redundant messages. Simulations show that this model not only has higher recall, but also reduces the search cost and query time to the rate of 40% and 30% of Gnutella.

Key words: Peer to Peer, Gnutella, Distributed Hash Tables, Set Distance, Vector Space Model

CLC Number: