Welcome to Journal of University of Chinese Academy of Sciences,Today is

›› 2016, Vol. 33 ›› Issue (4): 562-569.DOI: 10.7523/j.issn.2095-6134.2016.04.019

Previous Articles     Next Articles

Cyberspace device identification based on K-means with cosine distance measure

CAO Laicheng1, ZHAO Jianjun1,2, CUI Xiang2, LI Ke2,3   

  1. 1. School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China;
    2. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China;
    3. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2016-01-07 Revised:2016-03-17 Online:2016-07-15

Abstract:

Since the traditional web fingerprinting methods are limited to identification of mainstream web server softwares, a kind of cyberspace device identification model based on K-means with cosine distance measure is proposed.Firstly, identification model is designed and verification method is determined.Secondly, the header fields and the status code of HTTP response are selected as characteristics of terminal device and then the characteristics are transformed into 32-dimensional feature vector by feature extraction and vectorization.Thirdly, cosine distance function is selected as similarity measuring function in K-means.Finally, experiment algorithm process is designed according to the identification model and the experiments for unlabeled samples and labeled samples are carried out.The results show that the identification model works for many kinds of terminal devices, including wireless router, web camera, and intelligent switch, and has high accuracy rate and low omission rate.

Key words: cyberspace, terminal device, K-means, cosine measure, fingerprinting

CLC Number: