Rapid Identification of BitTorrent traffic

Abstract
BitTorrent is one of the dominant traffic generating applications in the Internet today. The ability to identify BitTorrent traffic in real-time could allow network operators to better manage network traffic and provide a better service to their customers. In this paper we analyse the statistical properties of BitTorrent traffic and select four features that can be used for real-time classification using Machine Learning techniques. We then train and test a classifier using the C4.5 algorithm. Our results show that based on statistics calculated on 150-packet sub-flows, we can classify BitTorrent traffic with Recall of 98.2% and Precision of 96.5%. We then show that 98.1% of sub-flows from other client-server bulk transfer applications are correctly classified as non-BitTorrent.

This publication has 9 references indexed in Scilit: