Swarm Intelligence in Data Science: Applications, Opportunities and Challenges

Abstract
The Swarm Intelligence (SI) algorithms have been proved to be a comprehensive method to solve complex optimization problems by simulating the emergence behaviors of biological swarms. Nowadays, data science is getting more and more attention, which needs quick management and analysis of massive data. Most traditional methods can only be applied to continuous and differentiable functions. As a set of population-based approaches, it is proven by some recent research works that the SI algorithms have great potential for relevant tasks in this field. In order to gather better insight into the utilization of these methods in data science and to provide a further reference for future researches, this paper focuses on the relationship between data science and swarm intelligence. After introducing the mainstream swarm intelligence algorithms and their common characteristics, both the theoretical and real-world applications in the literature which utilize the swarm intelligence to the related domains of data analytics are reviewed. Based on the summary of the existing works, this paper also analyzes the opportunities and challenges in this field, which attempts to shed some light on designing more effective algorithms to solve the problems in data science for real-world applications.