A Spark-Based Parallel Fuzzy $c$ -Means Segmentation Algorithm for Agricultural Image Big Data
Open Access
- 26 March 2019
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Access
- Vol. 7 (99), 42169-42180
- https://doi.org/10.1109/access.2019.2907573
Abstract
With the explosive growth of image big data in the agriculture field, image segmentation algorithms are confronted with unprecedented challenges. As one of the most important image segmentation technologies, the fuzzy C-means (FCM) algorithm has been widely used in the field of agricultural image segmentation as it provides simple computation and high quality segmentation. However, due to its large amount of computation, the sequential FCM algorithm is too slow to finish the segmentation task within an acceptable time. This paper proposes a parallel FCM segmentation algorithm based on the distributed memory computing platform Apache Spark for agricultural image big data. The input image is first converted from the RGB color space to the Lab color space and generates point cloud data. Then, point cloud data are partitioned and stored in different computing nodes, in which the membership degrees of pixel points to different cluster centers are calculated, and the cluster centers are updated iteratively in a data-parallel form until the stopping condition is satisfied. Finally, point cloud data are restored after clustering for reconstructing the segmented image. On the Spark platform, the performance of the parallel fuzzy C-means algorithm is evaluated and reaches an average speedup of 12.54 on 10 computing nodes. Experimental results show that the Spark-based parallel fuzzy C-means algorithm can obtain a significant increase in speedup, and the agricultural image testing set delivers a better performance improvement of 128% than the Hadoop-based approach. This research indicates that the Spark-based parallel FCM algorithm provides a faster speed of segmentation for agricultural image big data and has better scaleup and sizeup rates.Funding Information
- National Natural Science Foundation of China (61602388)
- Natural Science Basic Research Plan in Shaanxi Province of China (2017JM6059)
- China Postdoctoral Science Foundation (2017M613216)
- Shaanxi Province Postdoctoral Science Foundation (2016BSHEDZZ121)
- Natural Science Foundation of Hubei Province (2017CFB592)
- Fundamental Research Funds for the Central Universities (2452016081, 2452015194)
This publication has 43 references indexed in Scilit:
- Agricultural image target segmentation based on fuzzy setOptik, 2015
- Discrete wavelets transform for improving greenness image segmentation in agricultural imagesComputers and Electronics in Agriculture, 2015
- Segmentation of nutrient deficiency in incomplete crop images using intuitionistic fuzzy C-means clustering algorithmNonlinear Dynamics, 2015
- A fast image segmentation algorithm for detection of pseudo-foreign fibers in lint cottonComputers and Electrical Engineering, 2015
- A modified fuzzy C‐means algorithm using scale control spatial information for MRI image segmentation in the presence of noiseJournal of Chemometrics, 2015
- Distributed SAR image change detection based on SparkPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2015
- A GPU-based implementations of the fuzzy C-means algorithms for medical image segmentationThe Journal of Supercomputing, 2015
- Image Segmentation Using K -means Clustering Algorithm and Subtractive Clustering AlgorithmProcedia Computer Science, 2015
- Validating fuzzy partitions obtained through c-shells clusteringPattern Recognition Letters, 1996
- A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated ClustersJournal of Cybernetics, 1973