International Journal of Mathematical, Engineering and Management Sciences

ISSN: 2455-7749

ACOCA: Ant Colony Optimization Based Clustering Algorithm for Big Data Preprocessing

Neelam Singh
Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun, Uttarakhand, India.

Devesh Pratap Singh
Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun, Uttarakhand, India.

Bhasker Pant
Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun, Uttarakhand, India.

DOI https://dx.doi.org/10.33889/IJMEMS.2019.4.5-098

Received on May 31, 2018
  ;
Accepted on June 28, 2019

Abstract

Big Data is rapidly gaining impetus and is attracting a community of researchers and organization from varying sectors due to its tremendous potential. Big Data is considered as a prospective raw material to acquire domain specific knowledge to gain insights related to management, planning, forecasting and security etc. Due to its inherent characteristics like capacity, swiftness, genuineness and diversity Big Data hampers the efficiency and effectiveness of search and leads to optimization problems. In this paper we explore the complexity imposed by big search spaces leading to optimization issues. In order to overcome the above mentioned issues we propose a hybrid algorithm for Big Data preprocessing ACO-clustering algorithm approach. The proposed algorithm can help to increase search speed by optimizing the process. As the proposed method using ant colony optimization with clustering algorithm it will also contribute to reducing pre-processing time and increasing analytical accuracy and efficiency.

Keywords- Big Data, ACO, Clustering, Optimization, Preprocessing.

Citation

Singh, N., Singh, D. P., & Pant, B. (2019). ACOCA: Ant Colony Optimization Based Clustering Algorithm for Big Data Preprocessing. International Journal of Mathematical, Engineering and Management Sciences, 4(5), 1239-1250. https://dx.doi.org/10.33889/IJMEMS.2019.4.5-098.

Conflict of Interest

The authors confirm that this article contents have no conflict of interest.

Acknowledgements

The authors acknowledge and express the gratitude for the support of Graphic Era Deemed to be University, in Dehradun, Uttarakhand, India.

References

Ashok, L., & Messinger, D.W. (2012, May). A spectral image clustering algorithm based on ant colony optimization. In Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XVIII, (Vol. 8390, p. 83901P). International Society for Optics and Photonics.

Deneubourg, J.L., Goss, S., Franks, N., Sendova-Franks, A., Detrain, C., & Chrétien, L. (1991, February). The dynamics of collective sorting robot-like ants and ant-like robots. In Proceedings of the First International Conference on Simulation of Adaptive Behavior on From Animals to Animats, (pp. 356-363).

Dorigo, M., & Di Caro, G. (1999). Ant colony optimization: a new meta-heuristic. In Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), (Vol. 2, pp. 1470-1477). IEEE.

Fahad, A., Alshatri, N., Tari, Z., Alamri, A., Khalil, I., Zomaya, A.Y., & Bouras, A. (2014). A survey of clustering algorithms for big data: taxonomy and empirical analysis. IEEE Transactions on Emerging Topics in Computing, 2(3), 267-279.

Fong, S., Deb, S., Yang, X.S., & Zhuang, Y. (2014). Towards enhancement of performance of K-Means clustering using nature-inspired optimization algorithms. The Scientific World Journal, Article ID 564829.

Han, J., Pei, J., & Kamber, M. (2011). Data mining: concepts and techniques. Elsevier.

Handl, J., Knowles, J., & Dorigo, M. (2003, July). Ant-based clustering: a comparative study of its relative performance with respect to k-means, average link and 1d-som. In Proceedings of the Third International Conference on Hybrid Intelligent Systems, IOS Press.

Jafar, O.M., & Sivakumar, R. (2010). Ant-based clustering algorithms: a brief survey. International Journal of Computer Theory and Engineering, 2(5), 787-796.

Jiang, L., Ding, L., Peng, Y., & Zhao, C. (2011, July). An efficient clustering approach using ant colony algorithm in mutidimensional search space. In 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), (Vol. 2, pp. 1085-1089). IEEE.

Kurasova, O., Marcinkevicius, V., Medvedev, V., Rapecka, A., & Stefanovic, P. (2014, November). Strategies for big data clustering. In 2014 IEEE 26th International Conference on Tools with Artificial Intelligence, (pp. 740-747). IEEE.

Liu, X., & Fu, H. (2010). An effective clustering algorithm with ant colony. Journal of Computers, 5(4), 598-605.

Lumer, E.D., & Faieta, B. (1994, July). Diversity and adaptation in populations of clustering ants. In Proceedings of the Third International Conference on Simulation of Adaptive Behavior: From Animals to Animats 3: From Animals to Animats, 3 (pp. 501-508). MIT Press.

Mahmood, A., Shi, K., Khatoon, S., & Xiao, M. (2013). Data mining techniques for wireless sensor networks: a survey. International Journal of Distributed Sensor Networks, 9(7), 406316.

Menéndez, H.D., Otero, F.E., & Camacho, D. (2014, September). MACOC: a medoid-based ACO clustering algorithm. In International Conference on Swarm Intelligence, (pp. 122-133). Springer, Cham.

Menéndez, H.D., Otero, F.E., & Camacho, D. (2016). Medoid-based clustering using ant colony optimization. Swarm Intelligence, 10(2), 123-145.

Nanda, S.J., & Panda, G. (2014). A survey on nature inspired metaheuristic algorithms for partitional clustering. Swarm and Evolutionary Computation, 16, 1-18.

Niknam, T., & Amiri, B. (2010). An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis. Applied Soft Computing, 10(1), 183-197.

Peng, L., Dong, G.Y., Dai, F.F., & Liu, G.P. (2014). A new clustering algorithm based on ACO and K-medoids optimization methods. IFAC Proceedings Volumes, 47(3), 9727-9731.

Singh, N., Garg, N., & Pant, J. (2014). Document clustering using feature selection based on multiviewpoint and link similarity measure. International Journal Computer Technology & Applications, 5(3), 1151-1155.

Singh, N., Singh, D.P., & Pant, B. (2017, December). A comprehensive study of big data machine learning approaches and challenges. In 2017 International Conference on Next Generation Computing and Information Systems (ICNGCIS), (pp. 80-85). IEEE.

Soheily-Khah, S. (2016). Generalized k-means-based clustering for temporal data under time warp, (Doctoral dissertation). Université Grenoble Alpes, English. NNT: 2016GREAM064. tel-01680370v2.

Xing, E.P., Ho, Q., Xie, P., & Wei, D. (2016). Strategies and principles of distributed machine learning on big data. Engineering, 2(2), 179-195.

Yang, J., & Li, X. (2013, October). Mapreduce based method for big data semantic clustering. In 2013 IEEE International Conference on Systems, Man, and Cybernetics, (pp. 2814-2819). IEEE.

Yang, Y., & Kamel, M.S. (2006). An aggregated clustering approach using multi-ant colonies algorithms. Pattern Recognition, 39(7), 1278-1289.

Privacy Policy| Terms & Conditions