Latent Feature Separation and Extraction with Multiple Parallel Encoders for Convolutional Autoencoder

Journal: Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022

Citation: Proceedings - 2022 IEEE International Conference on Big Data and Smart Computing, BigComp 2022, pp.263-266

Keyword: Convolutional Autoencoder correlation coefficient feature extraction image clustering independence Multi-head Convolutional Autoencoder multiple parallel encoders

Mesh Keyword: Auto encoders Convolutional autoencoder Correlation coefficient Features extraction Image clustering Image data Independence Multi-head convolutional autoencoder Multiple parallel encoder

All Science Classification Codes (ASJC): Artificial Intelligence Computer Science Applications Computer Vision and Pattern Recognition Information Systems and Management Health Informatics

Abstract: Much of the real-world image data is unlabeled or mislabeled. Therefore, even if there is no label, if similar images can be grouped together with image data itself and used the group as a label, more image data can be effectively used in various tasks. This will be especially effective when dividing images belonging to the same domain into sub-groups. Therefore, in this study, we propose an image feature extraction method to be used for image clustering. The proposed feature extraction model is the Multi-head Convolutional Autoencoder (MCAE), which is a model composed of multiple encoders in parallel based on the Convolutional Autoencoder (CAE). The proposed model showed about 14% lower test reconstruction loss compared to CAE, and the correlation coefficient between extracted features was about 56% lower. In addition, as the results of clustering based on the extracted features, MCAE-based clustering showed about 3.5 times higher silhouette score than that CAE-based clustering.

URI: https://aurora.ajou.ac.kr/handle/2018.oak/36792
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85127557384&origin=inward

Journal URL: http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=9736461

Funding: ACKNOWLEDGMENT This research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (2021R1A2C200347411). Also, this research was supported by the BK21 FOUR program of the National Research Foundation of Korea funded by the Ministry of Education (NRF5199991014091) and the Ajou University research fund.

qrcode