CNN-Based-Fake-Image-Identification-with-Improved-Generalization

IVC(Image & Vision Computing) Lab / Pukyong Nat'l Univ Electronic Engineering / Busan, Republic of Korea
Jeonghan Lee, Hanhoon Park(Major Professor)

Paper(Korean) : J. Lee and H. Park, "CNN-Based Fake Image Identification with Improved Generalization," Journal of Korea Multimedia Society, Vol. 24, No. 12, pp. 1624-1631, 2021.

Abstract : With the continued development of image processing technology, we live in a time when it is difficult to visually distinguish processed (or tampered) images from real ones. As the risk of fake images being misused for crime increases, image forensics for identifying fake images is becoming increasingly important. Various deep learning-based identifiers have been studied, but many problems remain before they can be used in real situations. Because deep learning relies strongly on the given training data, such identifiers are very vulnerable when evaluating data they have never seen. Therefore, we try to find ways to improve the generalization ability of deep learning-based fake image identifiers. First, images with various contents were added to the training dataset to resolve the over-fitting problem in which the identifier can classify real and fake images with specific contents but fails on those with other contents. Next, color spaces other than RGB were exploited; that is, fake image identification was attempted in color spaces not considered when creating the fake images, such as HSV and YCbCr. Finally, dropout, which is commonly used to improve the generalization of neural networks, was applied. Experimental results confirmed that combining these approaches can greatly improve the accuracy and generalization ability of deep learning-based identifiers in identifying fake images that have never been seen before.

Settings

Dataset

Category : LSUN - cat, church_outdoor, train, airplane, bus, cow, bridge, bedroom, classroom, restaurant, sheep
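For each of Real and Fake, 25K training images and 2K test images were randomly sampled.
-> Mixed training sets were composed so that they do not exceed 25K images. Each test set consists of a single category only.
The sheep category was excluded from the mixing process and used as the representative unseen test set.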
Image size : 256X256
GAN : ProGAN
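
The mixing rule above can be sketched as follows. This is a minimal illustration and not the authors' script: the equal share per category, the function name `build_mixed_train_set`, and the file names are all assumptions, since the README only states that a mixed training set must not exceed 25K images and that sheep is kept out of the mix.

```python
import random

def build_mixed_train_set(images_by_category, cap=25_000, seed=0):
    """Sample an equal share from each category so the total stays within `cap`.

    Equal shares are an assumption; the README only gives the 25K limit.
    """
    rng = random.Random(seed)
    categories = sorted(images_by_category)
    per_category = cap // len(categories)
    mixed = []
    for name in categories:
        pool = images_by_category[name]
        # Never ask for more images than the category actually has.
        mixed.extend(rng.sample(pool, min(per_category, len(pool))))
    rng.shuffle(mixed)
    return mixed

# Hypothetical file names; sheep is deliberately left out of the pools.
pools = {
    name: [f"{name}_{i}.png" for i in range(30_000)]
    for name in ("cat", "church_outdoor", "train")
}
mixed = build_mixed_train_set(pools)
print(len(mixed))
```

The same function would be called once for the real images and once for the fake images, so that each class stays within the limit.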

Feature Extraction

CNN Model : Pelee with HPF
Color Space : RGB, HSV, YCbCr
Dropout : N/A , 0.2, 0.5
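
As a rough illustration of the color space option, the sketch below converts 8-bit RGB pixels to HSV or YCbCr before they would be fed to the network. It assumes the full-range BT.601 (JPEG-style) YCbCr formula and HSV channels scaled to [0, 1]; the README does not say which conversion convention or library was actually used, and the helper names are hypothetical.

```python
import colorsys

def rgb_to_ycbcr(r, g, b):
    """Full-range BT.601 (JPEG-style) conversion for 8-bit RGB values."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr

def rgb_to_hsv(r, g, b):
    """HSV with every channel in [0, 1], computed by the standard library."""
    return colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)

def convert_image(pixels, space):
    """Convert a list of rows of (r, g, b) tuples to the requested color space."""
    converters = {
        "RGB": lambda r, g, b: (r, g, b),
        "HSV": rgb_to_hsv,
        "YCbCr": rgb_to_ycbcr,
    }
    convert = converters[space]
    return [[convert(*px) for px in row] for row in pixels]

# A 1x2 toy image: one pure red pixel and one white pixel.
image = [[(255, 0, 0), (255, 255, 255)]]
print(convert_image(image, "HSV"))
print(convert_image(image, "YCbCr"))
```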

Spec

epoch = 50, batch_size = 64, learning_rate = 0.001, optimizer = Adam
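
For a sense of scale, the settings above translate into a number of optimizer steps as sketched below. The calculation assumes a full training set of 25K real plus 25K fake images and that the last partial batch of each epoch is kept; neither detail is stated in the README.

```python
import math

# Values from the Spec line above.
epochs = 50
batch_size = 64

# Assumption: 25K real + 25K fake training images.
num_train_images = 2 * 25_000

steps_per_epoch = math.ceil(num_train_images / batch_size)
total_steps = steps_per_epoch * epochs
print(steps_per_epoch, total_steps)  # 782 39100
```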

Result

Experiment 1 : Accuracy of identifying fake images according to the number of image categories

[Tab 1] Fake image identification rates of Pelee+HPF depending on the number of image categories used in the training step

[Fig 1] Change in fake image identification rates for the images that are not included in the image categories used in the training step as the number of image categories used in the training step increases

Experiment 2 : Accuracy of identifying fake images according to color space

[Tab 2] Fake image identification rates of Pelee+HPF when the image color space was changed to HSV

[Tab 3] Fake image identification rates of Pelee+HPF when the image color space was changed to YCbCr

[Fig 2] Change in fake image identification rates for the images that are not included in the image categories used in the training step when using different color spaces

Experiment 3 : Accuracy of identifying fake images according to dropout ratio

[Tab 4] Fake image identification rates of Pelee+HPF with a dropout ratio of 0.2

[Tab 5] Fake image identification rates of Pelee+HPF with a dropout ratio of 0.5

[Fig 3] Change in fake image identification rates for the images that are not included in the image categories used in the training step when using different dropout ratios
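
For readers unfamiliar with the dropout ratio compared in this experiment, here is a minimal sketch of inverted dropout on a flat list of activations. It only illustrates what a ratio of 0.2 or 0.5 means; where dropout is inserted in Pelee is not stated in the README, and the function below is hypothetical rather than the authors' code.

```python
import random

def dropout(activations, rate, rng, training=True):
    """Inverted dropout: zero each value with probability `rate` and
    rescale the survivors so the expected value stays unchanged."""
    if not training or rate == 0.0:
        return list(activations)
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]

rng = random.Random(0)
features = [1.0] * 10
print(dropout(features, 0.2, rng))                  # survivors become 1.25
print(dropout(features, 0.5, rng))                  # survivors become 2.0
print(dropout(features, 0.5, rng, training=False))  # unchanged at test time
```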

Experiment 4 : The optimal combination for improving generalization ability and comparison of performance with Xception

[Tab 6] Fake image identification rates of Pelee+HPF with the RGB to HSV color conversion and a dropout of 0.2

[Tab 7] Fake image identification rates of Xception

[Fig 4] Change in fake image identification rates of combinations of generalization methods, and comparison with Xception

Conclusion

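As the number of image categories (or contents) used for training increases, the identification accuracy on categories not used in training (unseen contents) improves.
Applying color space conversion to HSV or YCbCr, or applying dropout, improves generalization performance.
-> However, rather than using all methods together, converting to the HSV color space and increasing the number of image categories (or contents) used for training was the most effective.
This paper used Pelee as the base deep learning model; further research is needed on methods for improving the generalization performance of other deep learning models, including Xception.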

This content is inspired by the documents below :

T. Karras, S. Laine, M. Aittala, J. Hellsten, J. Lehtinen, and T. Aila, "Analyzing and Improving the Image Quality of StyleGAN," Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8110-8119, 2020.
C. Ledig, L. Theis, F. Huszar, J. Caballero, A. Cunningham, A. Acosta, and W. Shi, "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network," Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4681-4690, 2017.
J.Y. Zhu, T. Park, P. Isola, and A.A. Efros, "Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks," Proceeding of the IEEE International Conference on Computer Vision, pp. 2223-2232, 2017.
H. Mo, B. Chen, and W. Luo, "Fake Faces Identification via Convolutional Neural Network," Proceeding of the 6th ACM Workshop on Information Hiding and Multimedia Security, pp. 43-47, 2018.
N.T. Do, I.S. Na, and S.H. Kim, "Forensics Face Detection from GANs Using Convolutional Neural Network," Proceeding of International Symposium on Information Technology Convergence, 2018.
I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, and Y. Bengio, "Generative Adversarial Nets," Advances in Neural Information Processing Systems, 27, 2014.
T. Karras, T. Aila, S. Laine, and J. Lehtinen, "Progressive Growing of GANs for Improved Quality, Stability, and Variation," arXiv preprint arXiv:1710.10196, 2017.
F. Marra, D. Gragnaniello, L. Verdoliva, and G. Poggi, "Do GANs Leave Artificial Fingerprints?," Proceeding of IEEE Conference on Multimedia Information Processing and Retrieval, pp. 506-511, 2019.
N. Yu, L.S. Davis, and M. Fritz, “Attributing Fake Images to GANs: Learning and Analyzing GAN fingerprints,” Proceeding of the IEEE/CVF International Conference on Computer Vision, pp. 7556-7566, 2019.
L. Verdoliva, “Media Forensics and Deepfakes: an Overview,” IEEE Journal of Selected Topics in Signal Processing, Vol. 14, No. 5, pp. 910-932, 2020.
D. Cozzolino, J. Thies, A. Rossler, C. Riess, M. Nießner, and L. Verdoliva, “Forensictransfer: Weakly-Supervised Domain Adaptation for Forgery Detection,” arXiv preprint arXiv:1812.02510, 2018.
M. Du, S. Pentyala, Y. Li, and X. Hu, “Towards Generalizable Forgery Detection with Locality-Aware Autoencoder,” arXiv preprint arXiv:1909.05999, 2019.
H. Li, B. Li, S. Tan, and J. Huang, “Detection of Deep Network Generated Images Using Disparities in Color Components,” arXiv preprint arXiv:1808.07276, 2018.
F. Chollet, “Xception: Deep Learning with Depthwise Separable Convolutions,” Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1251-1258, 2017.
F. Yu, A. Seff, Y. Zhang, S. Song, T. Funkhouser, and J. Xiao, “LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop,” arXiv preprint arXiv:1506.03365, 2015.
R.J. Wang, X. Li, and C.X. Ling, “Pelee: A Real-Time Object Detection System on Mobile Devices,” arXiv preprint arXiv:1804.06882, 2018.
S. Kang and H. Park, “Hierarchical CNN-Based Senary Classification of Steganographic Algorithms,” Journal of Korea Multimedia Society, Vol. 24, No. 4, pp. 550-557, 2021.
