@techreport{oai:ipsj.ixsq.nii.ac.jp:00233222, author = {山, 裕人 and 辻村, 剛 and 袖山, 正悟 and 筒井, 友葵 and 伊藤, 秀武 and 楊, 柳楠 and 宇田, 隆哉 and Yuto, Yama and Tsuyoshi, Tsujimura and Shogo, Sodeyama and Yuki, Tsutsui and Shumu, Itou and Liunan, Yang and Ryuya, Uda}, issue = {8}, month = {Mar}, note = {生成 AI の発達によって AI がテキストや画像,動画を作れるようになった.しかし,この技術を悪用し,偽の画像や動画を作成して社会に混乱をもたらす人々が現れている.対抗策として AI が生成した偽動画 (Deepfake) を検出する手法が研究されているが完全ではない.そこで,本論文では動画をフレームごとに静止画に変換し,フレーム間のピクセルごとの RGB,HSV,HLS の差を機械学習することで Deepfake を検出する手法を提案する.実験の結果,CNN では RGB,HSV,HLS すべてにおいて精度が 50% 前後,CNN-LSTM では RGB は 50%,HSV,HLS は 55% 前後の精度であり,CNN-LSTM では訓練用の画像を増やした場合においても精度の上昇は 61% にとどまった.一方で SVM,RF を用いたアンサンブル手法では 100% の精度で検出可能であり,SVM 単体で 37%,RF 単体で 100% の精度となることが確認できた., With the development of generative AI, AI has become capable of creating text, images, and videos. However, some people abuse this technology and create fake images and videos to cause chaos in society. As a countermeasure, methods for detecting AI-generated fake videos (deepfakes) have been developed, but they are not perfect. Therefore, in this paper, we propose a method to detect deepfakes using machine learning with the differences in RGB, HSV, and HLS for each pixel between frames by converting videos to still images frame by frame. The experimental results showed that CNN had the accuracy of about 50% for RGB, HSV and HLS, and CNN-LSTM had the accuracy of about 50% for RGB and 55% for HSV and HLS. For CNN-LSTM, the accuracy increased to 61% even when the number of training images was increased. On the other hand, the ensemble method using SVM and RF could detect the deepfakes with 100% accuracy. It was also confirmed that the accuracy was 37% with SVM alone and 100% with RF alone.}, title = {動画フレーム間の差を用いた機械学習によるDeepfake動画の検出}, year = {2024} }