Frequency-Domain Deepfake Video Detection

Overview

This repository contains the code and research for a deepfake video detection pipeline explicitly optimized for low-latency forensic triage.

Most conventional detectors operate in the spatial domain and rely on pixel-level artifacts; this project instead explores a frequency-domain approach. Working on log-magnitude Fast Fourier Transform (FFT) spectra, the final fine-tuned model reaches 90% - 92% accuracy with a median per-frame latency of about 2.7 ms, which makes it practical for rapid triage of suspicious content.

The full methodology, detailed mathematical formulations, and training graphs are in the included Capstone Project Report.

Architecture & Pipeline

The project evaluates a three-stage progression of models:

Baseline 3D CNN: Trained on short RGB video clips to capture spatial-temporal features.
Intermediate ResNet50: Applies transfer learning to log-magnitude FFT spectra.
Fine-Tuned ResNet50: The final model, with the top convolutional block fine-tuned specifically for frequency-domain features.

Frequency Domain Representation (FFT)

Instead of analyzing pixel-level artifacts directly, the pipeline converts each frame into a log-magnitude FFT spectrum before classification.
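
The conversion from a frame to a log-magnitude spectrum can be sketched with NumPy as follows. This is a minimal illustration, not the code from `data_loader.py`: the function name `log_magnitude_spectrum`, the grayscale conversion by channel mean, the 224x224 frame size, and the min-max normalisation are all assumptions made for the example.

```python
import numpy as np


def log_magnitude_spectrum(frame: np.ndarray) -> np.ndarray:
    """Return the centred log-magnitude FFT spectrum of one frame, scaled to [0, 1]."""
    # Collapse colour channels to one plane (simple channel mean; an assumption).
    gray = frame.mean(axis=-1) if frame.ndim == 3 else frame
    gray = gray.astype(np.float64)

    # 2D FFT, then move the zero-frequency (DC) term to the centre of the array.
    spectrum = np.fft.fftshift(np.fft.fft2(gray))

    # log1p compresses the dynamic range and avoids taking log(0).
    log_mag = np.log1p(np.abs(spectrum))

    # Min-max normalise so the spectrum can be handled like an image.
    span = log_mag.max() - log_mag.min()
    if span == 0:
        return np.zeros_like(log_mag)
    return (log_mag - log_mag.min()) / span


# Synthetic stand-in for a decoded video frame (height, width, RGB).
frame = np.random.default_rng(0).integers(0, 256, size=(224, 224, 3), dtype=np.uint8)
spec = log_magnitude_spectrum(frame)
```

Because an ImageNet-style backbone expects three input channels, one option (again an assumption) is to repeat the single-channel spectrum with `np.repeat(spec[..., None], 3, axis=-1)` before passing it to the classifier.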

Performance & Results

Across six independent runs, the shift from the spatial domain to the frequency domain raised accuracy from 57% - 66% (Baseline 3D CNN) to 90% - 92% (Fine-Tuned FFT ResNet50).

(Note: Full classification reports and confusion matrices are available in the attached Capstone Project Report PDF.)

Model Architecture	Accuracy	AUC Score	Median Latency (per-frame)
Baseline 3D CNN	57% - 66%	0.55 - 0.74	N/A
FFT-based ResNet50	72% - 76%	0.82 - 0.85	N/A
Fine-Tuned FFT ResNet50	90% - 92%	> 0.95	~2.7 ms
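
The README does not state how the median per-frame latency was measured. One common approach is sketched below: time each single-frame prediction with a wall-clock timer, discard a few warm-up calls, and take the median. The helper `median_latency_ms` and the stand-in `dummy_predict` are hypothetical and exist only for this example; in practice `predict` would wrap the trained model's forward pass.

```python
import statistics
import time

import numpy as np


def median_latency_ms(predict, frames, warmup=3):
    """Median wall-clock time per frame, in milliseconds, for a callable `predict`."""
    # Warm-up calls are not timed, so one-off setup cost does not skew the result.
    for frame in frames[:warmup]:
        predict(frame)

    timings = []
    for frame in frames:
        start = time.perf_counter()
        predict(frame)
        timings.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(timings)


def dummy_predict(frame):
    # Hypothetical placeholder for a model's single-frame inference call.
    return float(frame.mean())


frames = [np.zeros((224, 224), dtype=np.float32) for _ in range(20)]
latency = median_latency_ms(dummy_predict, frames)
print(f"median latency: {latency:.3f} ms per frame")
```

The median is less sensitive than the mean to occasional slow calls, which is why it is a reasonable summary for per-frame timing.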

Dataset & Acknowledgements

The video data used to train and evaluate these models is sourced from the FaceForensics++ dataset.

Original Dataset Repository: ondyari/FaceForensics

Repository Structure

├── .gitignore
├── README.md
├── Capstone Project Report.pdf       # Full academic report, methodology, and graphs
├── Build_and_Train_Model.ipynb       # Model definition, training, and evaluation
├── data_loader.py                    # Custom dataset loading and transformation logic
├── requirements.txt                  # Python dependencies
├── assets/                           # Readme images and diagrams
│   ├── pipeline.png
│   ├── spectra_comparison.png
│   ├── performance_bars.png
│   └── roc_curve.png
└── videos/                           # Target directory for the dataset
    ├── Real_videos/
    └── Deepfakes_videos/
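
Given the `videos/` layout above, labelled samples can be collected by walking the two class folders. The sketch below is illustrative and is not taken from `data_loader.py`: the function name `list_labelled_videos`, the `.mp4` extension, and the label encoding (0 for real, 1 for deepfake) are assumptions.

```python
from pathlib import Path

# Folder names come from the repository layout; the 0/1 encoding is an assumption.
LABELS = {"Real_videos": 0, "Deepfakes_videos": 1}


def list_labelled_videos(root):
    """Return a list of (video_path, label) pairs found under `root`."""
    root = Path(root)
    samples = []
    for folder, label in LABELS.items():
        # Sorting keeps the sample order reproducible across runs.
        for path in sorted((root / folder).glob("*.mp4")):
            samples.append((path, label))
    return samples
```

Calling `list_labelled_videos("videos")` from the repository root would then return every clip paired with its class label, ready to be split into training and evaluation sets.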
