3.5 Deepfakes — The Evolving Threat
The Arms Race
graph TD
A["2017: Basic<br>face swaps<br>(minutes to create,<br>easy to detect)"] --> B["2019: DeepFaceLab<br>(hours, moderate<br>detection)"]
B --> C["2021: Real-time<br>face swap<br>(instant, harder<br>to detect)"]
C --> D["2023: LivePortrait<br>One-shot animation<br>(seconds from single<br>photo, very hard)"]
D --> E["2025+: Neural<br>avatars, 3D Gaussians<br>(real-time, near<br>indistinguishable)"]
Why Deepfakes Are Uniquely Dangerous for Banking
- Low barrier to entry: Free tools, consumer GPUs, tutorial videos
- Real-time capability: Can respond to active liveness challenges live
- Scalability: One model can generate unlimited attack attempts
- Improving faster than detection: Generative AI advances outpace defensive AI
- Cross-modal: Can combine face + voice for multi-modal bypass
Detection Signals
| Signal |
What To Look For |
Reliability |
| Temporal flickering |
Face boundary flickers between frames |
Good for lower-quality deepfakes; fading for state-of-the-art |
| Blending boundary |
Visible seam where swapped face meets original |
Good — most swaps still show subtle boundaries |
| Frequency artifacts |
GAN upsampling creates checkerboard patterns in FFT |
Good but increasingly addressed by newer models |
| Eye reflection inconsistency |
Reflections in left vs right eye should match; deepfakes often don't |
Moderate — improving in newer methods |
| Physiological absence (rPPG) |
No detectable heart rate signal in synthetic face |
Excellent — extremely hard to fake |
| Lip sync quality |
Subtle timing/shape mismatches between audio and mouth |
Moderate — tools like Wav2Lip are very good |
| Hair/ear/neck artifacts |
Deepfakes struggle with fine hair detail, ear consistency, neck blending |
Good supplementary signal |
| Background consistency |
Face processing may leave background untouched, creating lighting/color mismatch |
Moderate |
Anti-Deepfake Architecture
graph TD
A["Input Video<br>(N frames)"] --> B["Frame-level<br>Deepfake Detector<br>(per-frame score)"]
A --> C["Temporal<br>Consistency<br>Analyzer"]
A --> D["rPPG Extractor<br>(physiological<br>signal)"]
A --> E["Forensic<br>Frequency<br>Analyzer"]
B --> F["Ensemble<br>Fusion"]
C --> F
D --> F
E --> F
F --> G{"Deepfake<br>Probability"}
G -->|"> 0.7"| H["❌ Deepfake<br>Detected"]
G -->|"0.3 - 0.7"| I["⚠️ Uncertain<br>Escalate"]
G -->|"< 0.3"| J["✅ Likely<br>Genuine"]
Recommended Research Papers
- FaceForensics++ (Rössler et al., 2019) — benchmark dataset
- Thinking in Frequency (Qian et al., 2020) — frequency analysis approach
- Multi-Attentional Deepfake Detection (Zhao et al., 2021) — attention-based
- DeepfakeBench (Yan et al., 2023) — comprehensive benchmark
- Implicit Identity Leakage (Dong et al., 2023) — identity-based detection
Next: Social Engineering & Process Attacks →