torch.load in FlashAttention training / eval checkpoint paths
MITRE service request: 1988723
Status: RESERVED (pending a qualifying public reference per CNA Rules §5.3).
The flash-attention training framework through commit e724e2588cbe754beb97cf7c011b5e7e34119e62 (2025-04-13) contains an insecure deserialization vulnerability (CWE-502) in its checkpoint-loading mechanism. The load_checkpoint() function in checkpoint.py and the checkpoint-loading code in eval.py call torch.load() without the security-restrictive weights_only=True parameter, which allows deserialization of arbitrary Python objects via the pickle module. An attacker can exploit this by supplying a maliciously crafted checkpoint file; when a victim loads that checkpoint during model warm-starting or evaluation, arbitrary code executes on the victim's system.
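To make the CWE-502 mechanics concrete, the following self-contained sketch uses only the standard-library pickle module (which torch.load wraps) to show why loading an attacker-controlled file runs attacker-chosen code. The MaliciousCheckpoint class and the benign os.getenv payload are illustrative stand-ins, not code from the flash-attention repository:

```python
import os
import pickle
import tempfile

class MaliciousCheckpoint:
    """Stand-in for a crafted .pt payload: __reduce__ tells pickle to
    call an arbitrary callable at load time. Here the callable is the
    harmless os.getenv; a real exploit would use something like
    (os.system, ("<shell command>",))."""
    def __reduce__(self):
        return (os.getenv, ("HOME",))

# "Attacker" writes the crafted checkpoint to a shared path.
path = os.path.join(tempfile.mkdtemp(), "model.pt")
with open(path, "wb") as f:
    pickle.dump(MaliciousCheckpoint(), f)

# "Victim" loads it; the callable runs during deserialization,
# before any model code ever inspects the loaded object.
with open(path, "rb") as f:
    loaded = pickle.load(f)

# loaded is the *return value* of os.getenv("HOME"), not a
# MaliciousCheckpoint instance: the call already happened.
print(type(loaded))
```

The key point is that the victim never has to call anything on the loaded object; deserialization itself triggers execution, which is why a swapped checkpoint file is sufficient.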
Warm-start and evaluation flows call load_checkpoint(), which in turn calls torch.load(..., map_location=device) without weights_only=True. Any checkpoint swapped in on a shared NFS mount or Hugging Face cache therefore executes its pickle gadgets on every node that loads it, a significant blast radius on large GPU clusters.
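The remediation is to pass weights_only=True to torch.load(). Under the hood, that option restricts which globals the unpickler may resolve. The same defense can be sketched with the standard library alone; the RestrictedUnpickler class and its ALLOWED set below are illustrative names, assuming an allowlist approach similar to the one the pickle documentation recommends:

```python
import io
import os
import pickle

class RestrictedUnpickler(pickle.Unpickler):
    """Sketch of the weights_only=True idea: only an explicit allowlist
    of (module, name) globals may be resolved, so a gadget that tries to
    import a callable such as os.system raises instead of executing."""
    ALLOWED = {("collections", "OrderedDict")}  # extend for tensor types etc.

    def find_class(self, module, name):
        if (module, name) in self.ALLOWED:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"blocked global: {module}.{name}")

class Gadget:
    """Stand-in for a crafted payload (harmless callable for the demo)."""
    def __reduce__(self):
        return (os.getenv, ("HOME",))

payload = pickle.dumps(Gadget())

try:
    RestrictedUnpickler(io.BytesIO(payload)).load()
    blocked = False
except pickle.UnpicklingError:
    blocked = True

print("payload blocked:", blocked)  # the gadget callable never runs
```

In the affected code paths, the one-line change is torch.load(path, map_location=device, weights_only=True); checkpoints that contain only tensors and plain containers load unchanged, while gadget-bearing files are rejected.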
Affected commit: e724e2588cbe754beb97cf7c011b5e7e34119e62
Attack vector: train.warmstart.path or eval checkpoint arguments referencing .pt files
Affected files: training/src/utils/checkpoint.py and training/src/eval.py
Risk: High for shared HPC scratch directories