MITRE service request: 1988723
Status: RESERVED (pending a qualifying public reference per CNA Rules §5.3).
Docling’s METS GBS backend is vulnerable to XML Entity Expansion (XXE) attacks thru 2.61.0. The backend extracts and validates XML files from .tar.gz archives using etree.fromstring() without disabling entity resolution. An attacker can craft a malicious XML file with nested entity definitions (XML Bomb) and package it into a .tar.gz archive. When processed by Docling, the exponential expansion of entities during XML parsing leads to excessive resource consumption, resulting in a denial of service (DoS) condition on the system running the Docling parser.
Malicious .tar.gz archives containing METS XML with nested entities pass _validate_mets_xml() → etree.fromstring without safe parser flags, causing CPU/memory exhaustion when archivists batch-convert documents.
docling ... exploit_mets.tar.gz.etree.fromstring on extracted METS XML.Medium–High for automated digitization pipelines.