SBOM and Software Supply Chain for Federal AI

Why SBOM matters now

Executive Order 14028 (Improving the Nation's Cybersecurity, May 2021) directed NIST and NTIA to define SBOM requirements for federal software procurement. NIST's Secure Software Development Framework (SSDF, SP 800-218) and NTIA's minimum SBOM elements followed. OMB M-22-18 and M-23-16 extended attestation obligations to federal software procurements; OMB M-26-05 (2026) then pulled back the blanket mandate, leaving SBOM delivery risk-based and agency-discretionary. For FedRAMP, SBOM is increasingly a standard ConMon deliverable. For contractors delivering AI systems, SBOM is now an expected artifact in the body of evidence.

SBOM EXPECTATIONS KEEP SHIFTING

Executive Order 14028 set the SBOM direction for software sold to the federal government, and current OMB guidance leaves delivery risk-based and agency-discretionary. For AI systems, an SBOM should cover model weights, training frameworks, and all dependencies, not just application code. ML-specific SBOM tooling (CycloneDX ML-BOM) is available, but adoption is still maturing.

SBOM is the lowest-effort supply-chain control that produces the most operational value. Teams that still argue about whether to generate one are having the wrong conversation.

SPDX vs CycloneDX

Two major machine-readable SBOM formats exist.

Format	Origin	Strengths
SPDX	Linux Foundation, ISO/IEC 5962 standard	Mature tooling, strong in license compliance, broad adoption in enterprise.
CycloneDX	OWASP	Stronger vulnerability linkage (VEX), explicit ML-BOM extension, simpler schema.

Both are accepted by federal consumers. CycloneDX is increasingly preferred for AI/ML workloads because of its explicit ML-BOM extension. SPDX remains dominant for traditional software. Many teams generate both.

Minimum NTIA elements

NTIA defined the minimum data fields an SBOM must contain. The first five are recorded for every component; the author and timestamp describe the SBOM data itself:

Supplier name
Component name
Component version
Other unique identifier (e.g., PURL, CPE)
Dependency relationships
Author of the SBOM data
Timestamp

Modern SBOMs add more: hashes, license information, vulnerabilities (via VEX), and source URLs. For federal delivery, target NTIA-plus: the minimum plus hashes, licenses, and a CycloneDX VEX for known-exploited-vulnerability status on any Critical/High CVEs.
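As an illustration of the minimum elements, here is a small sketch that checks a CycloneDX-style JSON document for gaps. The mapping from NTIA elements to JSON fields (for example, treating `metadata.authors` or `metadata.tools` as the SBOM author, and a `dependencies` entry as the dependency relationship) is an assumption made for this example, and the component names are made up.

```python
from typing import Any


def ntia_gaps(bom: dict[str, Any]) -> list[str]:
    """List NTIA minimum elements missing from a CycloneDX-style JSON document."""
    gaps: list[str] = []
    meta = bom.get("metadata", {})

    # Document-level elements: timestamp and SBOM author.
    if not meta.get("timestamp"):
        gaps.append("document: timestamp")
    if not (meta.get("authors") or meta.get("tools")):
        gaps.append("document: author")

    # bom-refs that have an explicit dependency entry (even an empty one).
    described = {d.get("ref") for d in bom.get("dependencies", [])}

    for comp in bom.get("components", []):
        label = comp.get("name") or "<unnamed>"
        if not comp.get("name"):
            gaps.append(f"{label}: component name")
        if not comp.get("version"):
            gaps.append(f"{label}: version")
        if not (comp.get("supplier") or {}).get("name"):
            gaps.append(f"{label}: supplier name")
        if not (comp.get("purl") or comp.get("cpe")):
            gaps.append(f"{label}: unique identifier")
        ref = comp.get("bom-ref")
        if ref is None or ref not in described:
            gaps.append(f"{label}: dependency relationships")
    return gaps


# Hypothetical document: one complete component, one incomplete.
example = {
    "metadata": {"timestamp": "2025-01-01T00:00:00Z", "tools": [{"name": "syft"}]},
    "components": [
        {"bom-ref": "a", "name": "example-lib", "version": "1.0.0",
         "supplier": {"name": "Example Corp"}, "purl": "pkg:pypi/example-lib@1.0.0"},
        {"bom-ref": "b", "name": "mystery-lib"},
    ],
    "dependencies": [{"ref": "a", "dependsOn": []}],
}

for gap in ntia_gaps(example):
    print(gap)
```

A check like this fits as a CI gate: fail the build when the list is non-empty, rather than discovering the gaps at delivery time.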

ML-specific supply-chain elements

Traditional SBOM covers code. For AI systems you also need to inventory:

Element	Why it matters	How to record
Model weights	Provenance, fine-tuning lineage, classification inheritance	CycloneDX ML-BOM or custom component entry with model hash, version, base-model reference, training-data reference.
Training data	Licensing, PII/CUI exposure, bias source	Dataset name, version, hash, license, classification, source provenance.
Fine-tuning data	Same as training — sensitivity travels into weights	Same schema as training.
Inference tooling	Quantization, serving stack, tokenizers	Standard SBOM entries for llama.cpp, vLLM, transformers, etc.
Retrieval indexes	Source-document provenance for RAG	Index build pipeline version, source corpus reference, embedding model used.
Prompt templates and system prompts	Behavioral configuration, traceable to change management	Versioned in git, hashed, referenced in SBOM.
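To make the "How to record" column concrete, the sketch below assembles a dataset entry and a model entry as plain dictionaries. The `machine-learning-model` and `data` component types and the `modelCard` block follow my reading of CycloneDX 1.5 and should be validated against the schema before use. The `example:` property names, the model and dataset names, and the hashed placeholder bytes are all hypothetical.

```python
import hashlib
import json


def dataset_component(name: str, version: str, sha256: str,
                      license_id: str, classification: str) -> dict:
    """Component entry for a training or fine-tuning dataset."""
    return {
        "type": "data",
        "bom-ref": f"dataset:{name}@{version}",
        "name": name,
        "version": version,
        "hashes": [{"alg": "SHA-256", "content": sha256}],
        "licenses": [{"license": {"id": license_id}}],
        # Hypothetical property name; pick a namespace your consumers agree on.
        "properties": [{"name": "example:classification", "value": classification}],
    }


def model_component(name: str, version: str, weights_sha256: str,
                    base_model: str, dataset_refs: list[str]) -> dict:
    """Component entry for model weights, linked to its datasets by bom-ref."""
    return {
        "type": "machine-learning-model",
        "bom-ref": f"model:{name}@{version}",
        "name": name,
        "version": version,
        "hashes": [{"alg": "SHA-256", "content": weights_sha256}],
        # Hypothetical property name recording the base-model reference.
        "properties": [{"name": "example:base-model", "value": base_model}],
        "modelCard": {
            "modelParameters": {"datasets": [{"ref": r} for r in dataset_refs]}
        },
    }


# Placeholder bytes stand in for real dataset and weight files.
data_hash = hashlib.sha256(b"placeholder dataset bytes").hexdigest()
weights_hash = hashlib.sha256(b"placeholder weight bytes").hexdigest()

dataset = dataset_component("support-tickets", "2024.06", data_hash,
                            "CC-BY-4.0", "CUI")
model = model_component("triage-llm", "1.2.0", weights_hash,
                        "example-base-7b", [dataset["bom-ref"]])

print(json.dumps({"components": [dataset, model]}, indent=2))
```

Linking the model to its datasets by `bom-ref` keeps the lineage machine-checkable: a consumer can follow the reference from weights to data without parsing free text.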

How to generate SBOMs in practice

You do not hand-author SBOMs. You generate them from the build and deploy pipelines.

SBOM Generation and Maintenance Pipeline

Stage	Step (tooling)
Build CI	Source scan (syft / cyclonedx-python)
Image build	Container image scan (syft on final image)
Model push	ML-BOM metadata from model registry
Post-scan	Vulnerability enrichment (grype / VEX)
Attestation	SBOM signing (cosign / Sigstore)
Monthly	KEV delta monitoring and ConMon reporting

Language-ecosystem tools

syft (Anchore) for general containers and filesystems; cyclonedx-python, cyclonedx-maven, and cyclonedx-node-npm for language-specific builds.

Container images

Run syft on the final image, scan the resulting SBOM with grype for vulnerabilities, and emit a VEX document.

ML artifacts

CycloneDX ML-BOM fields populated from model-registry metadata (Hugging Face, internal registries).

Signing

cosign (part of the Sigstore project) for SBOM attestation; sign with keys managed inside your authorization boundary.

The output belongs alongside the artifact it describes. A container image without its SBOM is incomplete. A deployed model without its ML-BOM is incomplete.
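The generate, scan, and sign steps can be wired together with a thin wrapper. The sketch below only builds the command lines and runs them when asked. The flags shown (`syft -o cyclonedx-json=<file>`, `grype sbom:<file>`, `cosign attest --predicate ... --type cyclonedx`) reflect my understanding of recent releases and should be checked against the versions you have installed. The image name, file names, and key reference are placeholders.

```python
import subprocess


def sbom_pipeline_commands(image: str,
                           sbom_path: str = "sbom.cdx.json",
                           key_ref: str = "cosign.key") -> list[list[str]]:
    """Build the generate, scan, and attest commands for one container image."""
    return [
        # 1. Generate a CycloneDX JSON SBOM from the final image.
        ["syft", image, "-o", f"cyclonedx-json={sbom_path}"],
        # 2. Scan the SBOM (not the image again) for known vulnerabilities.
        ["grype", f"sbom:{sbom_path}", "-o", "json"],
        # 3. Attach the SBOM to the image as a signed attestation.
        ["cosign", "attest", "--key", key_ref,
         "--type", "cyclonedx", "--predicate", sbom_path, image],
    ]


def run_pipeline(image: str, dry_run: bool = True) -> list[list[str]]:
    """Print each command; execute it only when dry_run is False."""
    commands = sbom_pipeline_commands(image)
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(cmd, check=True)
    return commands


# Placeholder image reference; dry run only prints the commands.
planned = run_pipeline("registry.example.gov/team/llm-gateway:1.4.2")
```

Keeping the command list separate from execution makes the step order reviewable and lets the same definition drive both CI and a local dry run.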

Known-exploited-vulnerability posture

CISA publishes the Known Exploited Vulnerabilities (KEV) catalog. Federal agencies use KEV to prioritize remediation. For any SBOM you deliver, you should be able to answer:

Does any component map to a KEV CVE?
For each KEV, is the vulnerable code path reachable in your deployment?
What is the remediation plan?
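The first question is a set intersection once you have the KEV catalog and your scan findings in hand. A minimal sketch, assuming the catalog JSON carries a `vulnerabilities` array whose entries have a `cveID` field, and that your scanner output has been flattened into dictionaries with `cve` and `component` keys (that findings shape is hypothetical):

```python
def kev_hits(kev_catalog: dict, findings: list[dict]) -> list[dict]:
    """Return the findings whose CVE appears in the KEV catalog."""
    kev_ids = {entry["cveID"] for entry in kev_catalog.get("vulnerabilities", [])}
    return [f for f in findings if f["cve"] in kev_ids]


# Trimmed, hand-written catalog excerpt and hypothetical scanner findings.
catalog = {"vulnerabilities": [{"cveID": "CVE-2021-44228"}]}
findings = [
    {"cve": "CVE-2021-44228", "component": "pkg:maven/org.apache.logging.log4j/log4j-core@2.14.1"},
    {"cve": "CVE-0000-0000", "component": "pkg:pypi/example-lib@1.0.0"},
]

for hit in kev_hits(catalog, findings):
    print(f"KEV match: {hit['cve']} in {hit['component']}")
```

Reachability and remediation planning still need human analysis; this step only narrows the list to the entries that deserve it first.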

CycloneDX VEX is a standard way to express "this CVE is present in the component but not exploitable in this deployment because..." statements. Federal consumers value a VEX-annotated SBOM over a raw SBOM because it reduces the manual triage burden.
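A "present but not exploitable" statement might look like the following sketch. The `analysis` fields and the `not_affected` / `code_not_reachable` values follow my reading of the CycloneDX vulnerability schema; the CVE identifier, component reference, and justification text are made up for illustration.

```python
import json


def vex_not_affected(cve_id: str, component_ref: str, detail: str) -> dict:
    """One VEX statement: the CVE is present but the code path is unreachable."""
    return {
        "id": cve_id,
        "analysis": {
            "state": "not_affected",
            "justification": "code_not_reachable",
            "detail": detail,
        },
        # bom-ref of the affected component in the accompanying SBOM.
        "affects": [{"ref": component_ref}],
    }


vex_doc = {
    "bomFormat": "CycloneDX",
    "specVersion": "1.5",
    "version": 1,
    "vulnerabilities": [
        vex_not_affected(
            "CVE-0000-0000",  # placeholder identifier
            "pkg:pypi/example-lib@1.0.0",
            "The vulnerable parser is never imported by the serving process.",
        )
    ],
}

print(json.dumps(vex_doc, indent=2))
```

The `detail` text is what saves the consumer time, so it should state the concrete reason the code path is unreachable rather than restating the justification code.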

Three practical patterns

1. SBOM-in-CI

Generate SBOMs on every build, sign them, attach them to the artifact registry. Do not treat SBOM generation as a compliance afterthought — it is a build step.

2. Continuous VEX

Run vulnerability scans against SBOMs on a schedule. When a new CVE lands, regenerate the VEX. Federal consumers value fresh VEX data over historical static scans.
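One way to keep VEX fresh is to diff each scheduled scan against the existing VEX document and open a triage entry for anything new. The sketch below assumes the VEX document is CycloneDX-style JSON with a `vulnerabilities` list and uses the `in_triage` analysis state as the placeholder; the CVE identifiers are made up.

```python
import copy


def refresh_vex(vex_doc: dict, scan_cves: set[str]) -> tuple[dict, list[str]]:
    """Add in_triage statements for CVEs the scan found but the VEX lacks.

    Returns the updated document and the sorted list of newly added CVE ids.
    The input document is not modified.
    """
    updated = copy.deepcopy(vex_doc)
    vulns = updated.setdefault("vulnerabilities", [])
    known = {v["id"] for v in vulns}
    new_ids = sorted(scan_cves - known)
    for cve in new_ids:
        vulns.append({"id": cve, "analysis": {"state": "in_triage"}})
    return updated, new_ids


# Hypothetical state: one CVE already analyzed, one newly reported by the scan.
current_vex = {
    "vulnerabilities": [
        {"id": "CVE-0000-0001", "analysis": {"state": "not_affected"}}
    ]
}
latest_scan = {"CVE-0000-0001", "CVE-0000-0002"}

refreshed, newly_added = refresh_vex(current_vex, latest_scan)
print("needs analysis:", newly_added)
```

Existing analyses are preserved, so the scheduled job only creates work for CVEs nobody has looked at yet.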

3. ML-BOM at model-register time

When a new model version is added to the internal registry, generate the ML-BOM as part of the registration workflow. Base-model reference, training-data hash, fine-tuning lineage, evaluation results — all attached at registration.
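A registration hook along these lines can enforce that the lineage fields are present before a model version is accepted. This is a sketch: the required field names and the record layout are hypothetical, not a registry API, and the temporary file stands in for a real weights file.

```python
import hashlib
import os
import tempfile

# Hypothetical lineage fields required at registration time.
REQUIRED_LINEAGE = ("base_model", "training_data_sha256",
                    "fine_tuning_lineage", "evaluation_results")


def file_sha256(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so multi-gigabyte weights never load into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def registration_record(weights_path: str, name: str, version: str,
                        lineage: dict) -> dict:
    """Build the ML-BOM input record, refusing incomplete lineage."""
    missing = [key for key in REQUIRED_LINEAGE if not lineage.get(key)]
    if missing:
        raise ValueError(f"cannot register {name}@{version}; missing: {missing}")
    return {
        "name": name,
        "version": version,
        "weights_sha256": file_sha256(weights_path),
        **lineage,
    }


# Demo with a throwaway file standing in for real model weights.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"placeholder weight bytes")
    demo_path = tmp.name

record = registration_record(demo_path, "triage-llm", "1.2.0", {
    "base_model": "example-base-7b",
    "training_data_sha256": "0" * 64,
    "fine_tuning_lineage": "run-2024-06-01",
    "evaluation_results": "eval-report-v3",
})
os.remove(demo_path)
print(record["weights_sha256"])
```

Raising at registration time means a model with unknown lineage never reaches the registry, which is cheaper than reconstructing provenance after deployment.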

The SBOM is not the deliverable. The SBOM pipeline is the deliverable. A one-time SBOM is worthless next month.

Common mistakes

Generating an SBOM once at contract award and never refreshing it.
Omitting transitive dependencies — Python's transitive graph alone can be hundreds of entries.
Missing container base-image components.
No ML-BOM at all for AI systems.
No signing — an unsigned SBOM is trivially forgeable.
No VEX — handing consumers a raw SBOM and expecting them to triage.
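The transitive-dependency gap in the list above is easy to see from inside a Python environment: the standard library can enumerate every installed distribution, not just the ones named in a requirements file. The sketch below is illustrative only and is not a substitute for a real SBOM generator; the PURL normalization shown (lowercase, underscores to hyphens) is a simplification.

```python
from importlib import metadata


def pypi_purl(name: str, version: str) -> str:
    """Build a package URL for a PyPI distribution (simplified normalization)."""
    normalized = name.lower().replace("_", "-")
    return f"pkg:pypi/{normalized}@{version}"


def installed_components() -> list[dict]:
    """List every distribution in the current environment, transitive or not."""
    components = []
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        if not name:
            continue  # skip entries with broken metadata
        components.append({
            "name": name,
            "version": dist.version,
            "purl": pypi_purl(name, dist.version),
        })
    return sorted(components, key=lambda c: c["name"].lower())


components = installed_components()
print(f"{len(components)} installed distributions")
for comp in components[:5]:
    print(comp["purl"])
```

Comparing this count against the number of entries in your generated SBOM is a quick sanity check that transitive packages were not dropped.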

Bottom line

SBOM is a federal expectation. CycloneDX with ML-BOM extension is the right default for AI systems. Generate in CI, sign with cosign (Sigstore), annotate with VEX, and regenerate continuously. For AI specifically, inventory model weights, training data, fine-tuning lineage, and retrieval indexes. Treat the SBOM pipeline as a product, not a paperwork step.

Frequently asked questions

Is SBOM required for federal contracts?

It depends on the contract. EO 14028 and OMB M-22-18/M-23-16 set the direction, and OMB M-26-05 (2026) made SBOM delivery risk-based rather than blanket-mandatory — but many agency contracts and FedRAMP ConMon practice still specify SBOMs directly, so read the clause.

SPDX or CycloneDX?

Both are accepted. SPDX is an ISO/IEC standard (ISO/IEC 5962) and remains dominant for traditional software. CycloneDX is increasingly preferred for AI/ML due to the ML-BOM extension and stronger VEX integration. Many teams generate both.

What are the NTIA minimum SBOM elements?

Supplier name, component name, component version, unique identifier (PURL or CPE), dependency relationships, SBOM author, and timestamp. Most real SBOMs include hashes, licenses, and vulnerabilities on top.

Do I need to include model weights in the SBOM?

Yes, for AI systems. Use CycloneDX ML-BOM or custom component entries that record the model hash, version, base-model reference, and training-data lineage.

What is VEX?

Vulnerability Exploitability eXchange — a way to annotate an SBOM with statements like 'CVE-X is present but not exploitable in this deployment because...'. Reduces triage burden for consumers.

How do I sign an SBOM?

Use cosign (part of the Sigstore project), with keys managed inside your authorization boundary. Attach the signed SBOM to the artifact registry alongside the artifact it describes.
