Blue Goat Cyber logoBlue Goat CyberSMMedical Device Cybersecurity
    K
    Blog · FDA

    FDA AI Cybersecurity Threats: 7 Attacks 524B Manufacturers Must Address

    The FDA's Feb 3, 2026 guidance names 7 AI cyber threats — data poisoning, model inversion, evasion, leakage, overfitting, bias, drift — as 524B obligations.

    Hero illustration for the article: FDA AI Cybersecurity Threats: 7 Attacks 524B Manufacturers Must Address
    Christian Espinosa, Founder & CEO at Blue Goat Cyber

    By Christian Espinosa, MBA, CISSP

    Founder & CEO · Blue Goat Cyber

    Published: June 29, 2026

    Published June 29, 2026

    Direct answer

    The FDA's February 3, 2026 final guidance on cybersecurity in medical devices, read together with the AI/ML-enabled device guidance, names seven AI-specific cyber threats — data poisoning, model inversion/stealing, model evasion, data leakage, overfitting, model bias, and performance drift — and treats them as in-scope cybersecurity risks under section 524B of the FD&C Act for any AI-enabled "cyber device." Manufacturers must analyze each threat in their threat model, document controls in their Secure Product Development Framework (SPDF), and demonstrate the controls work in premarket submissions.

    Key Takeaways

    • The FDA's Feb 3, 2026 guidance explicitly enumerates 7 AI cyber threats as 524B obligations.
    • Threats span the full ML lifecycle: training data, model weights, inference, and post-deployment drift.
    • Each threat needs an entry in your threat model with a documented mitigation and test evidence.
    • Generic application security controls are not sufficient — AI threats need AI-specific controls.
    • Postmarket monitoring must cover drift and bias, not only CVEs.
    • Document everything in your SPDF; the FDA will ask for it during premarket review.

    Table of Contents

    Why This Matters

    The FDA's final premarket cybersecurity guidance dated February 3, 2026 makes one point unmistakable: if your device meets the definition of a "cyber device" under section 524B(c) of the FD&C Act and incorporates AI, the AI is not a separate compliance track. It is part of your cybersecurity submission. The guidance lists seven categories of AI risk that "can be impacted by cybersecurity threats" — and the FDA expects each one to be analyzed in your threat model, controlled in your Secure Product Development Framework, and verified before clearance.

    This is a meaningful shift. Until recently, manufacturers could treat model quality as a data-science concern and cybersecurity as an IT concern. The 2026 guidance, read with AAMI CR34971 on AI risk management and the FDA's predetermined change control plan (PCCP) framework, fuses them. A reviewer can — and will — issue a cybersecurity deficiency for an AI-enabled device whose threat model does not address data poisoning or adversarial evasion. Standards including IEC 81001-5-1, AAMI TIR57, AAMI TIR97, and ISO/IEC 27001 provide the surrounding controls, but it is the threat-modeling and SPDF evidence that the FDA reads first.

    The seven threats below are taken directly from the guidance text. Each section explains what the threat is, why the FDA treats it as a cybersecurity issue (not only a model-quality issue), and what control evidence a reviewer expects to see.

    1. Data Poisoning

    What it is. Cyber attackers deliberately inject inauthentic or maliciously modified records into training, fine-tuning, or feedback data so the model learns the wrong thing — for example, labeling malignant lesions as benign or shifting decision boundaries in a sepsis predictor.

    Why it is a 524B issue. Data poisoning is not just bad data hygiene; it is an integrity attack on the model's training pipeline. The FDA treats the training pipeline as part of the device's attack surface when training artifacts or update channels affect the deployed model.

    Mitigations.

    • Cryptographically signed dataset manifests and per-record hashes; reject any training run whose corpus does not match the manifest.
    • Role separation between data engineers, label reviewers, and model owners; least-privilege access to the training bucket and feature store.
    • Statistical drift and outlier detection on inbound training and feedback data, gated before the next training run.
    • A documented retrain/revert procedure with a known-good model baseline if poisoning is suspected post-deployment.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design Training-data provenance policy + signed manifest schema Manifest-signature validation test (negative case rejected) Data-integrity test report referenced in eSTAR Security Architecture section
    Secure implementation Access-control matrix for training infra and feature store IAM/least-privilege review + log review Access-review record and IAM export attached to threat model
    Secure verification Statistical drift / outlier detector on inbound data Red-team poisoning injection against detector Adversarial-test report; STRIDE/ATLAS row "Tampering — AML.T0020 Poison Training Data"
    Secure release & response Retrain-from-known-good runbook Tabletop on poisoning incident Postmarket response plan + runbook version in submission

    For a deeper treatment, see our dedicated post on AI data poisoning in medical devices.

    2. Model Inversion and Model Stealing

    What it is. An attacker queries the deployed model with crafted inputs to either (a) reconstruct sensitive attributes of the training data — model inversion — or (b) replicate the model's behavior in a surrogate — model stealing. Both attacks turn the inference endpoint into a side channel.

    Why it is a 524B issue. Inversion can leak PHI used during training, which is a confidentiality breach under 524B. Stealing exposes proprietary weights and creates a parallel model an attacker can then attack offline at no cost, accelerating every other threat on this list.

    Mitigations.

    • Authentication, authorization, and per-user/per-device rate limiting on every inference endpoint.
    • Output minimization: return the class or score the clinician needs, not full logits, embeddings, or attention maps.
    • Differential-privacy or noise-injection techniques where training data is sensitive.
    • Query-pattern anomaly detection for high-volume probing or adversarial query distributions.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design Inference-API threat model entry (Information Disclosure) Architectural review against ATLAS "AML.T0024 Exfiltration via ML Inference API" Threat-model row + data-flow diagram
    Secure implementation AuthN/AuthZ, rate-limit, and output-minimization config Endpoint pen test with stolen credentials and probing harness Pen-test report (AI scope)
    Secure verification Model-stealing simulation (query budget, fidelity score) Surrogate-model fidelity test below documented threshold Verification report with fidelity metrics
    Secure release & response Inference-log retention + anomaly alert rules Incident tabletop on detected probing campaign Postmarket monitoring plan section

    See Medical Device AI Model Inversion for the full breakdown.

    3. Model Evasion

    What it is. An attacker crafts or perturbs an input — often imperceptibly to a clinician — so the model misclassifies it. A pixel-level perturbation on a dermatology image, an adversarial patch on an ECG waveform, or carefully chosen noise in a radiology scan can flip a prediction.

    Why it is a 524B issue. Evasion attacks degrade the reliability of clinical output, which the FDA treats as patient-safety-relevant. The 2026 guidance is explicit that cyber controls must protect the integrity of model predictions, not only the system around the model.

    Mitigations.

    • Adversarial robustness testing against published attack libraries (FGSM, PGD, patch attacks, modality-specific attacks).
    • Input validation, range checks, and out-of-distribution (OOD) detection on raw inputs and intermediate features.
    • Confidence thresholds with human-in-the-loop fallback when confidence is low or OOD detection fires.
    • Adversarial training or certified-robustness techniques where the harm rating warrants it.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design Adversarial-robustness requirement tied to hazard analysis (ISO 14971) Threat-model row "Tampering — AML.T0015 Evade ML Model" Hazard-to-control traceability table
    Secure implementation OOD detector + confidence-threshold logic in inference path Unit + integration tests for OOD fallback Code-review record + test results
    Secure verification Adversarial test plan (FGSM, PGD, patch, modality-specific) Robustness benchmark vs. clean accuracy delta Adversarial-test report appended to risk-based testing summary
    Secure release & response Field-collected misprediction triage workflow Tabletop on adversarial input cluster in production Postmarket surveillance plan

    See AI Model Evasion in Medical Devices for examples and test patterns.

    4. Data Leakage

    What it is. Cyber threats exploit vulnerabilities to access sensitive training data, fine-tuning data, or inference-time data — for example, through misconfigured object storage, over-broad model endpoints, debug logs, or cached intermediate tensors.

    Why it is a 524B issue. Data leakage is a direct confidentiality failure under 524B's expectations for protecting electronic information. It is also a regulatory issue under HIPAA and applicable international privacy regimes, but the FDA cybersecurity reviewer will judge it on 524B grounds.

    Mitigations.

    • Encryption in transit and at rest for every store that holds training, validation, or inference data.
    • Strict separation of training infrastructure from production patient data; documented data-minimization at each stage.
    • Tamper-evident access logging that captures who read which dataset and when.
    • Pen testing that explicitly probes the data pipeline and the inference endpoint, not only the device firmware.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design Data-flow diagram with classification labels (PHI, derived, model artifact) Architecture review against 524B "protect electronic information" Architecture section of submission
    Secure implementation Encryption + IAM config for every data store and bucket Config scan and IAM review Cryptographic-controls summary
    Secure verification Pipeline pen test (storage, logs, caches, inference) Authenticated and unauthenticated tests with leakage findings AI penetration testing report
    Secure release & response Incident-response runbook covering PHI exposure Tabletop with disclosure and notification path IR plan + breach-notification SOP

    5. Overfitting

    See also: FDA IDE Cybersecurity Requirements: 2026 Submission Guide, MQTT Vulnerabilities in Connected Medical Devices: FDA Risks, Controls, and Deficiency Patterns, and SBOM Diffing and CVE Correlation: The Postmarket Workflow the FDA Expects.

    What it is. A model is trained too closely to its training set and generalizes poorly. A cyber attacker can deliberately push a model toward overfitting by injecting narrow patterns into training data, then exploit the brittle decision surface with adversarial inputs at inference time.

    Why it is a 524B issue. Overfitting is normally a model-quality concern, but the FDA includes it here because attackers can cause it deliberately and then exploit it. That makes overfitting an indicator of compromise as well as a quality defect.

    Mitigations.

    • Held-out validation across demographically and clinically diverse cohorts; cross-validation results in the submission.
    • Train/validation gap and decision-boundary sharpness treated as security-relevant alerts, not only data-science metrics.
    • Regularization, dropout, and ensemble strategies documented in the model card.
    • Re-evaluation of overfitting indicators after every training-data refresh or PCCP-authorized update.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design Generalization requirement linked to clinical performance acceptance criteria Review against intended-use population Performance requirements traceability
    Secure implementation Regularization + ensemble configuration captured in model card Reproducible training pipeline test Model card + training-pipeline artifact
    Secure verification Subgroup + external-dataset validation; train/val gap threshold Validation report with diverse cohorts Clinical/non-clinical performance section
    Secure release & response Overfitting-alert rule tied to retraining triggers Tabletop on suspected adversarial overfitting Postmarket monitoring plan

    See AI Overfitting Cybersecurity Threats in Medical Devices for detection patterns.

    6. Model Bias

    What it is. Bias enters when training data under-represents populations, when labels carry historical inequities, or when an attacker deliberately skews fine-tuning data, embeds backdoors, or distributes pre-trained models that already carry bias to be amplified downstream.

    Why it is a 524B issue. The 2026 guidance treats deliberate bias manipulation — including backdoor triggers and adversarial bias amplification — as a cybersecurity attack on the model. Reviewers expect bias monitoring to be integrated with security monitoring, not run as a separate fairness exercise.

    Mitigations.

    • Provenance and integrity of any third-party datasets or pre-trained checkpoints; SBOM-style accounting for ML artifacts.
    • Subgroup performance reporting (sensitivity, specificity, calibration by demographic and clinical subgroup) in the premarket submission.
    • Backdoor scanning on imported model weights and a procedure for re-training if a backdoor is found.
    • Postmarket bias monitoring tied to the device's postmarket cybersecurity program.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design ML-SBOM listing datasets, checkpoints, licenses, versions Review of artifact provenance against SBOM/supply chain requirements ML-SBOM in submission
    Secure implementation Backdoor / trojan scan in CI on imported weights Negative test with planted trigger CI scan report
    Secure verification Subgroup performance + fairness metrics Subgroup test report with predefined thresholds Performance section of submission
    Secure release & response Bias-monitoring rules in postmarket telemetry Tabletop on detected subgroup drift Postmarket plan

    7. Performance Drift

    What it is. Cyber attackers shift the input data distribution over time — through compromised upstream sensors, manipulated EHR fields, or slow injection of skewed feedback — so the model's accuracy decays. Drift can also arise naturally; the threat is that an attacker can cause it on purpose and hide it as ordinary degradation.

    Why it is a 524B issue. Performance drift connects directly to postmarket cybersecurity obligations. The FDA expects manufacturers to detect drift, decide whether it is benign or adversarial, and respond within their PCCP or change-control process.

    Mitigations.

    • Continuous performance monitoring against a held-out reference distribution; alarm thresholds documented in the SPDF.
    • Distinction between data drift, concept drift, and adversarial drift — with investigation playbooks for each.
    • Integrity controls on the update channel so legitimate retraining cannot be hijacked.
    • Reporting paths into the FDA when drift indicates a cybersecurity issue affecting safety or effectiveness.

    SPDF control / test mapping.

    SPDF activity Control artifact Verification test Submission evidence
    Secure design Drift-monitoring requirement + reference distribution definition Architectural review of telemetry path Postmarket monitoring architecture
    Secure implementation Signed update channel + rollback control Negative test: unsigned update rejected CI/CD security gates evidence
    Secure verification Drift-detector validation against benign and adversarial shifts Adversarial-drift simulation Verification report
    Secure release & response Drift triage playbook + FDA reporting decision tree Tabletop covering benign vs. adversarial drift Postmarket plan + reporting SOP

    See AI Performance Drift in Medical Devices for the monitoring architecture.

    Mapping the 7 Threats to Your SPDF and Threat Model

    For a 524B submission, each of the seven threats above should appear in three places in your documentation:

    1. Threat model. Use STRIDE or a STRIDE-plus-MITRE-ATLAS overlay so AI-specific threats sit alongside conventional ones. ATLAS gives you the adversarial ML tactics and techniques; STRIDE keeps the framing consistent with the rest of the device. See our step-by-step threat modeling guide.
    2. SPDF risk and control matrix. Each threat ties to one or more controls, each control ties to one or more verification activities, and each verification activity produces an artifact that goes into the submission.
    3. Postmarket monitoring plan. Drift, bias, and anomalous query patterns need monitoring rules and response playbooks — not just dashboards.

    When a reviewer asks "how does your device address AI cybersecurity threats," the answer is not a paragraph. It is a traceable chain from threat to control to test to monitoring rule, for all seven categories.

    How Blue Goat Cyber Approaches AI Threat Modeling

    Our ThreatGoat workflow treats the seven FDA-listed AI threats as first-class entries in the threat model — not appendices. We map each one to STRIDE plus the relevant MITRE ATLAS techniques, generate the control list and test plan from that mapping, and produce the SPDF artifacts a 524B reviewer expects to see. Our AI penetration testing service then exercises the controls against the actual deployed model, so the evidence in your submission reflects how the system behaves under attack, not only how it is designed.

    If you are preparing a 510(k), De Novo, or PMA for an AI-enabled device, the fastest path through a clean cybersecurity review is a threat model and SPDF that already speak the language of the Feb 3, 2026 guidance.

    FAQ

    Does the 2026 guidance apply if my device only uses a locked AI model? Yes. Locked models are still subject to the seven threats — particularly inversion, evasion, leakage, and data poisoning of any retraining pipeline you might invoke under a PCCP. The locked-vs-adaptive distinction affects change control, not whether the cyber threats apply.

    Do I need ATLAS in my submission, or is STRIDE enough? STRIDE alone is rarely enough for AI-enabled devices. Reviewers increasingly expect adversarial ML techniques (drawn from MITRE ATLAS or equivalent) to be referenced in the threat model so it is clear you considered AI-specific attacks, not only generic ones.

    How does this interact with a Predetermined Change Control Plan (PCCP)? A PCCP defines what model changes you can ship without a new submission. Your cybersecurity controls for the seven threats must hold across the range of changes the PCCP authorizes. The FDA will ask how your controls are verified after each PCCP-authorized update.

    Are these threats only relevant to imaging or diagnostic AI? No. Performance drift and data leakage apply equally to predictive monitoring, closed-loop control, and clinical decision support. Evasion is most studied in imaging but is documented in waveform, tabular, and language-model contexts.

    Where do AAMI CR34971 and IEC 81001-5-1 fit? AAMI CR34971 gives you the AI risk-management process; IEC 81001-5-1 gives you the security activities across the product lifecycle. Together they support — but do not replace — the threat-modeling and SPDF evidence the FDA expects under 524B.

    Each of the threats above has a dedicated post with detection patterns, controls, and test evidence:

    Supporting reading: AAMI CR34971 AI Risk Management Explained · Threat Modeling Connected and Implantable Devices · FDA Section 524B Subsections Explained


    Need help building an SPDF and threat model that covers all seven of these AI threats? Talk to Blue Goat Cyber about a 524B-ready AI cybersecurity engagement, or start with our ThreatGoat threat-modeling workflow.

    About the author — The Blue Goat Cyber team specializes in cybersecurity for FDA-regulated medical devices, including AI/ML-enabled systems subject to section 524B of the FD&C Act. We help manufacturers ship clean premarket submissions and run defensible postmarket programs.

    Sources & references

    Primary sources cited in this article. Links open in a new tab.

    1. final premarket cybersecurity guidance dated February 3, 2026- U.S. FDA
    Related articles

    Keep reading

    Related 524B & eSTAR resources

    Keep going: the 524B and eSTAR working set

    Start with the walkthrough hub, then drill into the statute, the eSTAR field map, SBOM monitoring, postmarket planning, and deficiency response. Use these as the playbook behind every cyber device submission.

    Hub
    FDA Section 524B & eSTAR Cybersecurity Walkthrough

    Start here: the hub that ties the statute, the February 2026 guidance, and the eSTAR fields together in the order a submission team works through them.

    Related services

    Put this into practice on your device

    Every Blue Goat Cyber engagement maps directly to FDA Section 524B and the SPDF - so the evidence you need lands in your submission, not in a separate report.

    Ready when you are

    Get FDA cleared without the cybersecurity headaches.

    30-minute strategy session. No cost, no commitment - just answers from people who've shipped 250+ FDA submissions.