Issue #9 — May 2026 | AI Security Weekly

On May 8, 2026, CISA added CVE-2026-42208 — a SQL injection vulnerability in BerriAI’s LiteLLM — to the Known Exploited Vulnerabilities catalog.¹ The entry is unremarkable in its CVSS score and unremarkable in its mechanism. What is remarkable is its meaning: an open-source LLM gateway — the kind of software that sits between an enterprise application and a foundation model API — has become the first piece of AI infrastructure tooling to enter the federal government’s actively-exploited list. The transparency that lets a security team audit LiteLLM also let an adversary find a SQL injection in it, weaponize it, and use it in production attacks fast enough to draw a federal advisory.

The same week, the PyTorch Lightning PyPI package was confirmed compromised across versions 2.6.2 and 2.6.3, with remediation guidance instructing operators to pin to 2.6.1 and rotate any secrets accessible to compromised builds.² A month earlier, the popular Axios HTTP client — embedded in countless Node.js services including AI-tooling stacks — was poisoned at versions 1.14.1 and 0.30.4 in a campaign Microsoft threat intelligence attributed to the North Korean actor Sapphire Sleet.³ Three compromises. Three different open-source packages. Three different points along the AI delivery pipeline. One pattern.

“The transparency that builds trust is the same transparency that hands attackers a roadmap. Open-source AI is now ubiquitous in enterprise — and ubiquity is the precondition for systemic risk.”

— ASI Intelligence Team observation, W19 2026

This edition of AI Security Weekly examines what these three compromises mean cumulatively, why the open-source dependency graph is the underwriter’s next pricing frontier, and why the EU AI Act’s August 2, 2026 high-risk deadline lands differently on enterprises that have built on open-weight foundations than on those that have not.

Section 01

The KEV Threshold — AI Tooling Enters the Catalog

LiteLLM on CISA KEV: A Threshold Crossed

CISA’s Known Exploited Vulnerabilities catalog is not a vulnerability database. It is a list of vulnerabilities that have already been used in the wild against U.S. federal systems — an operational signal that an adversary has both the means and the motive to exploit a given flaw at scale. Federal civilian agencies are bound by Binding Operational Directive 22-01 to remediate KEV-listed vulnerabilities within tight deadlines.⁴ When something enters KEV, it has moved from theoretical risk to demonstrated loss. Until May 8, 2026, the catalog contained operating systems, browsers, network devices, remote-access platforms, and endpoint security tools. It did not contain AI infrastructure tooling. CVE-2026-42208 changes that. LiteLLM is the open-source LLM proxy and gateway that thousands of organizations — including production deployments at major enterprises — use to route requests across OpenAI, Anthropic, Azure, Bedrock, and self-hosted model endpoints. The SQL injection in CVE-2026-42208 affects LiteLLM’s management server, the component that holds API keys, routing rules, and audit logs.⁵ A successful exploit reaches credentials for every downstream model API the gateway serves. The blast radius is not one model; it is the enterprise’s entire model-access surface.

The Paradox in One CVE

The transparency that makes LiteLLM auditable also made CVE-2026-42208 discoverable. The repository is public. The codebase is browsable. The SQL queries are readable. Defenders who run LiteLLM can review its security posture before deploying. So can adversaries. This is the open-source paradox in its purest form: the property that earns institutional trust — legibility — is the same property that lowers the cost of adversarial reconnaissance. In a closed model gateway, finding this vulnerability would have required behavioral fuzzing, binary analysis, or insider access. In LiteLLM, it required reading. CISA’s decision to add the CVE to KEV is the indicator that someone did the reading and put it to use. For an underwriter writing cyber coverage on an enterprise running LiteLLM as production infrastructure, the relevant question is no longer whether such a vulnerability could exist. It is whether the insured has rotated credentials, patched to LiteLLM v1.81.1 or later, and audited their gateway logs for the indicators of compromise CISA referenced in the catalog entry — and whether their policy form contemplates any of those answers.¹

May 8, 2026

First date an AI infrastructure tool (LiteLLM, CVE-2026-42208) was added to the CISA Known Exploited Vulnerabilities catalog — threshold crossed from theoretical to operationally exploited¹

89%

Organizations adopting AI that report using open-source AI in some form in their infrastructure — the denominator for the KEV exposure that just opened⁶

Section 02

Three Compromises, One Pattern

The Three-Week Cluster: LiteLLM, PyTorch Lightning, Axios

W17 – W19 2026

Compromise 1 LiteLLM CVE-2026-42208 (KEV)

Compromise 2 PyTorch Lightning 2.6.2/2.6.3

Compromise 3 Axios 1.14.1 / 0.30.4 (Sapphire Sleet)

Layer Hit Gateway, framework, dependency

Common Denominator Open-source, enterprise-embedded

Underwriting Visibility None on standard cyber forms

The gateway layer (LiteLLM). A SQL injection in a management server that holds credentials for every downstream model API. A successful exploit yields the enterprise’s entire model-access keyring in one query. There is no closed-source equivalent of LiteLLM in widespread enterprise use: the gateway pattern is a phenomenon native to the open-source AI ecosystem, where teams compose multi-provider routing layers rather than committing to a single vendor SDK. The compromise vector is therefore architecturally tied to the open-source posture.⁵

The framework layer (PyTorch Lightning). Two consecutive PyPI releases — versions 2.6.2 and 2.6.3 — flagged as compromised on April 30, 2026, with the published advisory recommending pinning to 2.6.1 and rotating any secrets accessible during build or training. PyTorch Lightning is the orchestration framework that sits above the torch primitive and is used by research teams and production ML platforms alike. A poisoned version executing during a training run has direct access to the credentials and data the training job was authorized to use. The remediation window between publish and discovery is the loss window.²

The dependency layer (Axios). A widely-installed HTTP client compromised at npm versions 1.14.1 and 0.30.4 with attribution to Sapphire Sleet, a North Korean actor with a documented history of cryptocurrency-theft campaigns repurposed for broader supply-chain operations. Axios is rarely a direct dependency of an AI application — but it is a transitive dependency of dozens of AI SDKs, agent frameworks, and orchestration tools. The compromise reaches AI workloads not because anyone installed it, but because something they installed installed it.³

The Pattern

Three compromises in three layers of the AI delivery pipeline in three consecutive market weeks — gateway, framework, dependency — against open-source packages that the same underwriting application does not name and does not enumerate. The pattern is not coincidence. It is the consequence of 89% AI-adoption-rate exposure to a software supply chain that has only recently come under sustained adversarial pressure.⁶

Section 03

The Underwriter’s Audit Problem

Open-source AI is no longer a fringe deployment posture. The Linux Foundation Research’s May 2025 market impact study — commissioned by Meta and synthesizing multiple primary surveys — reports that 89% of organizations that have adopted AI use open-source AI in some form in their infrastructure, and approximately 63% are actively using an open model.⁶ The Stanford HAI AI Index 2025 reports that 66% of the 149 notable foundation models released in 2023 were released under open licenses — up from 44% in 2022 and 33% in 2021.⁷ The Stack Overflow 2025 Developer Survey reports that among developers building AI agents, Ollama is used by 51% and LangChain by 33% — making open-source frameworks the dominant infrastructure layer for agentic workloads.⁸ Hugging Face surpassed 2 million hosted models in 2025, with the second million accumulating in 335 days — 65% faster than the first.⁹ The transformers library on PyPI records approximately 142 million downloads per month; torch records approximately 83 million.¹⁰

66%

Foundation models released in 2023 under open licenses — up from 33% in 2021. The composition of the model stack has flipped open in three years.⁷

142M

Monthly PyPI downloads of the transformers library — the scale at which a single compromised release reaches production environments¹⁰

335 days

Time for Hugging Face to add its second million hosted models — the pace at which the open model surface is expanding⁹

A cyber underwriting application that does not enumerate which open-source AI packages an applicant runs in production — which model gateway, which inference engine, which orchestration framework, which embedding library, which vector store client — is a form that cannot price the risk it is being asked to cover. The dependency graph that determines the actual attack surface is invisible to the questionnaire. When an underwriter sees a clean cyber control answer set and writes coverage at a soft-market rate, the LiteLLM-on-KEV scenario is the loss that follows. Not because the controls were misrepresented; because the inputs that would have surfaced the exposure were never solicited.

The mature analogue is the property market’s relationship with concentration data. Property underwriters do not write a building without knowing the ZIP code, the construction class, the occupancy, the protective devices, and the catastrophe modeling overlay. None of those data points existed in the original property questionnaire. They emerged because losses taught the market what to ask. The AI software dependency graph is at the analogous moment. The losses are beginning to teach. The questionnaires have not yet caught up.

An actionable starting point: applicants running AI workloads should be required to disclose (i) the open-source LLM gateways and proxies in production, with version and patch status; (ii) the open-source training and inference frameworks in production, with the build provenance of any pinned versions; (iii) the open-source agent orchestration tools in production, and whether their configuration surfaces are write-controlled; and (iv) the cadence and method by which the applicant audits new versions of these dependencies before production deployment. Until those four questions are routine, the LiteLLM scenario is uninsurable by design.

Section 04

What Closed Source Buys You — And What It Doesn’t

The natural counter-question is whether closed-source AI deployments avoid the paradox. The honest answer is: partially, and at a different cost. Closed-source model APIs from OpenAI, Anthropic, and the major cloud-resident model platforms remove the open-source paradox at the model layer. The weights are not browsable. The serving infrastructure is not auditable from outside. An adversary cannot read the source code looking for SQL injections in the model gateway. The transparency cost is paid in the other direction: the customer cannot audit the model itself for the same reasons the adversary cannot. This is a trade, not a victory.

Closed

Adversary cannot read source — reduced reconnaissance leverage at the model and serving layers, but the customer also cannot audit the model

Open

Customer can audit and pin specific versions — but adversary has the same access to discover and weaponize defects

Hybrid

Most enterprises run closed model APIs through open-source gateways — inheriting both audit obligations and adversarial-reconnaissance exposure at the gateway layer

Underwriting

Risk pricing must distinguish between these three postures — current cyber forms do not. The hybrid posture is the most common and the least understood

The most common enterprise deployment pattern in 2026 is hybrid: closed-source model APIs — OpenAI, Anthropic, Bedrock, Vertex — accessed through open-source gateway and orchestration layers — LiteLLM, LangChain, LlamaIndex, vLLM. This pattern carries both postures’ risk profiles simultaneously. The model itself is in a closed system the customer cannot audit. The gateway in front of the model is in an open system the adversary can audit. CVE-2026-42208 is the exact scenario this pattern produces under adversarial pressure: the model is protected behind a hardened API; the gateway in front of the model is the breach point.⁵ Underwriting applications that do not differentiate between “runs OpenAI” and “runs OpenAI through LiteLLM” are pricing two materially different exposures as one.

A second observation matters for the closed-source side: vendor transparency hubs are now functioning as the de facto audit substitute. Anthropic’s Transparency Hub publishes voluntary commitments, responsible disclosure policy, and bug bounty channels.¹¹ OpenAI’s Trust and Transparency page publishes government request data, child safety practices, and moderation transparency materials. These are the disclosure surfaces a regulator or examiner would consult when the question is whether a closed-source provider is meeting reasonable security and governance standards. They are not audits. They are statements. The asymmetry between “we attest to these properties” and “we have evaluated these properties externally” matters — and is a question worth asking every closed-source AI counterparty.

Section 05

EU AI Act + Open Source — How August 2 Lands Differently

On May 7, 2026, the European Commission published the draft guidelines on transparency obligations under Article 50 of the EU AI Act — the article that governs disclosure requirements for AI systems including obligations to inform users when they are interacting with AI, to label AI-generated content, and to publish documentation sufficient to enable downstream operators to meet their own compliance obligations.¹² The guidelines arrive twelve weeks before the August 2, 2026 effective date for the high-risk system obligations — the deadline that pulls post-market monitoring, incident reporting, and conformity assessment from the future tense into the present tense for any AI deployment touching the EU market.¹³

For open-weight deployments, the Article 50 obligations land with a particular weight. The transparency obligation is satisfied differently when the underlying model is open: the operator can produce the model card, the training data documentation, the evaluation results, and the version history because all of those artifacts are part of the open release. The same operator cannot, however, satisfy the obligation simply by pointing to the upstream open release — the operator is responsible for documenting how the model is used in the deployed system, including any fine-tuning, retrieval augmentation, agent scaffolding, or guardrail configuration applied. Open-weight deployment is not a transparency shortcut. It is a transparency baseline with operator-specific additions required on top.

For closed-model deployments, the Article 50 obligations land differently again. The operator depends on the model provider’s documentation to satisfy parts of the disclosure requirement. Where the provider publishes detailed system cards and evaluation reports, the operator’s obligation is lighter. Where the provider publishes only general statements, the operator’s obligation is heavier — because the regulator’s question (“what does this system actually do, and how do you know?”) does not become easier to answer when the upstream provider is opaque. The Article 50 guidelines reinforce that the deploying operator carries the ultimate disclosure obligation regardless of the upstream model’s posture.

August 2, 2026 — What Becomes Operative

EU AI Act

Post-market monitoring becomes mandatory for high-risk AI systems — including documented procedures for detecting performance degradation, behavioral drift, or unintended outcomes after deployment. For open-source-heavy stacks, this obligation reaches into every component the operator embeds: the gateway, the inference engine, the orchestration framework. A monitoring plan that covers only the model and not the gateway in front of it does not satisfy the obligation.¹³

Serious incident reporting becomes a regulatory obligation. When an AI system causes or contributes to a serious incident — defined to include serious privacy breaches, infrastructure disruption, or rights infringements — the operator must report to the relevant national competent authority within 15 days. A CVE-2026-42208-class compromise of an open-source gateway in a high-risk EU deployment would, from August 2 onward, plausibly trigger this obligation. The cyber insurer’s response timeline now sits inside a regulatory reporting clock.¹³

Conformity assessment documentation must be in place for high-risk systems, including documentation of the data used for training and testing, the risk management system, the quality management system, and the technical specifications of the deployed AI. For open-source stacks, this documentation must include the pinned versions, the audit findings, and the rationale for accepting any residual risk — including the residual risk of running components that were not built or audited by the operator itself.¹³

The Underwriting Implication

Cyber forms drafted before August 2, 2026 will be asked to respond to incidents that, post-August 2, are also regulatory reporting events. The coverage question expands from “what is the loss?” to “what is the loss, plus the regulatory exposure, plus the post-market monitoring deficiency the regulator may surface during incident response?” Open-source-heavy stacks compound the exposure because the documentation burden is higher and the operator carries more of it directly. This is not an argument against open-source AI. It is an argument for underwriters to begin documenting which posture an insured runs and pricing the regulatory-overlay exposure into the premium.

Section 06

Market Index — W19 Reading

ASI Market Index W19: 37.7

Up 0.1 from W18 (37.6). The composite ticked up this week against regulatory pressure (EU AI Act transparency guidelines published May 7) and software supply-chain pressure (LiteLLM KEV addition, PyTorch Lightning compromise). Signal of the week: the regulatory track — the EU clock advanced.

ASI Market Index → Full Signal Detail

The ASI Market Index reads 37.7 for Week 19, a 0.1-point increase from the W18 close of 37.6. The directional move is modest — the kind of week-over-week change that, in isolation, would not invite analysis. What earns the analysis is the composition. The W19 increase is driven principally by two of the index’s constituent signals: the regulatory track, which advanced on the European Commission’s May 7 publication of draft Article 50 transparency guidelines and the related implementation-timeline updates ahead of August 2, 2026; and the software supply-chain track, which absorbed the LiteLLM KEV listing and the PyTorch Lightning advisory in the same week. The ranker selected the EU AI Act guidance as Signal of the Week with a score of 1.4375, well above the runner-up at 1.0062 — reflecting both the regulatory weight of an Article 50 instrument and its proximity to the August 2 effective date.

The relevance for underwriters: the Market Index is registering pressure in two signals that together describe this edition’s thesis. Regulatory tightening on one side; open-source software supply-chain compromise on the other. Both pressures are structural, neither is a one-week anomaly, and both bear directly on how cyber coverage of AI-dependent enterprises should be priced through the back half of 2026. The composite’s 0.1-point move understates the directional weight of what the underlying signals are reporting. The full index page carries the per-signal breakdown and the W19 audit record.

Section 07

The Bottom Line — Five Takeaways for W19

Watchlist — Open-Source AI Security Posture

May 11, 2026

AI infrastructure tooling has entered the CISA KEV catalog

LiteLLM CVE-2026-42208 (added May 8, 2026) is the first AI-specific infrastructure component to appear on the actively-exploited list. The threshold from theoretical to operational has been crossed at the gateway layer. Underwriters writing cyber on AI-dependent enterprises should now treat KEV monitoring of AI-tooling vendors as a standing intake question.¹

Three layers compromised in three consecutive weeks — the pattern is not coincidence

LiteLLM (gateway), PyTorch Lightning (framework), Axios (transitive dependency). When 89% of AI-adopting organizations use open-source AI somewhere in their infrastructure, the dependency graph is the actual underwriting exposure — and current cyber applications do not enumerate it.⁶

The hybrid posture is the most common and the most under-priced

Most enterprises run closed-source model APIs through open-source gateway and orchestration layers. This hybrid pattern inherits both posture profiles’ risk surfaces simultaneously. Underwriting forms that do not distinguish “runs OpenAI” from “runs OpenAI through LiteLLM” are pricing materially different exposures as one.⁵

The EU AI Act post-market monitoring obligation applies to the whole stack — not just the model

August 2, 2026 brings post-market monitoring, serious incident reporting, and conformity assessment into legal effect for high-risk systems. A monitoring plan that covers only the model and not the open-source gateway in front of it does not satisfy the obligation — and a CVE-class compromise of that gateway is plausibly a 15-day reportable event.¹³

Four questions to add to every AI cyber underwriting application

(i) Which open-source LLM gateways and proxies are in production, with version and patch status. (ii) Which open-source training and inference frameworks are in production, with build provenance. (iii) Which agent orchestration tools are in production, with configuration write-control posture. (iv) How and how often the applicant audits new versions of these dependencies before production deployment. Until these are routine, the dependency-graph exposure remains uninsurable by design.