TL;DR

An Anthropic Frontier Red Team analysis, summarized by Thorsten Meyer AI, mapped 832 banned malicious cyber accounts from March 2025 to March 2026 onto MITRE ATT&CK. The analysis says technique counts are no longer a reliable proxy for attacker danger, while autonomous model scaffolding is becoming a stronger risk signal.

An Anthropic Frontier Red Team analysis of 832 accounts banned for malicious cyber activity between March 2025 and March 2026 found that common threat-rating signals are losing value as AI helps less-skilled actors perform techniques once associated with more capable operators.

The accounts were mapped onto MITRE ATT&CK, the standard taxonomy many security teams use to classify attacker tactics and techniques. The dataset is described as a detailed window into cases with enough information for technique mapping, not a full census of all AI-enabled cyber misuse.

According to the source material, 67.3% of the banned accounts, or 560 accounts, used AI to help write malware. A smaller share, 6.5%, or 54 accounts, used AI for lateral movement inside networks. The share of actors rated medium-risk or higher rose from 33% in the first six months to 56% in the second six months, a roughly 1.7-fold increase across the year.

The analysis says the number of techniques an actor used no longer separated weaker and stronger actors well. Least-skilled actors averaged 16 techniques, while the most-skilled averaged 20. The platform used, including Claude Code, API access or chat, also did not correlate with risk in the summary provided.

ThorstenMeyerAI.com
AI & Security · Field Note
AI-enabled cyber threats · a year mapped

The frameworks can’t see the thing that matters

For decades, danger meant which techniques an attacker commands. A year of real AI-enabled attacks — 832 banned accounts mapped onto MITRE ATT&CK — shows that signal breaking, just as a new, harder-to-see one takes over.

Anthropic Frontier Red Team · Mar 2025–Mar 2026 · 832 accounts · via Verizon DBIR
01The dataset

A year of real misuse, mapped to the standard taxonomy

A window, not a census — these are the cases with enough detail to assess techniques thoroughly. Inside it, the risk level climbed fast.

WHAT WAS STUDIED

832 accounts
Banned for malicious cyber activity, Mar 2025–Mar 2026, mapped onto MITRE ATT&CK. The most common AI use was prep — 67.3% (560) used AI to help write malware; 6.5% (54) for lateral movement deep inside networks.

THE RISK CLIMB · MEDIUM-OR-HIGHER ACTORS

First 6 months33%
33%
Second 6 months56%
56%
≈ 1.7× increase in a single year
02The measurement breaks · press play
Amazon

AI malware detection tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Amazon

AI cybersecurity threat detection tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

“More techniques” stopped meaning “more dangerous”

The old heuristic: count the techniques, judge the tooling. AI dissolved it — because the model supplies the techniques either way. Watch the old signal fail, then watch what it misses.

Risk score vs. technique count

Two ways to read the same attacker. One is going blind. Press play.

the old signalSkill ≈ number of techniques?
Least-skilled
16
Most-skilled
20
16 vs. 20. A novice and an expert now look almost alike by technique-count — and the platform (Claude Code / API / chat) didn’t correlate with risk either.
what it missesThe Nov 2025 espionage operation
by technique count
30
techniques · 13 tactics
Looks like many medium-risk actors. Unremarkable.
by risk-scoring methodology
100
max risk score
The model ran as an autonomous agent — same case.
The most dangerous attribute of the year’s most dangerous attack is taxonomically invisible. ⌁ there is no MITRE ATT&CK ID for agentic orchestration
03Where the AI moved
Amazon

cyber threat intelligence software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Amazon

malware analysis software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Deeper into the attack — and into less-skilled hands

Across the year, AI use drifted from getting in toward acting once already inside — the operationally demanding stages that used to require an expert.

The attack lifecycle · where AI is now applied

The center of gravity moved right — toward post-compromise work.

Initial access
phishing, getting in
Account discovery
finding valid accounts
Lateral movement
navigating the network
Privilege escalation
deeper control
↓ 8.6%
AI-assisted phishing
A classic way to gain access — falling.
↑ 8.9%
AI for account discovery
Post-compromise work — rising.
The crack in the old model: post-compromise techniques used to be restricted to actors skilled enough to perform them. AI can now perform them on behalf of less sophisticated actors — the dangerous deep stages are no longer self-limiting.
04What actually predicts danger now
Amazon

network security monitoring devices

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Amazon

network security monitoring devices

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

From “what they know” to “what they’ve built”

The report sorts the signals into three tiers — one dead, one fading, one durable.

🔢

Technique count & tooling

16 vs. 20 between novice and expert; platform doesn’t correlate. The model supplies the techniques either way.

dead signal
📍

Where in the lifecycle AI is applied

Concentrating on operationally demanding, post-compromise stages is a better signal — but it’s eroding as the whole population heads there.

fading signal
🏗️

The scaffolding around the model

Architectures that let the model chain stages and run with minimal human input. Not what they know — whether they’ve built a system that lets AI run the attack.

durable signal
05What follows · read straight
Amazon

AI-powered intrusion detection system

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Amazon

cybersecurity threat intelligence platforms

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Fixing the map before the territory moves again

A taxonomy that can’t name the most dangerous behavior on the field will quietly mislead the people relying on it. The response runs in two directions.

🛡️ defensively

Fed back into the models

The findings informed safeguards on the most capable models, built to detect & block some of what was observed:

  • Blocking malware development
  • Blocking mass data exfiltration
  • Putting tools in defenders’ hands first (Project Glasswing)
🧭 institutionally

Taking it to the source

Following the Verizon work, Anthropic says it’s in discussions with MITRE about how ATT&CK might evolve:

  • A vocabulary for agentic orchestration
  • Naming the scaffolding that makes a model an operator
  • An interactive technique visualization on the Red blog

Reading it in proportion

  • The 832 cases are a detailed subset, not the full population — the precise percentages are directional, not definitive.
  • “More autonomous” is not “fully autonomous” — even the standout case needed human input at key moments, which is itself a place for defenders to intervene.
  • This is one vendor’s window — the company with visibility into misuse of its own model, publishing what it found. The right thing to do with the data, and worth remembering as you read it.
ThorstenMeyerAI.com
Source: Anthropic, “What we learned mapping a year’s worth of AI-enabled cyber threats” (Jun 3, 2026) · Frontier Red Team · Verizon 2026 DBIR · figures per the report · independent commentary · findings only, no operational detail.

Why It Matters

The finding matters because many security programs still rely on technique counts, tooling sophistication and lifecycle position to judge attacker capability. If AI can supply many techniques on demand, then a low-skill actor may appear similar to a stronger operator when measured only through the old lens.

The analysis points instead to the architecture around the model: systems that let AI chain attack stages, act with limited human input and operate across post-compromise tasks. That kind of scaffolding is harder to capture in a taxonomy built around human-executed techniques, yet the source material says it was central to the highest-risk case reviewed.

Background

MITRE ATT&CK has long helped defenders organize cyber activity into tactics and techniques, from initial access to privilege escalation and lateral movement. That structure remains useful for describing what happened during an intrusion, but the Anthropic-linked analysis argues that it is weaker at describing who, or what, is doing the operational work when an AI model is orchestrating steps.

The source material cites a November 2025 espionage operation as the clearest example. By technique count, the operation used 30 techniques across 13 tactics, which made it look similar to many medium-risk actors. By Anthropic’s risk-scoring method, the same case reached the maximum score because the model operated as an autonomous agent.

The analysis also says AI use moved deeper into the attack lifecycle during the year. AI-assisted phishing fell by 8.6%, while AI use for account discovery rose by 8.9%. The source material frames that movement as a shift toward post-compromise work that once demanded more operator skill.

“there is no MITRE ATT&CK ID for agentic orchestration”

— Thorsten Meyer AI field note on MITRE ATT&CK coverage

What Remains Unclear

The dataset does not show the full scale of AI-enabled cyber activity, because it covers banned accounts with enough detail for mapping. It is also unclear how representative these cases are across other AI systems, threat groups or criminal markets.

It remains unsettled how MITRE ATT&CK may change, whether a durable vocabulary for agentic orchestration will be adopted, and how well model safeguards will hold as attackers adapt their methods.

What’s Next

Anthropic says the findings informed safeguards on its most capable models, including controls aimed at blocking malware development and mass data exfiltration, and tools for defenders under Project Glasswing. Following related Verizon work, Anthropic also says it is in discussions with MITRE about how ATT&CK might evolve to describe agentic orchestration and the scaffolding that can turn a model into an operator.

Key Questions

What is the actual news development?

An Anthropic Frontier Red Team analysis mapped 832 banned malicious cyber accounts from March 2025 to March 2026 onto MITRE ATT&CK and found that traditional measures, especially technique counts, are becoming weaker indicators of attacker danger.

What is confirmed by the source material?

The confirmed figures in the provided material are the 832 banned accounts, the March 2025 to March 2026 study period, the 67.3% malware-writing use share, the 6.5% lateral-movement use share, and the rise in medium-or-higher actors from 33% to 56% across the year.

What is claimed or interpreted?

The interpretation is that AI has weakened the old link between technique count and attacker skill, and that model scaffolding is now a better signal of danger. That conclusion is attributed to the analysis described by Thorsten Meyer AI and Anthropic.

Why does this matter for defenders?

Security teams may under-rank attacks if they focus only on visible techniques. The analysis says the higher-risk signal may be whether an attacker has built an AI-enabled system that can chain actions and carry out post-compromise work with limited human direction.

What remains unclear?

It is not yet clear how broad the pattern is beyond the banned-account dataset, how ATT&CK may be updated, or how quickly defensive tools and model safeguards can adapt to agentic attack workflows.

Source: Thorsten Meyer AI

You May Also Like

ICE Agents Have List of 20 Million People on Their iPhones Thanks to Palantir

ICE officials revealed they can access a list of 20 million potential targets via Palantir, boosting enforcement capabilities but raising privacy concerns.

Tesla’s $1.3 Billion Bitcoin Position Nets $80 Million Profit

Discover how Tesla’s $1.3 billion Bitcoin holdings have gained an $80 million unrealized profit, revealing impacts on their financial future.

Show HN: Rmux – A programmable terminal multiplexer with a Playwright-style SDK

RMUX v0.2.0 introduces a Rust-based, tmux-compatible multiplexer with SDK, daemon support, and native integrations, enabling scripting and inspection.

AI Rivalry Heats Up: Grok 3 Overtakes Key Benchmarks

Unprecedented advancements in AI are reshaping the competitive landscape, but what will this mean for the future of technology and innovation?