TL;DR

Anthropic has published lessons from running hundreds of Claude Code Skills internally, saying the most useful Skills package instructions with scripts, references, templates and checks. The confirmed development is a June 3, 2026 Claude blog post; claims about quality gains come from Anthropic’s own measurements and have not been independently verified.

Anthropic has published a Claude Code engineering write-up explaining what it says it learned from running hundreds of reusable Skills inside its own engineering organization, framing Skills as folders agents can read and run rather than saved prompts.

The post, cited by Thorsten Meyer AI and attributed to Thariq Shihipar on Anthropic’s Claude blog, describes a Skill as a shareable folder that can include SKILL.md instructions, reference files, scripts, templates, configuration, hooks and memory. The confirmed point is architectural: Anthropic is presenting Skills as reusable file-system packages, not one-off prompt snippets.

According to the source material, Anthropic’s internal Skills clustered into nine categories: library and API reference, product verification, data fetching and analysis, business-process automation, code scaffolding and templates, code quality and review, CI/CD and deployment, runbooks, and infrastructure operations. Anthropic’s measured claim, as summarized by Thorsten Meyer AI, is that verification Skills had the strongest effect on output quality.

The July 1 dispatch from Thorsten Meyer AI interprets the post as a business signal as well as an engineering guide. Its central reading is that reusable Skills can turn repeated agent instructions into versioned institutional knowledge, giving teams a way to capture internal practices, guardrails and workflows in a format coding agents can apply repeatedly.

At a glance

When: Anthropic post published June 3, 2026;…

The development: Anthropic published a Claude Code engineering post describing what it learned from using hundreds of reusable Skills across its own engineering organization.

AI Dispatch · Insights · 1 July 2026

A Skill is a folder, not a prompt

Q: What did Anthropic publish?

Anthropic published a Claude Code engineering post about lessons from using hundreds of Skills across its own engineering organization.

Q: What is a Skill in this report?

A Skill is described as a folder that can hold instructions, references, scripts, assets, configuration and hooks. The agent can read and run parts of that folder when a task calls for them.

Q: What was Anthropic’s main claimed finding?

According to the source material, Anthropic found that verification Skills, which check the agent’s work, had the strongest effect on output quality. That claim is attributed to Anthropic’s own measurement.

Anthropic published what it learned running hundreds of Skills across its own engineering org. Read as a business memo, the point is bigger than a coding trick: this is how ad-hoc prompting becomes durable institutional capability — the SOPs your agents actually follow, versioned and shared.

✕ The misconception

“A Skill is just a clever markdown prompt you save in a file.”

✓ What it actually is

A folder the agent can discover, read & run — instructions, scripts, references, templates, config & on-demand hooks.

Anatomy of a Skill — the file system is context engineering

my-skill/ (the unit you share & version)
├─ SKILL.md: root instructions + a description written for the model (its trigger)
├─ references/: deep detail pulled in only when needed (progressive disclosure)
├─ scripts/: real code, so the agent composes instead of rebuilding boilerplate
├─ assets/: templates & files to copy into the output
├─ config.json: setup the agent asks for if it’s missing (e.g. which Slack channel)
└─ hooks + memory: on-demand guardrails + an append-only log so it remembers

Why it matters: the folder itself is the knowledge base. The agent reads the root, then reaches deeper only when the task demands it — the same way you’d hand a new hire a one-pager that points to the detailed docs.
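As a rough illustration of the layout above, the following Python sketch scaffolds an empty Skill folder. The directory and file names mirror the diagram. The helper name `scaffold_skill`, the front-matter fields in SKILL.md and the `slack_channel` config key are assumptions made for this example, not Anthropic's specification, so check the docs for the exact SKILL.md format before relying on it.

```python
import json
from pathlib import Path


def scaffold_skill(root: Path, name: str, description: str) -> Path:
    """Create a skeleton Skill folder mirroring the layout in the diagram."""
    skill_dir = root / name
    for sub in ("references", "scripts", "assets"):
        (skill_dir / sub).mkdir(parents=True, exist_ok=True)

    # Root instructions. The front-matter fields are an assumption for
    # illustration; the description is written for the model, not for humans.
    skill_md = (
        "---\n"
        f"name: {name}\n"
        f"description: {description}\n"
        "---\n\n"
        "## Gotchas\n\n"
        "- (record one hard-won caveat here)\n"
    )
    (skill_dir / "SKILL.md").write_text(skill_md, encoding="utf-8")

    # Hypothetical config key: a null value marks setup the agent should ask for.
    (skill_dir / "config.json").write_text(
        json.dumps({"slack_channel": None}, indent=2), encoding="utf-8"
    )
    return skill_dir
```

Calling `scaffold_skill(Path("skills"), "my-skill", "Use when verifying a release build.")` would produce a folder that can be checked into version control and filled in over time.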

The nine types — a gap-analysis map for your own library

1. Library / API reference
2. Product verification (★ top impact)
3. Data fetching & analysis
4. Business-process automation
5. Code scaffolding & templates
6. Code quality & review
7. CI/CD & deployment
8. Runbooks
9. Infrastructure operations

By Anthropic’s own measurement, verification Skills — the ones that check the work — moved output quality the most. If you build one category well, build that one.

The craft — what separates a good Skill from a useless one

Gotchas = highest-signal section

Describe for the model, not humans (it’s the trigger)

Don’t state the obvious

Ship scripts, not just prose

On-demand guardrail hooks (/careful, /freeze)

Let it remember (log / SQLite)

Don’t railroad: leave room to adapt
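The "let it remember" tip mentions a log or SQLite. One way that could look is sketched below with Python's standard `sqlite3` module. The table name `memory`, its columns and the helper names are hypothetical choices for this example; the source does not describe Anthropic's actual storage format.

```python
import sqlite3
from datetime import datetime, timezone
from typing import List


def open_memory(path: str) -> sqlite3.Connection:
    """Open (or create) the Skill's memory store. The schema is an assumption."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS memory ("
        "id INTEGER PRIMARY KEY AUTOINCREMENT, "
        "created_at TEXT NOT NULL, "
        "note TEXT NOT NULL)"
    )
    return conn


def remember(conn: sqlite3.Connection, note: str) -> None:
    """Append a note. This module only ever inserts rows, never edits them."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO memory (created_at, note) VALUES (?, ?)",
            (datetime.now(timezone.utc).isoformat(), note),
        )


def recall(conn: sqlite3.Connection, limit: int = 5) -> List[str]:
    """Return the most recent notes, newest first."""
    rows = conn.execute(
        "SELECT note FROM memory ORDER BY id DESC LIMIT ?", (limit,)
    ).fetchall()
    return [row[0] for row in rows]
```

Keeping the store append-only by convention means earlier observations are never silently overwritten; a plain text log file would serve the same purpose with less structure.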

The take

The knowledge of how your organization actually operates can be captured, versioned, shared & executed — and the thing capturing it is a humble folder with a script and a gotchas list inside. For the builder, that’s context engineering with real tools attached. For whoever owns the budget, it’s the difference between AI that starts from zero every morning and an asset that compounds. Caveats: best practices are still evolving, checked-in Skills cost context, and curation beats accumulation. Start with one Skill, one gotcha, and the category that catches your mistakes.

Source: “Lessons from building Claude Code: How we use skills,” Thariq Shihipar (Anthropic), Claude blog, 3 June 2026. Categories, examples & measured claims are Anthropic’s; framing is the author’s. Docs: code.claude.com/docs/en/skills.


Agent Workflows Become Shared Assets

For engineering teams using coding agents, the development matters because it points to a way of reducing repeated setup work. Instead of rewriting the same prompt each day, teams can package instructions, scripts, templates and checks into a folder that agents can discover when a task calls for it.

The business relevance is consistency. If the approach works beyond Anthropic’s own environment, a Skill library could help teams apply the same review standards, deployment steps or product checks across projects. That could make agent-assisted work less dependent on one person’s prompt-writing habits and more tied to shared operating practices.


From Prompts To Skill Libraries

The source material contrasts a saved prompt with a Skill folder. A prompt mainly tells an agent what to do in the current exchange; a Skill can also provide supporting files, runnable code, examples and guardrails that the agent can use only when needed.

Thorsten Meyer AI describes this as progressive disclosure: the agent reads the root Skill instructions first, then reaches into references, scripts or assets when the task requires more detail. The dispatch compares that pattern to giving a new hire a short guide that points to deeper documentation.
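The progressive-disclosure pattern described above can be sketched in a few lines of Python. This is only an illustration of the idea, under the assumption that a Skill has a root SKILL.md and a `references/` directory as in the dispatch's diagram. It is not how Claude Code loads Skills internally, where the agent itself decides which files to open, and `load_skill_context` is a hypothetical helper name.

```python
from pathlib import Path
from typing import Sequence


def load_skill_context(skill_dir: Path, wanted_refs: Sequence[str] = ()) -> str:
    """Return the root instructions plus only the references that were requested."""
    # The root file is always read first; it is the short guide.
    parts = [(skill_dir / "SKILL.md").read_text(encoding="utf-8")]
    # Deeper material is pulled in only when the task asks for it by name.
    for ref_name in wanted_refs:
        ref_path = skill_dir / "references" / ref_name
        if ref_path.is_file():  # missing references are skipped, not fatal
            parts.append(ref_path.read_text(encoding="utf-8"))
    return "\n\n".join(parts)
```

With no references requested, only the root instructions enter the context, which is the point of the pattern: detail costs context only when a task needs it.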

The post also frames curation as part of the work. The source material says best practices are still evolving, checked-in Skills consume context, and accumulation alone is not the goal. The recommended starting point is narrow: build one useful Skill, capture one hard-won caveat, and focus first on checks that catch mistakes.

“A Skill is a folder, not a prompt.”
— Thorsten Meyer AI dispatch


Limits Outside Anthropic Remain Open

It is not yet clear how well Anthropic’s internal results apply to smaller teams, non-Anthropic tools or organizations without mature engineering practices. The source material refers to hundreds of Skills and Anthropic’s own measurements, but it does not provide a full public dataset, exact metric definitions or independent validation.

There is also an adoption question. Skills can become useful shared assets, but only if teams maintain them, remove stale instructions and keep scripts aligned with current systems. Without that maintenance, a Skill library could become another set of outdated internal docs.
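One lightweight way to watch for stale Skills is a periodic scan of the library. The Python sketch below flags Skill folders whose newest file has not changed in a given number of days. It assumes a library laid out as one folder per Skill, each with a SKILL.md at its root, and it uses file modification time as a crude proxy for maintenance; version-control history would be a better signal where available. The function name `find_stale_skills` is hypothetical.

```python
import time
from pathlib import Path
from typing import List, Optional


def find_stale_skills(
    library: Path, max_age_days: float, now: Optional[float] = None
) -> List[str]:
    """List Skill folders whose newest file is older than max_age_days."""
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400  # seconds per day
    stale = []
    for skill_md in sorted(library.glob("*/SKILL.md")):
        skill_dir = skill_md.parent
        # SKILL.md exists, so there is always at least one file to inspect.
        newest = max(
            p.stat().st_mtime for p in skill_dir.rglob("*") if p.is_file()
        )
        if newest < cutoff:
            stale.append(skill_dir.name)
    return stale
```

A result such as a list of folder names is a prompt for human review, not an automatic deletion list: an old Skill may simply be stable.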


Builders Start With Verification Skills

The next practical step for teams is to test the Skill approach on one recurring workflow, ideally a verification task that catches errors before code ships. Anthropic’s docs at code.claude.com/docs/en/skills are the cited starting point for implementation details.
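A verification Skill's script could be as small as a runner that executes a list of project checks and reports which ones failed. The Python sketch below shows that shape using the standard `subprocess` module. The function name `run_checks` and the idea of passing checks as name and command pairs are assumptions for illustration; the source does not show Anthropic's own verification scripts.

```python
import subprocess
from typing import List, Sequence, Tuple


def run_checks(checks: Sequence[Tuple[str, List[str]]]) -> List[str]:
    """Run each named command and return the names of the checks that failed."""
    failed = []
    for name, command in checks:
        # A non-zero exit code is treated as a failed check.
        result = subprocess.run(command, capture_output=True, text=True)
        if result.returncode != 0:
            failed.append(name)
    return failed
```

A team would substitute its own commands, for example a test suite and a linter, and have the agent run the script before declaring a task finished. An empty list means every check passed.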

More evidence will be needed to judge the broader impact. Readers should watch for public examples, benchmark details, and case studies showing whether Skill folders improve agent reliability outside Anthropic’s own engineering organization.


Key Questions

Why should businesses care?

The Skill model could let teams turn repeated instructions and internal know-how into versioned, reusable assets. That may reduce repeated prompting and make agent-assisted work more consistent.

What remains unproven?

The public source material does not show full measurement methods or independent tests. It remains unclear how much the same approach helps teams outside Anthropic’s engineering environment.

Source: Thorsten Meyer AI


A Skill Is a Folder, Not a Prompt: What Anthropic Learned Running Hundreds of Them


Author

The Liberty Portfolio Team
