<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>LLM Wiki — Agentic AI Landscape</title><link>https://blog.imfsoftware.com/llm-wiki/</link><description>Recent content on LLM Wiki — Agentic AI Landscape</description><generator>Hugo</generator><language>en-us</language><atom:link href="https://blog.imfsoftware.com/llm-wiki/index.xml" rel="self" type="application/rss+xml"/><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/analyses/cross-source-themes/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/analyses/cross-source-themes/</guid><description>&lt;h1 id="cross-source-theme-analysis"&gt;Cross-Source Theme Analysis&lt;a class="anchor" href="#cross-source-theme-analysis"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;16 sources, 8 tools, 2 standards, 3 methodologies, 1 practitioner account, 2 skill/eval resources. Here are the themes that appear across 3+ sources independently — not because they reference each other, but because they converged on the same ideas.&lt;/p&gt;
&lt;blockquote class='book-hint '&gt;
&lt;p&gt;&lt;strong&gt;Note&lt;/strong&gt;: This analysis was originally written against 11 sources. The 5 newest sources (Paperclip, Spec Kit, BMad Method, Anthropic Eval Guide, Promptfoo) strengthen existing themes — particularly Theme 3 (human-in-the-loop spectrum) and Theme 7 (evaluation). A full refresh is recommended when the wiki reaches 20+ sources.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/</guid><description>&lt;h1 id="how-to-eval-a-skill-practical-guide"&gt;How to Eval a Skill (Practical Guide)&lt;a class="anchor" href="#how-to-eval-a-skill-practical-guide"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Anthropic&amp;rsquo;s prompt evals measure whether a prompt produces good output. Skill evals are harder because a skill has more surface area: it needs to trigger correctly, execute the right steps, use the right tools, produce the right output, and NOT trigger on the wrong inputs.&lt;/p&gt;
&lt;p&gt;This guide maps Anthropic&amp;rsquo;s eval methodology onto skills, drawing from the wiki&amp;rsquo;s sources.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="the-key-difference-prompts-vs-skills"&gt;The Key Difference: Prompts vs. Skills&lt;a class="anchor" href="#the-key-difference-prompts-vs-skills"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;&lt;/th&gt;
 &lt;th&gt;Prompt Eval&lt;/th&gt;
 &lt;th&gt;Skill Eval&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;What you test&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Does this prompt produce good output?&lt;/td&gt;
 &lt;td&gt;Does this skill trigger, execute, and produce correctly?&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Input&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;A prompt + expected output&lt;/td&gt;
 &lt;td&gt;A prompt + context + expected behavior chain&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Failure modes&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Bad output&lt;/td&gt;
 &lt;td&gt;Wrong trigger, wrong steps, wrong tools, bad output, false positive activation&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Non-determinism&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Output varies&lt;/td&gt;
 &lt;td&gt;Trigger, routing, tool selection, AND output all vary&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;A skill eval must test the full chain: &lt;strong&gt;routing → activation → execution → output → side effects&lt;/strong&gt;.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/analyses/key-insights-agentic-landscape/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/analyses/key-insights-agentic-landscape/</guid><description>&lt;h1 id="key-insights-the-agentic-ai-landscape-april-2026"&gt;Key Insights: The Agentic AI Landscape (April 2026)&lt;a class="anchor" href="#key-insights-the-agentic-ai-landscape-april-2026"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Synthesized from 16 sources across this wiki. This analysis captures the patterns, tensions, and emerging consensus visible when you look across the entire landscape rather than at any single tool.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="1-five-layers-are-emerging"&gt;1. Five Layers Are Emerging&lt;a class="anchor" href="#1-five-layers-are-emerging"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The landscape has organized into five distinct layers:&lt;/p&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Layer&lt;/th&gt;
 &lt;th&gt;Representative&lt;/th&gt;
 &lt;th&gt;Core Bet&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Company&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/"&gt;paperclip&lt;/a&gt;&lt;/td&gt;
 &lt;td&gt;Orchestrate agents into companies with org charts, budgets, governance.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Methodology&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/bmad-method/"&gt;bmad-method&lt;/a&gt;&lt;/td&gt;
 &lt;td&gt;Structure the development process. Specs before code, or adaptive agile workflows.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Infrastructure&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; (GCP)&lt;/td&gt;
 &lt;td&gt;The agent runtime is the hard problem. Be a hypervisor. Stay agnostic.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Product&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt; (AWS)&lt;/td&gt;
 &lt;td&gt;Ship an opinionated end-to-end agent. Autonomy and scale matter most.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Tool&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; (Anthropic), &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/td&gt;
 &lt;td&gt;Make the individual agent excellent. Let users compose upward.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;The methodology layer is new — &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt; (&amp;ldquo;specs before code&amp;rdquo;) and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/bmad-method/"&gt;bmad-method&lt;/a&gt; (&amp;ldquo;expert collaboration over autopilot&amp;rdquo;) represent two competing philosophies for structuring AI-assisted development. &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/"&gt;paperclip&lt;/a&gt; adds the company layer above everything, orchestrating agents into organizations with budgets and governance.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/analyses/ten-pillars-evidence-map/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/analyses/ten-pillars-evidence-map/</guid><description>&lt;h1 id="evidence-map-supporting-the-ten-pillars-framework"&gt;Evidence Map: Supporting the Ten Pillars Framework&lt;a class="anchor" href="#evidence-map-supporting-the-ten-pillars-framework"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;This analysis maps each pillar from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; against real-world evidence collected across 11 sources in this wiki. Your paper acknowledged &amp;ldquo;no original controlled study&amp;rdquo; as a limitation — the wiki now provides post-hoc validation from production implementations.&lt;/p&gt;
&lt;hr&gt;
&lt;h2 id="pillar-1-architecture-and-structure"&gt;Pillar 1: Architecture and Structure&lt;a class="anchor" href="#pillar-1-architecture-and-structure"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Your claim&lt;/strong&gt;: Organize content into clearly defined sections — metadata, interfaces, core logic, workflows, configuration.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Supporting evidence&lt;/strong&gt;:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/strong&gt; codified this into a formal spec: SKILL.md with YAML frontmatter (name, description, license, compatibility, metadata, allowed-tools) + markdown body + optional scripts/references/assets directories. This is now an open standard at agentskills.io.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/strong&gt; implements it: &lt;code&gt;.claude&lt;/code&gt; directory with CLAUDE.md, &lt;code&gt;.claude/rules/&lt;/code&gt;, &lt;code&gt;.claude/skills/&lt;/code&gt;, &lt;code&gt;.claude/agents/&lt;/code&gt;. Hierarchical, scoped (org → project → user → local).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/strong&gt; takes it further: USER/ vs SYSTEM/ separation. Six layers of customization (identity, preferences, workflows, skills, hooks, memory). Upgrade-safe architecture.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/skills-pipeline-sleestk/"&gt;skills-pipeline-sleestk&lt;/a&gt;&lt;/strong&gt; follows the spec exactly: each skill is a directory with SKILL.md + references/ subdirectory.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;Strength&lt;/strong&gt;: Strong. Multiple independent implementations converged on the same structure. The Agent Skills spec formalizes what your paper recommended.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/</guid><description>&lt;h1 id="agent-skills-standard"&gt;Agent Skills Standard&lt;a class="anchor" href="#agent-skills-standard"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;An open standard (agentskills.io) for packaging reusable capabilities that AI agents can load dynamically. Initiated by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;, designed for cross-tool compatibility.&lt;/p&gt;
&lt;h2 id="specification"&gt;Specification&lt;a class="anchor" href="#specification"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A skill is a directory with a &lt;code&gt;SKILL.md&lt;/code&gt; file:&lt;/p&gt;
&lt;pre tabindex="0"&gt;&lt;code&gt;skill-name/
├── SKILL.md # Required: YAML frontmatter + markdown instructions
├── scripts/ # Optional: executable code
├── references/ # Optional: documentation
├── assets/ # Optional: templates, resources&lt;/code&gt;&lt;/pre&gt;&lt;h3 id="required-frontmatter"&gt;Required Frontmatter&lt;a class="anchor" href="#required-frontmatter"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;name&lt;/code&gt;: Max 64 chars, lowercase + hyphens, must match directory name&lt;/li&gt;
&lt;li&gt;&lt;code&gt;description&lt;/code&gt;: Max 1024 chars, what it does and when to use it&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="optional-frontmatter"&gt;Optional Frontmatter&lt;a class="anchor" href="#optional-frontmatter"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;license&lt;/code&gt;, &lt;code&gt;compatibility&lt;/code&gt;, &lt;code&gt;metadata&lt;/code&gt;, &lt;code&gt;allowed-tools&lt;/code&gt; (experimental)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="progressive-disclosure"&gt;Progressive Disclosure&lt;a class="anchor" href="#progressive-disclosure"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The key design principle — skills scale without blowing up context:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-state-model/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-state-model/</guid><description>&lt;h1 id="agent-state-model"&gt;Agent State Model&lt;a class="anchor" href="#agent-state-model"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; tracks agent state across three dimensions:&lt;/p&gt;
&lt;h2 id="phase-lifecycle"&gt;Phase (Lifecycle)&lt;a class="anchor" href="#phase-lifecycle"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The infrastructure lifecycle of the agent container:&lt;/p&gt;
&lt;p&gt;&lt;code&gt;created → provisioning → cloning → starting → running → stopping → stopped (or error)&lt;/code&gt;&lt;/p&gt;
&lt;h2 id="activity-cognitive-state"&gt;Activity (Cognitive State)&lt;a class="anchor" href="#activity-cognitive-state"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;What the agent is doing within the &lt;code&gt;running&lt;/code&gt; phase:&lt;/p&gt;
&lt;p&gt;&lt;code&gt;idle | thinking | executing | waiting_for_input | blocked | completed | limits_exceeded | offline&lt;/code&gt;&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Sticky activities&lt;/strong&gt;: &lt;code&gt;completed&lt;/code&gt;, &lt;code&gt;blocked&lt;/code&gt;, and &lt;code&gt;limits_exceeded&lt;/code&gt; persist until explicit restart or stop&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;blocked&lt;/strong&gt;: Set by agents themselves when waiting for an expected event (e.g., child agent completing) — prevents false stall detection&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;offline&lt;/strong&gt;: Occurs when heartbeat is lost, often due to auth token refresh failure. Fix: stop and restart the agent.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="detail-freeform-context"&gt;Detail (Freeform Context)&lt;a class="anchor" href="#detail-freeform-context"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Freeform text about the current activity — tool name, message, task summary.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/</guid><description>&lt;h1 id="agent"&gt;Agent&lt;a class="anchor" href="#agent"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, an Agent is an isolated process running an LLM + &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt; loop against a task. It is the fundamental unit of execution.&lt;/p&gt;
&lt;h2 id="properties"&gt;Properties&lt;a class="anchor" href="#properties"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Each agent has:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Identity&lt;/strong&gt;: Unique name, container ID, and (in hosted mode) a UUID and URL-safe slug&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Home directory&lt;/strong&gt;: Mounted at &lt;code&gt;/home/&amp;lt;user&amp;gt;&lt;/code&gt; — contains harness config, credentials, &lt;code&gt;agent-info.json&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Workspace&lt;/strong&gt;: Mounted at &lt;code&gt;/workspace&lt;/code&gt; — a dedicated git worktree or cloned repo&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Template&lt;/strong&gt;: The &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/template/"&gt;template&lt;/a&gt; blueprint that seeded its configuration&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Harness&lt;/strong&gt;: The LLM-specific adapter (Claude, Gemini, OpenCode, Codex)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="state-model"&gt;State Model&lt;a class="anchor" href="#state-model"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;See &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-state-model/"&gt;agent-state-model&lt;/a&gt; for the full three-dimensional state tracking system.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/</guid><description>&lt;h1 id="context-management"&gt;Context Management&lt;a class="anchor" href="#context-management"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Strategies for managing the limited context window available to LLM agents, especially in multi-skill pipelines where different agent personas must share information efficiently.&lt;/p&gt;
&lt;h2 id="why-it-matters"&gt;Why It Matters&lt;a class="anchor" href="#why-it-matters"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Context windows are finite. Every skill, instruction, tool definition, and conversation turn consumes tokens. Without management, agents hit limits, lose early instructions, or waste budget on irrelevant context.&lt;/p&gt;
&lt;h2 id="progressive-disclosure"&gt;Progressive Disclosure&lt;a class="anchor" href="#progressive-disclosure"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; both use this pattern:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Metadata&lt;/strong&gt; (~100 tokens): Loaded at startup for all skills — just name + description&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Instructions&lt;/strong&gt; (&amp;lt;5000 tokens): Loaded only when skill activated&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Resources&lt;/strong&gt;: Loaded only when specifically needed&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; extends this: MCP tool definitions are deferred by default (only names consume context until a tool is used).&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/</guid><description>&lt;h1 id="frontier-agent"&gt;Frontier Agent&lt;a class="anchor" href="#frontier-agent"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;A term coined by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/aws/"&gt;aws&lt;/a&gt; for a new class of AI agents with three defining characteristics:&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Autonomous&lt;/strong&gt;: Direct them towards a goal, and they figure out how to achieve it&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Massively scalable&lt;/strong&gt;: Able to perform multiple concurrent tasks and distribute work across agents&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Work independently&lt;/strong&gt;: Operating for hours or days without intervention&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="distinction-from-regular-agents"&gt;Distinction from Regular Agents&lt;a class="anchor" href="#distinction-from-regular-agents"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Regular AI agents can plan and execute multi-step tasks with some autonomy. Frontier agents go further — they&amp;rsquo;re designed for long-running, independent operation at scale, not just responding to individual prompts or short interactive sessions.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/grove/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/grove/</guid><description>&lt;h1 id="grove"&gt;Grove&lt;a class="anchor" href="#grove"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, a Grove is a project workspace where &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;s live. It corresponds to a &lt;code&gt;.scion&lt;/code&gt; directory on the filesystem.&lt;/p&gt;
&lt;h2 id="scope"&gt;Scope&lt;a class="anchor" href="#scope"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Project-level&lt;/strong&gt;: Located at the root of a git repository&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Global&lt;/strong&gt;: In the user&amp;rsquo;s home folder (&lt;code&gt;~/.scion&lt;/code&gt;)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="identification"&gt;Identification&lt;a class="anchor" href="#identification"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Every grove has a unique Grove ID:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Git-backed groves&lt;/strong&gt;: Deterministic UUID v5 derived from namespace + normalized git URL — same repo always maps to same ID regardless of protocol&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Hub-native groves&lt;/strong&gt;: Random UUID v4&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="contents"&gt;Contents&lt;a class="anchor" href="#contents"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A grove contains:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/</guid><description>&lt;h1 id="harness"&gt;Harness&lt;a class="anchor" href="#harness"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, a Harness is an adapter that allows a specific LLM tool to run within the Scion orchestration layer. It handles provisioning, configuration, and execution for that tool inside an OCI container.&lt;/p&gt;
&lt;h2 id="purpose"&gt;Purpose&lt;a class="anchor" href="#purpose"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The harness ensures that generic Scion commands (&lt;code&gt;start&lt;/code&gt;, &lt;code&gt;stop&lt;/code&gt;, &lt;code&gt;attach&lt;/code&gt;, &lt;code&gt;resume&lt;/code&gt;) work consistently regardless of the underlying agent software.&lt;/p&gt;
&lt;h2 id="interface"&gt;Interface&lt;a class="anchor" href="#interface"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Each harness implements (Go):&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;DiscoverAuth()&lt;/code&gt; — Locate credentials on the host&lt;/li&gt;
&lt;li&gt;&lt;code&gt;GetEnv()&lt;/code&gt; — Map credentials to container environment variables&lt;/li&gt;
&lt;li&gt;&lt;code&gt;GetCommand()&lt;/code&gt; — Build the correct CLI invocation&lt;/li&gt;
&lt;li&gt;&lt;code&gt;Provision()&lt;/code&gt; — Harness-specific setup during agent creation&lt;/li&gt;
&lt;li&gt;&lt;code&gt;PropagateFiles()&lt;/code&gt; — Copy config files into agent home&lt;/li&gt;
&lt;li&gt;&lt;code&gt;GetVolumes()&lt;/code&gt; — Define volume mounts&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="supported-harnesses"&gt;Supported Harnesses&lt;a class="anchor" href="#supported-harnesses"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Harness&lt;/th&gt;
 &lt;th&gt;Target Tool&lt;/th&gt;
 &lt;th&gt;Notes&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;code&gt;gemini&lt;/code&gt;&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;gemini-cli&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Default harness. API key / OAuth / Vertex AI auth&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;code&gt;claude&lt;/code&gt;&lt;/td&gt;
 &lt;td&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/td&gt;
 &lt;td&gt;Anthropic API key / Vertex AI auth. See &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; for full capabilities.&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;code&gt;opencode&lt;/code&gt;&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;opencode&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Experimental. No hook support&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;code&gt;codex&lt;/code&gt;&lt;/td&gt;
 &lt;td&gt;&lt;strong&gt;codex&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Runs &lt;code&gt;--full-auto&lt;/code&gt; by default&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;code&gt;generic&lt;/code&gt;&lt;/td&gt;
 &lt;td&gt;Any CLI tool&lt;/td&gt;
 &lt;td&gt;Fallback adapter&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id="capability-matrix"&gt;Capability Matrix&lt;a class="anchor" href="#capability-matrix"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Capability&lt;/th&gt;
 &lt;th&gt;Gemini&lt;/th&gt;
 &lt;th&gt;Claude&lt;/th&gt;
 &lt;th&gt;OpenCode&lt;/th&gt;
 &lt;th&gt;Codex&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Resume&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Resume with Prompt&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;❌&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Hooks&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;❌&lt;/td&gt;
 &lt;td&gt;❌&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;OpenTelemetry&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;❌&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;System Prompt Override&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;✅&lt;/td&gt;
 &lt;td&gt;❌&lt;/td&gt;
 &lt;td&gt;❌&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id="extensibility"&gt;Extensibility&lt;a class="anchor" href="#extensibility"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;New harnesses can be added via the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/plugin-system/"&gt;plugin-system&lt;/a&gt; (hashicorp/go-plugin over gRPC) without modifying the core codebase.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/hub/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/hub/</guid><description>&lt;h1 id="hub"&gt;Hub&lt;a class="anchor" href="#hub"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, the Hub is the central control plane of the hosted (distributed) architecture. It coordinates state across multiple users, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/grove/"&gt;grove&lt;/a&gt;s, and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/"&gt;runtime-broker&lt;/a&gt;s.&lt;/p&gt;
&lt;h2 id="responsibilities"&gt;Responsibilities&lt;a class="anchor" href="#responsibilities"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Identity &amp;amp; Auth&lt;/strong&gt;: Manages user identities via OAuth, issues tokens for brokers and agents&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;State Persistence&lt;/strong&gt;: Stores definitive state of agents, groves, and templates in a central database (SQLite)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Orchestration&lt;/strong&gt;: Dispatches agent lifecycle commands to appropriate &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/"&gt;runtime-broker&lt;/a&gt;s&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Collaboration&lt;/strong&gt;: Provides shared view via Web Dashboard and Hub API&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="communication-with-brokers"&gt;Communication with Brokers&lt;a class="anchor" href="#communication-with-brokers"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Direct HTTP&lt;/strong&gt;: When broker has a reachable endpoint&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Control Channel (WebSocket tunnel)&lt;/strong&gt;: When broker is behind NAT/firewall&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="agent-creation-flow-hosted"&gt;Agent Creation Flow (Hosted)&lt;a class="anchor" href="#agent-creation-flow-hosted"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;CLI syncs grove with Hub&lt;/li&gt;
&lt;li&gt;&lt;code&gt;POST /api/v1/groves/{groveId}/agents&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;Hub selects a &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/"&gt;runtime-broker&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Merges environment variables and secrets from all scopes (user → grove → broker)&lt;/li&gt;
&lt;li&gt;Resolves &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/template/"&gt;template&lt;/a&gt; with content hash for broker-side caching&lt;/li&gt;
&lt;li&gt;Dispatches to broker&lt;/li&gt;
&lt;li&gt;Broker provisions and starts &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Status reported back via heartbeats&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/"&gt;runtime-broker&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/grove/"&gt;grove&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/kiro-powers/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/kiro-powers/</guid><description>&lt;h1 id="kiro-powers"&gt;Kiro Powers&lt;a class="anchor" href="#kiro-powers"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Specialized packages that enhance existing &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt; agents with prebuilt expertise for specific development tasks.&lt;/p&gt;
&lt;h2 id="contents"&gt;Contents&lt;a class="anchor" href="#contents"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Curated MCP servers&lt;/li&gt;
&lt;li&gt;Steering files&lt;/li&gt;
&lt;li&gt;Hooks&lt;/li&gt;
&lt;li&gt;Can be dynamically loaded on demand&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="purpose"&gt;Purpose&lt;a class="anchor" href="#purpose"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Focus on providing domain-specific knowledge and best practices. Distinct from the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt; autonomous agent — Powers enhance agents with expertise, while the autonomous agent is the execution engine that works independently.&lt;/p&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/</guid><description>&lt;h1 id="llm-wiki-pattern"&gt;LLM Wiki Pattern&lt;a class="anchor" href="#llm-wiki-pattern"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;A pattern for building personal knowledge bases where the LLM incrementally builds and maintains a persistent, interlinked wiki from raw sources. Proposed by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/andrej-karpathy/"&gt;andrej-karpathy&lt;/a&gt;. This wiki is a running instance of this pattern.&lt;/p&gt;
&lt;h2 id="core-insight"&gt;Core Insight&lt;a class="anchor" href="#core-insight"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;strong&gt;Wiki &amp;gt; RAG.&lt;/strong&gt; RAG rediscovers knowledge from scratch on every query — no accumulation. An LLM-maintained wiki compiles knowledge once and keeps it current. Cross-references, contradictions, and synthesis compound with every source added.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/mcp-protocol/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/mcp-protocol/</guid><description>&lt;h1 id="mcp-model-context-protocol"&gt;MCP (Model Context Protocol)&lt;a class="anchor" href="#mcp-model-context-protocol"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;An open protocol that standardizes how applications provide context and tools to LLMs. Enables communication between AI agents and external services.&lt;/p&gt;
&lt;h2 id="how-claude-code-uses-mcp"&gt;How Claude Code Uses MCP&lt;a class="anchor" href="#how-claude-code-uses-mcp"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; uses MCP as its primary tool integration mechanism:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Connect to external data sources (Google Drive, Jira, Slack, custom tooling)&lt;/li&gt;
&lt;li&gt;Discover and install prebuilt plugins&lt;/li&gt;
&lt;li&gt;Create custom plugins&lt;/li&gt;
&lt;li&gt;Tool definitions deferred by default — only tool names consume context until Claude uses a specific tool (tool search)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="usage-across-the-ecosystem"&gt;Usage Across the Ecosystem&lt;a class="anchor" href="#usage-across-the-ecosystem"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/strong&gt;: Primary tool integration. Supports prebuilt and custom plugins.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;&lt;/strong&gt;: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/kiro-powers/"&gt;kiro-powers&lt;/a&gt; contain curated MCP servers as part of expertise packages.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;&lt;/strong&gt;: Not directly mentioned, but the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/plugin-system/"&gt;plugin-system&lt;/a&gt; (hashicorp/go-plugin over gRPC) serves a similar role.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="significance"&gt;Significance&lt;a class="anchor" href="#significance"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;MCP is emerging as a shared standard across the coding agent ecosystem. Both Anthropic (Claude Code) and AWS (Kiro) use it, making it a potential interoperability layer between different agent tools. The &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; (agentskills.io) is a complementary open standard: skills teach agents &lt;em&gt;how&lt;/em&gt; to do things; MCP connects agents to external &lt;em&gt;tools and data&lt;/em&gt;.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/</guid><description>&lt;h1 id="multi-agent-orchestration"&gt;Multi-Agent Orchestration&lt;a class="anchor" href="#multi-agent-orchestration"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;The practice of coordinating multiple LLM-based &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;s to work on tasks concurrently, with isolation, specialization, and collaboration.&lt;/p&gt;
&lt;h2 id="three-approaches-emerging"&gt;Three Approaches Emerging&lt;a class="anchor" href="#three-approaches-emerging"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;h3 id="infrastructure-first-scion"&gt;Infrastructure-first: Scion&lt;a class="anchor" href="#infrastructure-first-scion"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; positions itself as a &amp;ldquo;hypervisor for agents&amp;rdquo; — providing the infrastructure layer (containers, isolation, lifecycle management) while treating higher-level concerns as orthogonal. Harness-agnostic. Emphasizes human interaction as imperative.&lt;/p&gt;
&lt;h3 id="product-first-kiro-autonomous-agent"&gt;Product-first: Kiro Autonomous Agent&lt;a class="anchor" href="#product-first-kiro-autonomous-agent"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;&amp;rsquo;s autonomous agent is an opinionated product — a &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt; that handles the full stack from task intake to PR creation. Coordinates specialized sub-agents internally. Emphasizes autonomy and independence.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/plugin-system/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/plugin-system/</guid><description>&lt;h1 id="plugin-system"&gt;Plugin System&lt;a class="anchor" href="#plugin-system"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; supports a plugin architecture built on &lt;code&gt;hashicorp/go-plugin&lt;/code&gt; for extending system capabilities. Plugins communicate over gRPC.&lt;/p&gt;
&lt;h2 id="plugin-types"&gt;Plugin Types&lt;a class="anchor" href="#plugin-types"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Message Broker Plugins&lt;/strong&gt;: Custom message delivery backends for agent notifications and structured messaging&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Agent Harness Plugins&lt;/strong&gt;: Custom &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt; implementations that integrate new LLM tools without modifying the core codebase&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="status"&gt;Status&lt;a class="anchor" href="#status"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Currently in foundational stage, with reference implementations available for both plugin types.&lt;/p&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/prompt-engineering-patterns/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/prompt-engineering-patterns/</guid><description>&lt;h1 id="prompt-engineering-patterns"&gt;Prompt Engineering Patterns&lt;a class="anchor" href="#prompt-engineering-patterns"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Structured techniques for crafting prompts within agentic skills, drawn from research and the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; framework.&lt;/p&gt;
&lt;h2 id="chain-of-thought-cot"&gt;Chain of Thought (CoT)&lt;a class="anchor" href="#chain-of-thought-cot"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Structure prompts to encourage step-by-step reasoning. Break complex tasks into numbered steps. Wei et al. (2022).&lt;/p&gt;
&lt;h2 id="react-pattern"&gt;ReAct Pattern&lt;a class="anchor" href="#react-pattern"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Integrate reasoning and acting in a loop (Yao et al., 2023):&lt;/p&gt;
&lt;pre tabindex="0"&gt;&lt;code&gt;Thought: [reasoning about what to do]
Action: [specific action to take]
Observation: [result of the action]
... (repeat until task complete)&lt;/code&gt;&lt;/pre&gt;&lt;h2 id="self-reflection-reflexion"&gt;Self-Reflection (Reflexion)&lt;a class="anchor" href="#self-reflection-reflexion"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Enable agents to learn from mistakes (Shinn &amp;amp; Labash, 2023):&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/</guid><description>&lt;h1 id="runtime-broker"&gt;Runtime Broker&lt;a class="anchor" href="#runtime-broker"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, a Runtime Broker is a compute node that registers with a &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/hub/"&gt;hub&lt;/a&gt; to provide execution capacity for &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;s.&lt;/p&gt;
&lt;h2 id="responsibilities"&gt;Responsibilities&lt;a class="anchor" href="#responsibilities"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Manages local lifecycle of agents dispatched from the Hub&lt;/li&gt;
&lt;li&gt;Handles workspace synchronization&lt;/li&gt;
&lt;li&gt;Template hydration (with content hash caching)&lt;/li&gt;
&lt;li&gt;Log streaming&lt;/li&gt;
&lt;li&gt;Reports status back via heartbeats and agent status updates&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="communication"&gt;Communication&lt;a class="anchor" href="#communication"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Connects to the Hub via:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Direct HTTP&lt;/strong&gt; — when broker has a reachable endpoint&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;WebSocket Control Channel&lt;/strong&gt; — when behind NAT/firewall&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="examples-of-broker-nodes"&gt;Examples of Broker Nodes&lt;a class="anchor" href="#examples-of-broker-nodes"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A server, laptop, or Kubernetes cluster — any machine that can run containers.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime/</guid><description>&lt;h1 id="runtime"&gt;Runtime&lt;a class="anchor" href="#runtime"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, the Runtime is the infrastructure layer responsible for executing &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt; containers. Scion abstracts container execution behind a common interface.&lt;/p&gt;
&lt;h2 id="supported-runtimes"&gt;Supported Runtimes&lt;a class="anchor" href="#supported-runtimes"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Runtime&lt;/th&gt;
 &lt;th&gt;Platform&lt;/th&gt;
 &lt;th&gt;Notes&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;Docker&lt;/td&gt;
 &lt;td&gt;Linux / macOS / Windows&lt;/td&gt;
 &lt;td&gt;Default fallback. Supports remote Docker hosts&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Podman&lt;/td&gt;
 &lt;td&gt;Linux / macOS&lt;/td&gt;
 &lt;td&gt;Daemonless, rootless alternative&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Apple Container&lt;/td&gt;
 &lt;td&gt;macOS&lt;/td&gt;
 &lt;td&gt;Native Virtualization Framework, improved performance&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Kubernetes&lt;/td&gt;
 &lt;td&gt;Any (via kubeconfig)&lt;/td&gt;
 &lt;td&gt;Agents as Pods. Namespace isolation, resource specs, workspace sync via tar snapshots&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id="runtime-selection"&gt;Runtime Selection&lt;a class="anchor" href="#runtime-selection"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Resolved by &lt;code&gt;GetRuntime&lt;/code&gt; factory function:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/</guid><description>&lt;h1 id="skill-evaluation"&gt;Skill Evaluation&lt;a class="anchor" href="#skill-evaluation"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Moving from &amp;ldquo;it feels better&amp;rdquo; to &amp;ldquo;I have proof&amp;rdquo; when measuring AI agent skill quality.&lt;/p&gt;
&lt;h2 id="the-problem"&gt;The Problem&lt;a class="anchor" href="#the-problem"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;LLM agents are non-deterministic. Manual testing captures one sample from a distribution. &amp;ldquo;Vibes-based&amp;rdquo; evaluation misses regressions, false positives, and edge cases. As Karpathy noted: &amp;ldquo;The eval is often harder than the task itself.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="three-tier-framework"&gt;Three-Tier Framework&lt;a class="anchor" href="#three-tier-framework"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Tier&lt;/th&gt;
 &lt;th&gt;Method&lt;/th&gt;
 &lt;th&gt;Cost&lt;/th&gt;
 &lt;th&gt;Frequency&lt;/th&gt;
 &lt;th&gt;Catches&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;1&lt;/td&gt;
 &lt;td&gt;Deterministic graders&lt;/td&gt;
 &lt;td&gt;~$0&lt;/td&gt;
 &lt;td&gt;Every commit&lt;/td&gt;
 &lt;td&gt;Command execution, file existence, sequence, format&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;2&lt;/td&gt;
 &lt;td&gt;LLM-as-judge&lt;/td&gt;
 &lt;td&gt;$0.01–0.20/eval&lt;/td&gt;
 &lt;td&gt;PRs, nightly&lt;/td&gt;
 &lt;td&gt;Code quality, conventions, readability (rubric-based)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;3&lt;/td&gt;
 &lt;td&gt;Human review&lt;/td&gt;
 &lt;td&gt;$0.50–5.00/eval&lt;/td&gt;
 &lt;td&gt;Sparingly&lt;/td&gt;
 &lt;td&gt;Calibration, edge cases, high-stakes decisions&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;&amp;ldquo;The best eval is one that actually gets run.&amp;rdquo; — Anthropic&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/concepts/template/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/concepts/template/</guid><description>&lt;h1 id="template"&gt;Template&lt;a class="anchor" href="#template"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;In &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, a Template is a versioned blueprint for creating an &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;. It defines the base configuration, system prompt, tools, and initial state.&lt;/p&gt;
&lt;h2 id="contents"&gt;Contents&lt;a class="anchor" href="#contents"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;home/&lt;/code&gt; directory tree — copied into the agent&amp;rsquo;s home&lt;/li&gt;
&lt;li&gt;&lt;code&gt;scion-agent.json&lt;/code&gt; (or &lt;code&gt;.yaml&lt;/code&gt;) — specifies &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt; type, env vars, volumes, command args, model overrides, container image, resource requirements&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="inheritance"&gt;Inheritance&lt;a class="anchor" href="#inheritance"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Templates support inheritance via a &lt;code&gt;base&lt;/code&gt; field. Scion walks the chain and merges configurations bottom-up (base first, then overrides).&lt;/p&gt;
&lt;h2 id="scopes"&gt;Scopes&lt;a class="anchor" href="#scopes"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Project-level&lt;/strong&gt;: &lt;code&gt;.scion/templates/&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Global&lt;/strong&gt;: &lt;code&gt;~/.scion/templates/&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Hosted&lt;/strong&gt;: Can be scoped as global, grove, or user, with visibility controls (private, grove, public)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="management"&gt;Management&lt;a class="anchor" href="#management"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;pre tabindex="0"&gt;&lt;code&gt;scion templates create
scion templates clone
scion templates list
scion templates show
scion templates update-default&lt;/code&gt;&lt;/pre&gt;&lt;h2 id="defaults"&gt;Defaults&lt;a class="anchor" href="#defaults"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Scion ships default templates for each supported &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt;: &lt;code&gt;gemini&lt;/code&gt;, &lt;code&gt;claude&lt;/code&gt;, &lt;code&gt;opencode&lt;/code&gt;, &lt;code&gt;codex&lt;/code&gt;. Users can create custom templates for specialized roles (e.g., &amp;ldquo;Security Auditor&amp;rdquo;, &amp;ldquo;React Specialist&amp;rdquo;).&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/andrej-karpathy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/andrej-karpathy/</guid><description>&lt;h1 id="andrej-karpathy"&gt;Andrej Karpathy&lt;a class="anchor" href="#andrej-karpathy"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;AI researcher and educator. Former Director of AI at Tesla, founding member of OpenAI. Known for neural network education (cs231n, Zero to Hero series) and practical AI tooling.&lt;/p&gt;
&lt;h2 id="relevant-work"&gt;Relevant Work&lt;a class="anchor" href="#relevant-work"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/llm-wiki-karpathy/"&gt;llm-wiki-karpathy&lt;/a&gt; — The LLM Wiki pattern: using LLMs to incrementally build and maintain personal knowledge bases. The foundational idea behind this wiki.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/</guid><description>&lt;h1 id="anthropic"&gt;Anthropic&lt;a class="anchor" href="#anthropic"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;AI safety company. Builds the Claude family of models and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id="related-projects"&gt;Related Projects&lt;a class="anchor" href="#related-projects"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; — Agentic coding tool (terminal, IDE, web, Slack, GitHub)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/mcp-protocol/"&gt;mcp-protocol&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/aws/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/aws/</guid><description>&lt;h1 id="aws"&gt;AWS&lt;a class="anchor" href="#aws"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Amazon Web Services. Cloud computing platform by Amazon.&lt;/p&gt;
&lt;h2 id="related-projects"&gt;Related Projects&lt;a class="anchor" href="#related-projects"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt; — Agentic IDE with autonomous agent capabilities&lt;/li&gt;
&lt;li&gt;Defines the concept of &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt;s&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/bmad-method/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/bmad-method/</guid><description>&lt;h1 id="bmad-method"&gt;BMad Method&lt;a class="anchor" href="#bmad-method"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;AI-driven agile development framework with scale-adaptive intelligence. &amp;ldquo;Build More Architect Dreams.&amp;rdquo; MIT licensed.&lt;/p&gt;
&lt;p&gt;&amp;ldquo;Traditional AI tools do the thinking for you. BMad agents guide you through a structured process to bring out your best thinking.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="core-features"&gt;Core Features&lt;a class="anchor" href="#core-features"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;12+ specialized agent personas&lt;/strong&gt;: PM, Architect, Developer, UX, and more&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;34+ structured workflows&lt;/strong&gt;: Grounded in agile best practices&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Scale-Domain-Adaptive&lt;/strong&gt;: Adjusts planning depth from bug fixes to enterprise systems&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Party Mode&lt;/strong&gt;: Multiple agent personas collaborate in one session&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;bmad-help&lt;/strong&gt;: Context-aware guidance on what&amp;rsquo;s next&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Complete lifecycle&lt;/strong&gt;: Brainstorming → analysis → architecture → implementation → deployment&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="modular-ecosystem"&gt;Modular Ecosystem&lt;a class="anchor" href="#modular-ecosystem"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;Module&lt;/th&gt;
 &lt;th&gt;Purpose&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;BMM (core)&lt;/td&gt;
 &lt;td&gt;34+ workflows, 12+ agents&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;BMad Builder&lt;/td&gt;
 &lt;td&gt;Create custom agents and workflows&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Test Architect (TEA)&lt;/td&gt;
 &lt;td&gt;Risk-based test strategy&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Game Dev Studio&lt;/td&gt;
 &lt;td&gt;Unity, Unreal, Godot workflows&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;Creative Intelligence Suite&lt;/td&gt;
 &lt;td&gt;Innovation, brainstorming, design thinking&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id="in-the-ecosystem"&gt;In the Ecosystem&lt;a class="anchor" href="#in-the-ecosystem"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;BMad operates at the &lt;strong&gt;methodology layer&lt;/strong&gt; alongside &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt;, but with a different philosophy:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/</guid><description>&lt;h1 id="claude-code"&gt;Claude Code&lt;a class="anchor" href="#claude-code"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;An agentic coding tool by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt; that lives in your terminal, understands your codebase, and helps you code faster through natural language commands.&lt;/p&gt;
&lt;h2 id="architecture"&gt;Architecture&lt;a class="anchor" href="#architecture"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Claude Code is the &amp;ldquo;agentic harness&amp;rdquo; around Claude models. It provides tools, context management, and execution environment that turn a language model into a coding agent.&lt;/p&gt;
&lt;h3 id="the-agentic-loop"&gt;The Agentic Loop&lt;a class="anchor" href="#the-agentic-loop"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;Three phases: &lt;strong&gt;gather context → take action → verify results&lt;/strong&gt;, chained together with course-correction. Claude decides what each step requires based on what it learned from the previous step.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/</guid><description>&lt;h1 id="daniel-miessler"&gt;Daniel Miessler&lt;a class="anchor" href="#daniel-miessler"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Security researcher and AI practitioner. Creator of &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt; and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id="relevant-work"&gt;Relevant Work&lt;a class="anchor" href="#relevant-work"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt; — Open-source framework with 251+ curated AI prompt patterns&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt; — Personal AI Infrastructure built on &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;. Persistent memory, skills, goals, continuous learning.&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;The two projects are complementary: Fabric = patterns (what to ask AI). PAI = infrastructure (how the AI operates).&lt;/p&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/</guid><description>&lt;h1 id="fabric"&gt;Fabric&lt;a class="anchor" href="#fabric"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;An open-source framework (Go, MIT) for augmenting humans using AI. Created by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/"&gt;daniel-miessler&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Mission: &amp;ldquo;human flourishing via AI augmentation&amp;rdquo;&lt;/p&gt;
&lt;h2 id="core-concept-patterns"&gt;Core Concept: Patterns&lt;a class="anchor" href="#core-concept-patterns"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Patterns are curated, well-structured prompts organized by real-world task. 251+ patterns covering:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Content extraction (YouTube, podcasts, articles)&lt;/li&gt;
&lt;li&gt;Writing (essays, social media, documentation)&lt;/li&gt;
&lt;li&gt;Analysis (code, claims, debates, incidents, logs)&lt;/li&gt;
&lt;li&gt;Creation (art prompts, concept maps, changelogs)&lt;/li&gt;
&lt;li&gt;And many more&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Each pattern is a directory with a &lt;code&gt;system.md&lt;/code&gt; file. Markdown-based, clear instructions, System section focused.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/google-cloud-platform/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/google-cloud-platform/</guid><description>&lt;h1 id="google-cloud-platform"&gt;Google Cloud Platform&lt;a class="anchor" href="#google-cloud-platform"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Cloud computing platform by Google. Relevant to this wiki as the organization behind &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id="related-projects"&gt;Related Projects&lt;a class="anchor" href="#related-projects"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; — Multi-agent orchestration testbed (hosted on GCP&amp;rsquo;s GitHub)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/</guid><description>&lt;h1 id="kiro"&gt;Kiro&lt;a class="anchor" href="#kiro"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Kiro is an agentic IDE by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/aws/"&gt;aws&lt;/a&gt; for software development. It has three main surfaces:&lt;/p&gt;
&lt;h2 id="kiro-ide"&gt;Kiro IDE&lt;a class="anchor" href="#kiro-ide"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Interactive, synchronous collaboration on your local machine. Pair programming, suggestions, real-time code iteration. Spec-driven development.&lt;/p&gt;
&lt;h2 id="kiro-cli"&gt;Kiro CLI&lt;a class="anchor" href="#kiro-cli"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Custom agents as configuration files that customize Kiro&amp;rsquo;s behavior for specific workflows. Define tool access, permissions, and context. Pre-approve tools, reduce interruptions, optimize for specific tasks. Interactive, runs on your local machine.&lt;/p&gt;
&lt;h2 id="kiro-autonomous-agent"&gt;Kiro Autonomous Agent&lt;a class="anchor" href="#kiro-autonomous-agent"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt; that works asynchronously in the background on complex, multi-step development tasks. Key capabilities:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/notebooklm/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/notebooklm/</guid><description>&lt;h1 id="notebooklm"&gt;NotebookLM&lt;a class="anchor" href="#notebooklm"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;A Gemini-powered writing and research tool from Google Labs. Co-founded by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/steven-johnson/"&gt;steven-johnson&lt;/a&gt;. Designed for AI-assisted knowledge work grounded in user-curated sources.&lt;/p&gt;
&lt;h2 id="core-design"&gt;Core Design&lt;a class="anchor" href="#core-design"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Users upload sources (documents, articles, notes) — the AI only reasons over those sources, not the open web&lt;/li&gt;
&lt;li&gt;Two note types: Written Notes (user-authored, editable) and Saved Responses (AI-generated, immutable)&lt;/li&gt;
&lt;li&gt;Provenance tracking built in: clear separation between human-written and AI-generated content&lt;/li&gt;
&lt;li&gt;Source-integrated reading with AI actions (summarize, explain, find related ideas)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="relationship-to-the-wiki-landscape"&gt;Relationship to the Wiki Landscape&lt;a class="anchor" href="#relationship-to-the-wiki-landscape"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;NotebookLM occupies a different point in the AI knowledge management space than the tools tracked elsewhere in this wiki:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/</guid><description>&lt;h1 id="pai-personal-ai-infrastructure"&gt;PAI (Personal AI Infrastructure)&lt;a class="anchor" href="#pai-personal-ai-infrastructure"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;An open-source personalized AI platform by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/"&gt;daniel-miessler&lt;/a&gt;, built natively on &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;. Turns Claude Code from a stateless tool into a persistent assistant that knows your goals, preferences, and history.&lt;/p&gt;
&lt;p&gt;Mission: &amp;ldquo;AI should magnify everyone — not just the top 1%.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="three-levels-of-ai"&gt;Three Levels of AI&lt;a class="anchor" href="#three-levels-of-ai"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Chatbots&lt;/strong&gt;: Ask → Answer → Forget&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Agentic Platforms&lt;/strong&gt;: Ask → Use tools → Get result&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;PAI&lt;/strong&gt;: Observe → Think → Plan → Execute → Verify → &lt;strong&gt;Learn&lt;/strong&gt; → Improve&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;The key differentiator is the &lt;strong&gt;learn&lt;/strong&gt; step.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/</guid><description>&lt;h1 id="paperclip"&gt;Paperclip&lt;a class="anchor" href="#paperclip"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Open-source orchestration for zero-human companies. Node.js server + React UI. MIT licensed.&lt;/p&gt;
&lt;p&gt;&amp;ldquo;If OpenClaw is an employee, Paperclip is the company.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="what-it-does"&gt;What It Does&lt;a class="anchor" href="#what-it-does"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Orchestrates teams of AI agents into companies with:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Org charts&lt;/strong&gt;: Hierarchies, roles, reporting lines&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Goal alignment&lt;/strong&gt;: Every task traces to company mission&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Budgets&lt;/strong&gt;: Monthly per-agent, atomic enforcement&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Governance&lt;/strong&gt;: Approval gates, rollback, audit logs&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Heartbeats&lt;/strong&gt;: Scheduled agent wake cycles&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Ticket system&lt;/strong&gt;: Every conversation traced, every decision explained&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multi-company&lt;/strong&gt;: One deployment, many isolated companies&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="agent-support"&gt;Agent Support&lt;a class="anchor" href="#agent-support"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Agent-agnostic: Claude Code, Codex, Cursor, OpenClaw, Bash, HTTP. &amp;ldquo;If it can receive a heartbeat, it&amp;rsquo;s hired.&amp;rdquo;&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/promptfoo/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/promptfoo/</guid><description>&lt;h1 id="promptfoo"&gt;Promptfoo&lt;a class="anchor" href="#promptfoo"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Open-source CLI for LLM evaluation and red teaming. Now part of OpenAI. MIT licensed.&lt;/p&gt;
&lt;p&gt;YAML-based test cases, CI/CD integration, model comparison, red teaming. Runs locally — prompts never leave your machine. Powers apps serving 10M+ users.&lt;/p&gt;
&lt;p&gt;The closest existing tool to a turnkey skill eval pipeline.&lt;/p&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/"&gt;how-to-eval-a-skill&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/</guid><description>&lt;h1 id="scion"&gt;Scion&lt;a class="anchor" href="#scion"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Scion is an experimental multi-agent orchestration testbed by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/google-cloud-platform/"&gt;google-cloud-platform&lt;/a&gt;. It manages concurrent LLM-based &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;s running in &lt;strong&gt;container&lt;/strong&gt;s across local machines and remote clusters.&lt;/p&gt;
&lt;h2 id="what-it-is"&gt;What It Is&lt;a class="anchor" href="#what-it-is"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A &amp;ldquo;hypervisor for agents&amp;rdquo; — infrastructure for running, isolating, and managing LLM agent processes. It is explicitly &lt;em&gt;not&lt;/em&gt; a full multi-agent framework. Components like agent memory, chatrooms, and task management are treated as orthogonal concerns.&lt;/p&gt;
&lt;h2 id="architecture"&gt;Architecture&lt;a class="anchor" href="#architecture"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Scion follows a Manager-Worker pattern:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;CLI&lt;/strong&gt; (&lt;code&gt;scion&lt;/code&gt;): Host-side orchestrator managing agent lifecycle and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/grove/"&gt;grove&lt;/a&gt;s&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Agents&lt;/strong&gt;: Isolated containers running LLM software via &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt; adapters&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;Two operating modes:&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/</guid><description>&lt;h1 id="spec-kit"&gt;Spec Kit&lt;a class="anchor" href="#spec-kit"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Open-source toolkit by GitHub for Spec-Driven Development. CLI (&lt;code&gt;specify&lt;/code&gt;) + slash commands across 30+ AI agents. MIT licensed.&lt;/p&gt;
&lt;p&gt;&amp;ldquo;Build high-quality software faster. Focus on product scenarios and predictable outcomes instead of vibe coding.&amp;rdquo;&lt;/p&gt;
&lt;h2 id="core-workflow"&gt;Core Workflow&lt;a class="anchor" href="#core-workflow"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;pre tabindex="0"&gt;&lt;code&gt;/speckit.constitution → Establish project principles
/speckit.specify → Define requirements (the &amp;#34;what&amp;#34; and &amp;#34;why&amp;#34;)
/speckit.clarify → Structured questioning to reduce rework
/speckit.plan → Technical implementation plan (the &amp;#34;how&amp;#34;)
/speckit.tasks → Actionable task breakdown with dependencies
/speckit.implement → Execute all tasks&lt;/code&gt;&lt;/pre&gt;&lt;p&gt;Optional: &lt;code&gt;/speckit.analyze&lt;/code&gt; (consistency check), &lt;code&gt;/speckit.checklist&lt;/code&gt; (quality validation)&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/entities/steven-johnson/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/entities/steven-johnson/</guid><description>&lt;h1 id="steven-johnson"&gt;Steven Johnson&lt;a class="anchor" href="#steven-johnson"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;Editorial Director and Co-Founder of &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/notebooklm/"&gt;notebooklm&lt;/a&gt;. Author of 14 books (latest: &lt;em&gt;The Infernal Machine&lt;/em&gt;). Writes how-to guides and thought pieces on using NotebookLM for research and writing workflows.&lt;/p&gt;
&lt;h2 id="role-in-the-wiki"&gt;Role in the Wiki&lt;a class="anchor" href="#role-in-the-wiki"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Johnson is the primary voice explaining NotebookLM&amp;rsquo;s design philosophy — particularly the &amp;ldquo;pin → organize → structure&amp;rdquo; workflow and the deliberate separation of human-written vs. AI-generated notes for provenance tracking.&lt;/p&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/notebooklm/"&gt;notebooklm&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/notebooklm-notes-guide/"&gt;notebooklm-notes-guide&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/ai-technique-podcast/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/ai-technique-podcast/</guid><description>&lt;h1 id="straight-talk-ai-tips-for-amazonians-podcast-notes"&gt;strAIght talk: AI Tips for Amazonians (Podcast Notes)&lt;a class="anchor" href="#straight-talk-ai-tips-for-amazonians-podcast-notes"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://podcasts.apple.com/ca/podcast/straight-talk-ai-tips-for-amazonians/id1881350943"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/human/ai-technique-podcast.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Notes from the &amp;ldquo;strAIght talk&amp;rdquo; podcast — real Amazon employees sharing tactics they actually use with AI at work. Execution-focused, no theory. The core thesis: AI isn&amp;rsquo;t just a tool, it&amp;rsquo;s a workflow replacement layer.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;One prompt can replace entire workflows&lt;/strong&gt;: People are collapsing tools (Slack, email, docs, planning) into a single AI interface. Tasks that took hours → minutes. The shift: from tool-driven work to prompt-driven work where AI orchestrates everything.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;The &amp;ldquo;daily prompt&amp;rdquo; is the real leverage&lt;/strong&gt;: High performers build a repeatable daily prompt anchored around priorities, constraints, and context. Acts like &amp;ldquo;a lightweight operating system for your day.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Context beats clever prompting&lt;/strong&gt;: Best results come not from smarter prompts but from feeding AI your role, goals, and constraints. Maintaining a &amp;ldquo;context document&amp;rdquo; is a recurring technique across episodes.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;AI as thinking partner, not just executor&lt;/strong&gt;: Ask AI questions, let AI ask &lt;em&gt;you&lt;/em&gt; questions back. This is where the real leverage happens — better decisions, not just faster output.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Treat prompts like code&lt;/strong&gt;: Version them, refine them, reuse them. One guest needed ~20 iterations to get a reliable research framework.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="three-core-moves"&gt;Three Core Moves&lt;a class="anchor" href="#three-core-moves"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Build a personal AI operating loop&lt;/strong&gt;: Morning (what matters today?) → During (execute + refine) → End (summarize + improve)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Stop using AI like search&lt;/strong&gt;: Delegate thinking loops to AI, not just tasks&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Treat prompts like code&lt;/strong&gt;: Version, refine, reuse&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/strong&gt;: PAI&amp;rsquo;s TELOS system (MISSION.md, GOALS.md, etc.) is exactly the &amp;ldquo;context document&amp;rdquo; technique this podcast describes — persistent context about who you are and what you&amp;rsquo;re working toward. PAI&amp;rsquo;s &amp;ldquo;observe → think → plan → execute → verify → learn → improve&amp;rdquo; loop mirrors the &amp;ldquo;personal AI operating loop.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/strong&gt;: Fabric&amp;rsquo;s Patterns are the &amp;ldquo;treat prompts like code&amp;rdquo; idea at scale — 251+ versioned, reusable prompts organized by task. The podcast&amp;rsquo;s advice to version and refine prompts is what Fabric systematizes.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&lt;/strong&gt;: The wiki pattern is a form of the &amp;ldquo;context document&amp;rdquo; technique — persistent, compounding context that the AI reads at the start of every interaction.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/strong&gt;: The &amp;ldquo;context beats clever prompting&amp;rdquo; insight validates the progressive disclosure approach — what matters is getting the right context loaded, not crafting the perfect prompt.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/prompt-engineering-patterns/"&gt;prompt-engineering-patterns&lt;/a&gt;&lt;/strong&gt;: The &amp;ldquo;daily prompt&amp;rdquo; template is a concrete instance of the structured prompting patterns (system message with role, constraints, success criteria).&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/prompt-engineering-patterns/"&gt;prompt-engineering-patterns&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/anthropic-eval-guide/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/anthropic-eval-guide/</guid><description>&lt;h1 id="anthropic-define-success-criteria-and-build-evaluations"&gt;Anthropic: Define Success Criteria and Build Evaluations&lt;a class="anchor" href="#anthropic-define-success-criteria-and-build-evaluations"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://docs.anthropic.com/en/docs/build-with-claude/develop-tests"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/anthropic-eval-guide.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;&amp;rsquo;s canonical guide to building evaluations for LLM applications. Establishes the methodology that the Caparas article and the broader eval ecosystem builds on. Covers success criteria design, eval types (exact match, cosine similarity, LLM-graded), and the principle that automated volume beats hand-graded quality.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Success criteria must be SMART&lt;/strong&gt;: Specific, Measurable, Achievable, Relevant. &amp;ldquo;The model should classify sentiments well&amp;rdquo; is bad. &amp;ldquo;F1 score of at least 0.85 on 10,000 diverse tweets&amp;rdquo; is good.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Common criteria dimensions&lt;/strong&gt;: Task fidelity, consistency, relevance/coherence, tone/style, privacy preservation, context utilization, latency, price. Most use cases need multidimensional evaluation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Three eval design principles&lt;/strong&gt;: (1) Be task-specific — mirror real-world distribution including edge cases. (2) Automate when possible — structure for automated grading. (3) Prioritize volume over quality — more questions with automated grading beats fewer with human grading.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Eval types by complexity&lt;/strong&gt;:
&lt;ul&gt;
&lt;li&gt;Exact match: Binary correct/incorrect. Best for categorical tasks.&lt;/li&gt;
&lt;li&gt;Cosine similarity: Semantic similarity between embeddings. Best for consistency testing.&lt;/li&gt;
&lt;li&gt;LLM-as-judge: Use a model to grade another model&amp;rsquo;s output against a rubric.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Even &amp;ldquo;hazy&amp;rdquo; topics can be quantified&lt;/strong&gt;: Ethics and safety can be measured — e.g., &amp;ldquo;less than 0.1% of outputs flagged for toxicity out of 10,000 trials.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Edge cases matter&lt;/strong&gt;: Irrelevant input, overly long input, harmful user input, ambiguous cases where even humans disagree.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&lt;/strong&gt;: This guide provides the foundational methodology. The three-tier framework (deterministic → LLM-judge → human) from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/"&gt;evaluating-agent-skills-caparas&lt;/a&gt; is a direct application of these principles to skills specifically.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/"&gt;how-to-eval-a-skill&lt;/a&gt;&lt;/strong&gt;: Our practical guide extends Anthropic&amp;rsquo;s prompt eval methodology to the five surfaces of skill evaluation (routing, tool selection, process, side effects, output quality).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt;&lt;/strong&gt;: Pillar 7 (Testing and Validation) is operationalized by this guide&amp;rsquo;s methodology.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/"&gt;evaluating-agent-skills-caparas&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/"&gt;how-to-eval-a-skill&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/anthropic-skills-repo/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/anthropic-skills-repo/</guid><description>&lt;h1 id="anthropic-skills-repository--agent-skills-spec"&gt;Anthropic Skills Repository &amp;amp; Agent Skills Spec&lt;a class="anchor" href="#anthropic-skills-repository--agent-skills-spec"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/anthropics/skills"&gt;Original&lt;/a&gt; | &lt;a href="https://agentskills.io/specification"&gt;Spec&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/anthropic-skills-repo.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;&amp;rsquo;s official skills repository for Claude, plus the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; specification. Skills are folders of instructions, scripts, and resources that Claude loads dynamically. The repo contains 17 skills across creative, technical, enterprise, and document categories — including the production document skills that power Claude.ai&amp;rsquo;s file capabilities.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Agent Skills is an open standard&lt;/strong&gt;: Defined at agentskills.io, not Anthropic-proprietary. Designed for cross-tool compatibility. &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; extends it with invocation control, subagent execution, and dynamic context injection.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Progressive disclosure is the core design principle&lt;/strong&gt;: Metadata (~100 tokens) loaded at startup for all skills. Full instructions (&amp;lt;5000 tokens recommended) loaded only when activated. Resources loaded only when needed. This is how skills scale without blowing up context.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;SKILL.md is the only required file&lt;/strong&gt;: YAML frontmatter (name + description) + markdown instructions. Optional: scripts/, references/, assets/ directories. Keep under 500 lines.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Document skills are production code&lt;/strong&gt;: The docx, pdf, pptx, xlsx skills power Claude.ai&amp;rsquo;s document capabilities. Source-available (not open source) — shared as reference for complex skill patterns.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Plugin marketplace for Claude Code&lt;/strong&gt;: &lt;code&gt;/plugin marketplace add anthropics/skills&lt;/code&gt; registers the repo. Skills installable individually.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Partner ecosystem emerging&lt;/strong&gt;: Notion has published skills for Claude. The &lt;code&gt;allowed-tools&lt;/code&gt; field (experimental) hints at future tool-level permission control per skill.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/mcp-protocol/"&gt;mcp-protocol&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/bmad-method/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/bmad-method/</guid><description>&lt;h1 id="bmad-method-ai-driven-agile-development"&gt;BMad Method: AI-Driven Agile Development&lt;a class="anchor" href="#bmad-method-ai-driven-agile-development"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/bmad-code-org/BMAD-METHOD"&gt;Original&lt;/a&gt; | &lt;a href="https://docs.bmad-method.org"&gt;Docs&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/bmad-method-readme.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/bmad-method/"&gt;bmad-method&lt;/a&gt; is an AI-driven agile development framework with scale-adaptive intelligence — it adjusts planning depth from bug fixes to enterprise systems. Uses 12+ specialized agent personas (PM, Architect, Developer, UX, etc.) with structured workflows grounded in agile best practices. Modular ecosystem with official modules for testing, game dev, creative intelligence, and custom agent building. MIT licensed.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Agents as expert collaborators, not autopilots&lt;/strong&gt;: &amp;ldquo;Traditional AI tools do the thinking for you, producing average results. BMad agents act as expert collaborators who guide you through a structured process to bring out your best thinking.&amp;rdquo; This is a fundamentally different philosophy from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;&amp;rsquo;s autonomous frontier agents.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Scale-Domain-Adaptive&lt;/strong&gt;: Automatically adjusts planning depth based on project complexity. A bug fix doesn&amp;rsquo;t need the same ceremony as an enterprise system. This addresses the &amp;ldquo;sometimes agents aren&amp;rsquo;t needed&amp;rdquo; finding from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;12+ specialized agent personas&lt;/strong&gt;: PM, Architect, Developer, UX, and more. Each is a domain expert with a specific role. Related to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/skills-pipeline-sleestk/"&gt;skills-pipeline-sleestk&lt;/a&gt;&amp;rsquo;s persona-driven skills and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&amp;rsquo;s agent personalities.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Party Mode&lt;/strong&gt;: Bring multiple agent personas into one session to collaborate and discuss. A novel approach to multi-agent interaction — not parallel execution (like &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;) but collaborative dialogue within a single session.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Modular ecosystem&lt;/strong&gt;: Core framework (BMM, 34+ workflows) + official modules: BMad Builder (create custom agents), Test Architect (risk-based testing), Game Dev Studio, Creative Intelligence Suite. Extensible like &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; but at the methodology level.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;bmad-help skill&lt;/strong&gt;: Invoke anytime for guidance on what&amp;rsquo;s next. Context-aware — knows your project state and installed modules. This is the &amp;ldquo;AI as thinking partner&amp;rdquo; pattern from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ai-technique-podcast/"&gt;ai-technique-podcast&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Complete lifecycle&lt;/strong&gt;: Brainstorming → analysis → architecture → implementation → deployment. Broader than &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt;&amp;rsquo;s spec-focused workflow.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Structured workflows over prompts&lt;/strong&gt;: 34+ workflows grounded in agile best practices. Not just prompt templates — structured processes with phases, checkpoints, and handoffs.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="comparison-with-spec-kit"&gt;Comparison with Spec Kit&lt;a class="anchor" href="#comparison-with-spec-kit"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;table&gt;
 &lt;thead&gt;
 &lt;tr&gt;
 &lt;th&gt;&lt;/th&gt;
 &lt;th&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt;&lt;/th&gt;
 &lt;th&gt;BMad Method&lt;/th&gt;
 &lt;/tr&gt;
 &lt;/thead&gt;
 &lt;tbody&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Focus&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Spec-driven (specs as primary artifact)&lt;/td&gt;
 &lt;td&gt;Agile-driven (structured workflows as primary process)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Agents&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Agent-agnostic (30+ supported)&lt;/td&gt;
 &lt;td&gt;12+ specialized personas built in&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Scaling&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Same workflow for all projects&lt;/td&gt;
 &lt;td&gt;Scale-adaptive (adjusts depth to complexity)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Collaboration&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;Single agent executes steps&lt;/td&gt;
 &lt;td&gt;Party Mode (multi-persona dialogue)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Extensions&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;50+ community extensions&lt;/td&gt;
 &lt;td&gt;Official modules (testing, game dev, creative)&lt;/td&gt;
 &lt;/tr&gt;
 &lt;tr&gt;
 &lt;td&gt;&lt;strong&gt;Philosophy&lt;/strong&gt;&lt;/td&gt;
 &lt;td&gt;&amp;ldquo;Specs before code&amp;rdquo;&lt;/td&gt;
 &lt;td&gt;&amp;ldquo;Expert collaboration over autopilot&amp;rdquo;&lt;/td&gt;
 &lt;/tr&gt;
 &lt;/tbody&gt;
&lt;/table&gt;
&lt;p&gt;Both operate at the methodology layer. Spec Kit is more prescriptive (seven fixed steps). BMad is more adaptive (adjusts to project scale).&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/claude-code-docs/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/claude-code-docs/</guid><description>&lt;h1 id="claude-code-documentation"&gt;Claude Code Documentation&lt;a class="anchor" href="#claude-code-documentation"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://code.claude.com/docs/en"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/anthropics/claude-code"&gt;GitHub&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/claude-code-docs.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Comprehensive documentation for &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;&amp;rsquo;s agentic coding tool. Terminal-native CLI that understands your codebase, edits files, runs commands, and handles git workflows through natural language. Deeply extensible via &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/mcp-protocol/"&gt;mcp-protocol&lt;/a&gt;, plugins, skills, hooks, and custom subagents.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Agentic loop architecture&lt;/strong&gt;: Three phases — gather context, take action, verify results — chained together with course-correction. Claude Code is the &amp;ldquo;agentic harness&amp;rdquo; that provides tools, context management, and execution environment around the Claude model.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Five tool categories&lt;/strong&gt;: File operations, search, execution (shell), web, and code intelligence. This is the foundation; everything else extends it.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Dual memory system&lt;/strong&gt;: CLAUDE.md (human-written instructions, scoped to org/project/user/local) + auto memory (Claude-written learnings, per working tree). Both loaded at session start. Target under 200 lines per CLAUDE.md.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Six permission modes&lt;/strong&gt;: From &lt;code&gt;default&lt;/code&gt; (reads only) to &lt;code&gt;bypassPermissions&lt;/code&gt; (everything). &lt;code&gt;auto&lt;/code&gt; mode uses a separate classifier model to review actions — a research preview. This is the trust/autonomy dial.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Subagents with isolation&lt;/strong&gt;: Each subagent gets its own context window, system prompt, tool access, and permissions. Built-in: Explore (Haiku, read-only), Plan (research), General-purpose (all tools). Custom subagents via markdown files. Cannot spawn sub-subagents.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Skills as open standard&lt;/strong&gt;: Follows Agent Skills (agentskills.io). SKILL.md files with YAML frontmatter. Bundled skills include &lt;code&gt;/batch&lt;/code&gt; (parallel changes across codebase in git worktrees), &lt;code&gt;/simplify&lt;/code&gt;, &lt;code&gt;/loop&lt;/code&gt;, &lt;code&gt;/debug&lt;/code&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;25+ hook events&lt;/strong&gt;: Shell commands, HTTP endpoints, or LLM prompts at lifecycle points. Can block actions (PreToolUse deny), inject context, automate workflows. Extremely granular.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Three execution environments&lt;/strong&gt;: Local (your machine), Cloud (Anthropic VMs), Remote Control (local machine, browser UI).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Session portability&lt;/strong&gt;: Move between terminal, desktop, web, mobile, Slack. Same CLAUDE.md and MCP servers everywhere. Resume, fork, parallel sessions via git worktrees.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;AGENTS.md compatibility&lt;/strong&gt;: Claude Code reads CLAUDE.md, not AGENTS.md, but can import AGENTS.md for cross-tool compatibility.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/anthropic/"&gt;anthropic&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/mcp-protocol/"&gt;mcp-protocol&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/</guid><description>&lt;h1 id="how-to-evaluate-ai-agent-skills-without-relying-on-vibes"&gt;How to Evaluate AI Agent Skills Without Relying on Vibes&lt;a class="anchor" href="#how-to-evaluate-ai-agent-skills-without-relying-on-vibes"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://pub.towardsai.net/how-to-evaluate-ai-agent-skills-without-relying-on-vibes-9a5764ad18c4"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/evaluating-agent-skills-caparas.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Author: JP Caparas (building on OpenAI&amp;rsquo;s &amp;ldquo;Testing Agent Skills Systematically with Evals&amp;rdquo;)&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A practical guide to moving from &amp;ldquo;it feels better&amp;rdquo; to &amp;ldquo;I have proof&amp;rdquo; when evaluating AI agent skills. Proposes a three-tier evaluation framework (deterministic → LLM-as-judge → human review) with concrete economics, and argues that the industry&amp;rsquo;s convergence on JSON Schema skill formats makes these principles platform-agnostic.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/fabric-github/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/fabric-github/</guid><description>&lt;h1 id="fabric-github-repository"&gt;Fabric GitHub Repository&lt;a class="anchor" href="#fabric-github-repository"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/danielmiessler/fabric"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/fabric-github.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Author: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/"&gt;daniel-miessler&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt; is an open-source framework (Go, MIT) for augmenting humans using AI. Its core contribution is &lt;strong&gt;Patterns&lt;/strong&gt; — 251+ curated, well-structured prompts organized by real-world task. Also implements composable &lt;strong&gt;prompt strategies&lt;/strong&gt; (CoT, ToT, Reflexion, etc.) as modifiers on top of patterns. Model-agnostic with 30+ provider integrations.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Patterns as the fundamental unit&lt;/strong&gt;: Fabric&amp;rsquo;s insight is that AI has a capabilities problem but an &lt;em&gt;integration&lt;/em&gt; problem. Patterns solve this by packaging prompts as reusable, discoverable, shareable units organized by task. 251+ patterns covering everything from YouTube extraction to academic paper summarization.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Pattern design principles&lt;/strong&gt;: Markdown for readability, extremely clear instructions, System section almost exclusively. Each pattern is a directory with a &lt;code&gt;system.md&lt;/code&gt; file.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Composable strategies&lt;/strong&gt;: Nine prompt strategies (CoT, CoD, ToT, AoT, LtM, self-consistent, self-refine, reflexion, standard) applied as modifiers on top of any pattern. Stored as JSON. This separates &lt;em&gt;what&lt;/em&gt; to do (pattern) from &lt;em&gt;how&lt;/em&gt; to reason (strategy).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;CLI-first, Unix philosophy&lt;/strong&gt;: Pipe input, compose with other tools. &lt;code&gt;echo &amp;quot;input&amp;quot; | fabric --strategy cot -p analyze_code&lt;/code&gt;. Shell aliases turn each pattern into a command.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Model-agnostic&lt;/strong&gt;: 30+ providers. Per-pattern model mapping via environment variables.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Obsidian integration&lt;/strong&gt;: Save output as dated markdown files — directly relevant to the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Community-driven&lt;/strong&gt;: Open source pattern library. The &amp;ldquo;wisdom of crowds&amp;rdquo; approach to prompt curation.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Patterns are conceptually similar to skills in the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; — both are curated prompt packages organized by task. But patterns are simpler (just a system.md) while skills support scripts, references, assets, and progressive disclosure.&lt;/li&gt;
&lt;li&gt;Fabric&amp;rsquo;s strategies map directly to the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/prompt-engineering-patterns/"&gt;prompt-engineering-patterns&lt;/a&gt; described in academic literature and the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; framework (Pillar 5).&lt;/li&gt;
&lt;li&gt;Fabric is model-agnostic like &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; is harness-agnostic — both abstract over the underlying AI provider.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/"&gt;daniel-miessler&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/prompt-engineering-patterns/"&gt;prompt-engineering-patterns&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/kiro-autonomous-agent/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/kiro-autonomous-agent/</guid><description>&lt;h1 id="kiro-autonomous-agent-page"&gt;Kiro Autonomous Agent Page&lt;a class="anchor" href="#kiro-autonomous-agent-page"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://kiro.dev/autonomous-agent/"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/kiro-autonomous-agent.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Product page and FAQ for &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;&amp;rsquo;s autonomous agent feature — a &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt; that works independently on development tasks in the background. It takes high-level task descriptions, plans implementation, writes code across multiple repositories, runs tests, and creates pull requests, all asynchronously without requiring an active session.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Asynchronous by design&lt;/strong&gt;: Unlike the Kiro IDE (interactive pair programming) and Kiro CLI (local custom agents), the autonomous agent runs in isolated sandbox environments in the background. You assign tasks from kiro.dev or GitHub.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multi-repo coordination&lt;/strong&gt;: Can plan a change once and create coordinated edits and PRs across multiple repositories — not just one repo at a time.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Learns from feedback&lt;/strong&gt;: Maintains persistent context across tasks, repos, and PRs. Uses code review feedback to shape future changes. This is a key differentiator from stateless agent approaches.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Never auto-merges&lt;/strong&gt;: Always creates PRs for human review. Safety-first approach.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Sub-agent coordination&lt;/strong&gt;: Coordinates specialized sub-agents to complete complex development work.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Team features&lt;/strong&gt;: Integrates with Jira, Confluence, GitLab, GitHub, Teams, Slack. Handles routine fixes and follow-ups to protect engineer focus time.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&amp;ldquo;Frontier agent&amp;rdquo; branding&lt;/strong&gt;: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/aws/"&gt;aws&lt;/a&gt; positions this as a new class of agent — autonomous, massively scalable, works independently for hours or days.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Preview status&lt;/strong&gt;: Rolling out to Pro/Pro+/Power users. Free during preview with weekly limits.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="scope"&gt;Scope&lt;a class="anchor" href="#scope"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Product marketing page with FAQ. No deep technical architecture details — this is the public-facing description of capabilities and positioning.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/llm-wiki-karpathy/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/llm-wiki-karpathy/</guid><description>&lt;h1 id="llm-wiki-karpathy"&gt;LLM Wiki (Karpathy)&lt;a class="anchor" href="#llm-wiki-karpathy"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/llm-wiki-karpathy.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Author: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/andrej-karpathy/"&gt;andrej-karpathy&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The foundational idea file for the LLM Wiki pattern — the methodology this entire wiki is built on. Proposes that instead of RAG (re-deriving knowledge on every query), LLMs should incrementally build and maintain a persistent, interlinked wiki from raw sources. The wiki is a compounding artifact: cross-references already exist, contradictions are flagged, synthesis reflects everything ingested.&lt;/p&gt;
&lt;h2 id="key-ideas"&gt;Key Ideas&lt;a class="anchor" href="#key-ideas"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Wiki &amp;gt; RAG&lt;/strong&gt;: RAG rediscovers knowledge from scratch every query. A wiki compiles knowledge once and keeps it current. The synthesis compounds.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Three-layer architecture&lt;/strong&gt;: Raw sources (immutable) → Wiki (LLM-maintained markdown) → Schema (CLAUDE.md/AGENTS.md defining conventions and workflows).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Three operations&lt;/strong&gt;: Ingest (process source → update 10-15 pages), Query (search index → synthesize answer → optionally file back), Lint (health-check for contradictions, orphans, gaps).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Index + Log&lt;/strong&gt;: index.md is content-oriented (catalog for navigation), log.md is chronological (append-only timeline). Index-first navigation works at moderate scale without embedding infrastructure.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Human role vs. LLM role&lt;/strong&gt;: Human curates sources, directs analysis, asks questions, thinks about meaning. LLM does everything else — summarizing, cross-referencing, filing, bookkeeping.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Why wikis fail and this doesn&amp;rsquo;t&lt;/strong&gt;: Humans abandon wikis because maintenance burden grows faster than value. LLMs don&amp;rsquo;t get bored, don&amp;rsquo;t forget cross-references, can touch 15 files in one pass. Maintenance cost ≈ zero.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Memex lineage&lt;/strong&gt;: Related to Vannevar Bush&amp;rsquo;s Memex (1945) — personal knowledge store with associative trails. Bush couldn&amp;rsquo;t solve who does the maintenance. The LLM handles that.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="meta-this-wikis-relationship-to-this-source"&gt;Meta: This Wiki&amp;rsquo;s Relationship to This Source&lt;a class="anchor" href="#meta-this-wikis-relationship-to-this-source"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;This wiki is a direct instantiation of Karpathy&amp;rsquo;s LLM Wiki pattern. Our CLAUDE.md schema implements the three-layer architecture. Our ingest/query/lint workflows follow the operations described. Our index.md and log.md serve exactly the roles specified. The pattern is the blueprint; this wiki is a running instance.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/notebooklm-notes-guide/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/notebooklm-notes-guide/</guid><description>&lt;h1 id="getting-the-most-out-of-notes-in-notebooklm"&gt;Getting The Most Out Of Notes In NotebookLM&lt;a class="anchor" href="#getting-the-most-out-of-notes-in-notebooklm"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://medium.com/@stevenbjohnson/getting-the-most-out-of-notes-in-notebooklm-d9d70316b780"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/notebooklm-notes-guide.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;A how-to guide by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/steven-johnson/"&gt;steven-johnson&lt;/a&gt;, co-founder and editorial director of &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/notebooklm/"&gt;notebooklm&lt;/a&gt;, published March 2024. Part of a series on using the Gemini-powered writing and research tool from Google Labs.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Two note types with clear provenance&lt;/strong&gt;: Written Notes (user-authored, editable) and Saved Responses (AI-generated or source quotes, immutable). The immutability of Saved Responses is a deliberate design choice for tracking what the human wrote vs. what the AI wrote.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Pin-then-organize workflow&lt;/strong&gt;: Have an exploratory conversation with the AI grounded in your sources, pin interesting responses as you go, then revisit and structure later. Brainstorming first, organizing second.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Reading-integrated note-taking&lt;/strong&gt;: Selecting text in a source triggers AI actions — Add to Note, Summarize to Note, Help Me Understand, Suggest Related Ideas. The AI enhances the reading process, not just the writing process.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;5,000-word note query limit&lt;/strong&gt;: Selected notes exceeding 5,000 words cannot be queried via the chatbox. Workaround: combine all notes into one, then re-add as a source — converting notes into a queryable source.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cross-notebook portability&lt;/strong&gt;: The combine-and-copy pattern lets you move note collections between notebooks by pasting combined notes as a new source.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="design-philosophy"&gt;Design Philosophy&lt;a class="anchor" href="#design-philosophy"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;NotebookLM&amp;rsquo;s notes system embodies a specific stance on AI-assisted knowledge work: the AI is grounded in user-curated sources (not the open web), conversations are exploratory, and the user captures what matters. This is closer to the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&amp;rsquo;s &amp;ldquo;human curates, LLM processes&amp;rdquo; division of labor than to a general-purpose chatbot.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/paperclip/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/paperclip/</guid><description>&lt;h1 id="paperclip-open-source-orchestration-for-zero-human-companies"&gt;Paperclip: Open-Source Orchestration for Zero-Human Companies&lt;a class="anchor" href="#paperclip-open-source-orchestration-for-zero-human-companies"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/paperclipai/paperclip"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/paperclip-readme.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/"&gt;paperclip&lt;/a&gt; is an open-source Node.js server + React UI that orchestrates a team of AI agents to run a business. Not an agent framework — it&amp;rsquo;s the company layer above agents. Org charts, budgets, governance, goal alignment, and agent coordination. &amp;ldquo;If OpenClaw is an employee, Paperclip is the company.&amp;rdquo; MIT licensed.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Company-level orchestration, not agent-level&lt;/strong&gt;: Paperclip doesn&amp;rsquo;t build agents — it organizes them into companies with org charts, roles, reporting lines, budgets, and governance. This is a layer above &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt; (infrastructure), &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt; (tool), and &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt; (product).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Bring your own agent&lt;/strong&gt;: Works with Claude Code, Codex, Cursor, OpenClaw, Bash, HTTP. &amp;ldquo;If it can receive a heartbeat, it&amp;rsquo;s hired.&amp;rdquo; Agent-agnostic like &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, but at the business layer instead of the infrastructure layer.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Goal-aware execution&lt;/strong&gt;: Every task traces back to the company mission. Tasks carry full goal ancestry so agents see the &amp;ldquo;why,&amp;rdquo; not just a title. This is &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&amp;rsquo;s TELOS concept (goal orientation) applied to multi-agent companies.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Heartbeat-based scheduling&lt;/strong&gt;: Agents wake on a schedule, check work, and act. Delegation flows up and down the org chart. Similar to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&amp;rsquo;s scheduled tasks but with organizational hierarchy.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cost control as first-class&lt;/strong&gt;: Monthly budgets per agent. Atomic budget enforcement — when they hit the limit, they stop. No runaway costs. Addresses the evaluation economics gap from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/"&gt;evaluating-agent-skills-caparas&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Governance with rollback&lt;/strong&gt;: Approval gates enforced, config changes revisioned, bad changes rolled back. &amp;ldquo;You&amp;rsquo;re the board.&amp;rdquo; This is the human-in-the-loop pattern from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; (Pillar 6) at the organizational level.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Persistent agent state&lt;/strong&gt;: Agents resume the same task context across heartbeats instead of restarting from scratch. Addresses the memory/context persistence challenge from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Runtime skill injection&lt;/strong&gt;: Agents learn Paperclip workflows and project context at runtime. Related to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&amp;rsquo;s progressive disclosure.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Clipmart (coming soon)&lt;/strong&gt;: Download and run entire companies with one click. Pre-built company templates — full org structures, agent configs, and skills. This is the marketplace concept from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; applied to entire organizations.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multi-company isolation&lt;/strong&gt;: One deployment, many companies. Complete data isolation. One control plane for a portfolio.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="what-paperclip-is-not"&gt;What Paperclip Is NOT&lt;a class="anchor" href="#what-paperclip-is-not"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;Not a chatbot (agents have jobs, not chat windows)&lt;/li&gt;
&lt;li&gt;Not an agent framework (doesn&amp;rsquo;t tell you how to build agents)&lt;/li&gt;
&lt;li&gt;Not a workflow builder (no drag-and-drop pipelines)&lt;/li&gt;
&lt;li&gt;Not a prompt manager (agents bring their own prompts)&lt;/li&gt;
&lt;li&gt;Not a single-agent tool (this is for teams of 20+ agents)&lt;/li&gt;
&lt;li&gt;Not a code review tool (orchestrates work, not PRs)&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;&lt;/strong&gt;: Both are multi-agent orchestration, but at different layers. Scion = infrastructure (containers, runtimes, harnesses). Paperclip = business (org charts, budgets, goals, governance). Paperclip could theoretically use Scion as its runtime layer.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt;&lt;/strong&gt;: Paperclip introduces a fourth approach — &lt;strong&gt;company-level orchestration&lt;/strong&gt; — beyond the infrastructure (Scion), product (Kiro), and tool (Claude Code) approaches already in the wiki.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/strong&gt;: Both are goal-oriented. PAI&amp;rsquo;s TELOS (mission, goals, projects) maps to Paperclip&amp;rsquo;s company mission → project goals → task hierarchy. But PAI is personal; Paperclip is organizational.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/strong&gt;: Persistent agent state across heartbeats + goal ancestry flowing down = a form of hierarchical context management.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/strong&gt;: Runtime skill injection + Clipmart (company templates) extends the skill concept to organizational blueprints.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/"&gt;paperclip&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/personal-ai-infrastructure/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/personal-ai-infrastructure/</guid><description>&lt;h1 id="personal-ai-infrastructure-pai"&gt;Personal AI Infrastructure (PAI)&lt;a class="anchor" href="#personal-ai-infrastructure-pai"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/danielmiessler/Personal_AI_Infrastructure"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/personal-ai-infrastructure.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Author: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/"&gt;daniel-miessler&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;PAI is a personalized AI platform built natively on &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;, designed to turn it from a stateless tool into a persistent assistant that knows your goals, preferences, and history. It adds memory, skills, hooks, security, voice, and a &amp;ldquo;TELOS&amp;rdquo; goal system on top of Claude Code&amp;rsquo;s primitives. Open source (MIT), TypeScript/Bash. v4.0.3 as of March 2026.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Three levels of AI&lt;/strong&gt;: Chatbots (ask→answer→forget) → Agentic platforms (ask→use tools→get result) → PAI (observe→think→plan→execute→verify→learn→improve). The key differentiator is the &lt;strong&gt;learn&lt;/strong&gt; step — continuous feedback capture and self-improvement.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Goal-oriented, not task-oriented&lt;/strong&gt;: PAI&amp;rsquo;s primary focus is the human and their goals, not the tech. TELOS system: 10 files capturing who you are (MISSION.md, GOALS.md, PROJECTS.md, BELIEFS.md, MODELS.md, STRATEGIES.md, NARRATIVES.md, LEARNED.md, CHALLENGES.md, IDEAS.md).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Built on Claude Code, not replacing it&lt;/strong&gt;: &amp;ldquo;Claude Code is the engine. PAI is everything else that makes it &lt;em&gt;your&lt;/em&gt; car.&amp;rdquo; Uses Claude Code&amp;rsquo;s hooks, slash commands, MCP servers, context files as building blocks.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;16 design principles&lt;/strong&gt;: User centricity, foundational algorithm (scientific method loop), scaffolding &amp;gt; model, deterministic infrastructure, code before prompts, UNIX philosophy, CLI as interface, skill management, memory system, agent personalities, and more.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Primitives (architecture)&lt;/strong&gt;:
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;TELOS&lt;/strong&gt;: Deep goal understanding via 10 structured files&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;User/System separation&lt;/strong&gt;: USER/ (your stuff, upgrade-safe) vs SYSTEM/ (PAI infra)&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Skill system&lt;/strong&gt;: Deterministic outcomes first — CODE → CLI → PROMPT → SKILL hierarchy&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Memory system&lt;/strong&gt;: Three-tier (hot/warm/cold), continuous learning from ratings/sentiment/outcomes&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Hook system&lt;/strong&gt;: 8 event types for lifecycle automation&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Security system&lt;/strong&gt;: Policy-based, no need for &lt;code&gt;--dangerously-skip-permissions&lt;/code&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Voice system&lt;/strong&gt;: ElevenLabs TTS with prosody enhancement&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Notification system&lt;/strong&gt;: ntfy push, Discord, duration-aware routing&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Packs&lt;/strong&gt;: Standalone, AI-installable capability modules (Research, Security, Thinking, Media, etc.) that work without full PAI installation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Scale&lt;/strong&gt;: v4.0.0 has 63 skills, 21 hooks, 180 workflows, 14 agents.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Relationship to Fabric&lt;/strong&gt;: &amp;ldquo;Fabric is a collection of AI prompts (patterns) for specific tasks. PAI is infrastructure for &lt;em&gt;how your DA operates&lt;/em&gt;. They&amp;rsquo;re complementary.&amp;rdquo;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&lt;/strong&gt;: PAI&amp;rsquo;s memory system and TELOS are a different instantiation of the same insight — persistent, compounding knowledge. The wiki pattern compiles knowledge from external sources; PAI compiles knowledge about &lt;em&gt;you&lt;/em&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/strong&gt;: PAI is the most ambitious layer built on top of Claude Code&amp;rsquo;s primitives. It validates that Claude Code&amp;rsquo;s hooks, skills, and memory are sufficient building blocks for a full personal AI platform.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/strong&gt;: Same author. Fabric = patterns (what to ask). PAI = infrastructure (how the AI operates). Complementary. Many PAI users integrate Fabric patterns.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/strong&gt;: PAI&amp;rsquo;s skill system follows a CODE → CLI → PROMPT → SKILL hierarchy — more opinionated than the Agent Skills spec, with deterministic outcomes prioritized.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt;&lt;/strong&gt;: PAI implements many of the ten pillars: architecture (Pillar 1), scope (Pillar 3), modularity (Pillar 4), tool integration (Pillar 6), testing (Pillar 7), versioning (Pillar 8), and anti-patterns (Pillar 10).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/strong&gt;: PAI&amp;rsquo;s three-tier memory (hot/warm/cold) is a concrete implementation of context management strategies.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/pai/"&gt;pai&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/daniel-miessler/"&gt;daniel-miessler&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/llm-wiki-pattern/"&gt;llm-wiki-pattern&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/promptfoo/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/promptfoo/</guid><description>&lt;h1 id="promptfoo-llm-evals--red-teaming"&gt;Promptfoo: LLM Evals &amp;amp; Red Teaming&lt;a class="anchor" href="#promptfoo-llm-evals--red-teaming"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/promptfoo/promptfoo"&gt;Original&lt;/a&gt; | &lt;a href="https://www.promptfoo.dev/docs/"&gt;Docs&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/promptfoo-readme.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Open-source CLI and library for evaluating and red-teaming LLM apps. Now part of OpenAI. MIT licensed. The closest thing to a turnkey skill eval tool — YAML-based test cases, CI/CD integration, model comparison, and red teaming. Powers LLM apps serving 10M+ users in production.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Developer-first&lt;/strong&gt;: Fast, with live reload and caching. Runs 100% locally — prompts never leave your machine.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;YAML-based test cases&lt;/strong&gt;: Define inputs, expected outputs, and grading criteria in YAML. Directly maps to the &lt;code&gt;eval.yaml&lt;/code&gt; format proposed in &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/"&gt;how-to-eval-a-skill&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Multiple grading methods&lt;/strong&gt;: Exact match, contains, regex, LLM-as-judge, custom functions. Covers both deterministic (Tier 1) and LLM-graded (Tier 2) evaluation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;CI/CD integration&lt;/strong&gt;: Run evals in GitHub Actions, block merges on failures. This is the &amp;ldquo;eval on every commit&amp;rdquo; pattern from &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/"&gt;evaluating-agent-skills-caparas&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Red teaming&lt;/strong&gt;: Vulnerability scanning for prompt injection, jailbreaks, and other attacks. Relevant to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; Pillar 6 (security).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Model comparison&lt;/strong&gt;: Side-by-side comparison across providers (OpenAI, Anthropic, Azure, Bedrock, Ollama). Useful for the per-pattern model mapping that &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt; supports.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Now part of OpenAI&lt;/strong&gt;: Acquired but remains open source and MIT licensed.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&lt;/strong&gt;: Promptfoo is the most practical tool for implementing the three-tier eval framework. Its YAML test cases + CI/CD integration + LLM-as-judge support covers Tiers 1 and 2.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/"&gt;how-to-eval-a-skill&lt;/a&gt;&lt;/strong&gt;: The &lt;code&gt;eval.yaml&lt;/code&gt; format we proposed could be implemented directly in Promptfoo&amp;rsquo;s YAML config format.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/strong&gt;: An &lt;code&gt;evals/&lt;/code&gt; directory in the skill structure could contain Promptfoo config files, making skills self-evaluating.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/analyses/how-to-eval-a-skill/"&gt;how-to-eval-a-skill&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/evaluating-agent-skills-caparas/"&gt;evaluating-agent-skills-caparas&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/scion-docs/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/scion-docs/</guid><description>&lt;h1 id="scion-documentation"&gt;Scion Documentation&lt;a class="anchor" href="#scion-documentation"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://googlecloudplatform.github.io/scion"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/scion-docs.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Scion is an experimental multi-agent orchestration testbed by &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/google-cloud-platform/"&gt;google-cloud-platform&lt;/a&gt;, designed to manage concurrent LLM-based &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;s running in &lt;strong&gt;container&lt;/strong&gt;s across local machines and remote &lt;strong&gt;kubernetes&lt;/strong&gt; clusters. It acts as a &amp;ldquo;hypervisor for agents&amp;rdquo; — not a full multi-agent framework, but an infrastructure layer for running, isolating, and managing agent processes.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Manager-Worker architecture&lt;/strong&gt;: A host-side CLI (&lt;code&gt;scion&lt;/code&gt;) orchestrates agent containers. Each agent runs an LLM + &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt; loop in isolation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Harness-agnostic&lt;/strong&gt;: Supports &lt;strong&gt;gemini-cli&lt;/strong&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;, &lt;strong&gt;opencode&lt;/strong&gt;, and &lt;strong&gt;codex&lt;/strong&gt; through a common adapter interface. New harnesses can be added via a &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/plugin-system/"&gt;plugin-system&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Two operating modes&lt;/strong&gt;: Solo mode (local Docker/Podman, zero-config) and Hosted mode (centralized &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/hub/"&gt;hub&lt;/a&gt; dispatching to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/"&gt;runtime-broker&lt;/a&gt;s).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Strict isolation&lt;/strong&gt;: Each agent gets its own filesystem, credentials, environment variables, and git workspace. Shadow mounts (tmpfs) prevent cross-agent access.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Git-native workspaces&lt;/strong&gt;: Local mode uses git worktrees; hosted mode uses git init + fetch. Each agent works on its own branch.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/template/"&gt;template&lt;/a&gt;s as blueprints&lt;/strong&gt;: Agents are created from templates that define system prompts, tools, images, and configuration. Templates support inheritance.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-state-model/"&gt;agent-state-model&lt;/a&gt;&lt;/strong&gt;: Three-dimensional state tracking — Phase (lifecycle), Activity (cognitive state), Detail (freeform context).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Philosophy&lt;/strong&gt;: Favors isolation over constraints, interaction over autonomy, diversity of models/harnesses, and action over planning. Runs agents in &lt;code&gt;--yolo&lt;/code&gt; mode with infrastructure-level guardrails.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="scope"&gt;Scope&lt;a class="anchor" href="#scope"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;The documentation covers: architecture, core concepts, all four supported harnesses, configuration system, Hub/Broker distributed architecture, workspace strategies, security model, plugin system, and the Go package structure.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/skills-pipeline-sleestk/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/skills-pipeline-sleestk/</guid><description>&lt;h1 id="skills-pipeline-sleestk"&gt;Skills Pipeline (Sleestk)&lt;a class="anchor" href="#skills-pipeline-sleestk"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/Sleestk/Skills-Pipeline"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/Sleestk/Skills-Pipeline/tree/main/Obsidian"&gt;Obsidian Skill&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A collection of Claude skills organized as multi-stage pipelines. Three skill sets: a six-stage YouTube video production pipeline, a four-skill SaaS development stack, and a comprehensive Obsidian power user skill. Demonstrates how skills can be chained — each stage takes the previous stage&amp;rsquo;s output as input.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Skills as pipelines, not standalone units&lt;/strong&gt;: The YouTube pipeline chains 6 skills sequentially: Research → Script → SEO → Visual Director → Editor Brief → Thumbnail. Each skill&amp;rsquo;s output is the next skill&amp;rsquo;s input. This is a concrete implementation of the multi-skill pipeline pattern described in &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; (Pillar 9: Context Management Recipes).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Follows the Agent Skills format&lt;/strong&gt;: Each skill is a directory with a &lt;code&gt;SKILL.md&lt;/code&gt; file using YAML frontmatter (&lt;code&gt;name&lt;/code&gt;, &lt;code&gt;description&lt;/code&gt;) — exactly the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; spec. The Obsidian skill also uses &lt;code&gt;references/&lt;/code&gt; subdirectory for progressive disclosure.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Progressive disclosure in practice&lt;/strong&gt;: The Obsidian skill loads reference files on demand (&amp;ldquo;Read the relevant reference file(s) before responding&amp;rdquo;). 8 reference files covering editing, linking, canvas, bases, plugins, publish, vault architecture. Quick-reference syntax is inline; deep details are deferred. This is the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&amp;rsquo;s progressive disclosure pattern working in production.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Persona-driven skills&lt;/strong&gt;: Each skill defines a persona (&amp;ldquo;seasoned Obsidian knowledge architect&amp;rdquo;, &amp;ldquo;expert Next.js developer&amp;rdquo;). The persona sets tone, output standards, and a core rule. This maps to the agent persona concept in &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt; (Pillar 5).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Output format standards as tables&lt;/strong&gt;: The Obsidian skill defines exact output formats per type (notes → markdown, canvas → JSON, bases → YAML, folder structures → tree + bash script). This is deterministic output specification — related to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&amp;rsquo;s outcome goals.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Decision logic before responding&lt;/strong&gt;: The Obsidian skill has explicit decision logic: identify output type → load reference files → produce complete output → handle multi-category requests. This is a lightweight version of the agentic loop.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Test prompts included&lt;/strong&gt;: 10 validation prompts built into the skill itself. This is inline evaluation — the skill ships with its own test cases, directly implementing &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&amp;rsquo;s &amp;ldquo;start small with a targeted prompt set&amp;rdquo; advice.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="the-three-skill-sets"&gt;The Three Skill Sets&lt;a class="anchor" href="#the-three-skill-sets"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;h3 id="youtube-pipeline-6-stages"&gt;YouTube Pipeline (6 stages)&lt;a class="anchor" href="#youtube-pipeline-6-stages"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;ol&gt;
&lt;li&gt;&lt;code&gt;research-agent.md&lt;/code&gt; → Research Brief&lt;/li&gt;
&lt;li&gt;&lt;code&gt;script-agent.md&lt;/code&gt; → Production-ready script&lt;/li&gt;
&lt;li&gt;&lt;code&gt;seo-agent.md&lt;/code&gt; → YouTube metadata package&lt;/li&gt;
&lt;li&gt;&lt;code&gt;visual-director.md&lt;/code&gt; → Visual Production Brief&lt;/li&gt;
&lt;li&gt;&lt;code&gt;editor-brief.md&lt;/code&gt; → Complete editing guide&lt;/li&gt;
&lt;li&gt;&lt;code&gt;thumbnail-agent.md&lt;/code&gt; → Thumbnail Creative Brief&lt;/li&gt;
&lt;/ol&gt;
&lt;h3 id="saas-stack-4-skills"&gt;SaaS Stack (4 skills)&lt;a class="anchor" href="#saas-stack-4-skills"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;&lt;code&gt;nextjs-developer&lt;/code&gt; — Next.js 16.2.1 expert&lt;/li&gt;
&lt;li&gt;&lt;code&gt;supabase-js&lt;/code&gt; — Full-stack Supabase developer&lt;/li&gt;
&lt;li&gt;&lt;code&gt;stripe-developer&lt;/code&gt; — Stripe payment integrations&lt;/li&gt;
&lt;li&gt;&lt;code&gt;vercel-developer&lt;/code&gt; — Deploy and manage on Vercel&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id="obsidian-power-user-1-skill--8-references"&gt;Obsidian Power User (1 skill + 8 references)&lt;a class="anchor" href="#obsidian-power-user-1-skill--8-references"&gt;#&lt;/a&gt;&lt;/h3&gt;
&lt;p&gt;Comprehensive Obsidian expert covering vault design, canvas, bases, Dataview, Templater, all core plugins, Publish, Web Clipper, CSS snippets, URI links, and MOC notes.&lt;/p&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/spec-kit/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/spec-kit/</guid><description>&lt;h1 id="spec-kit-spec-driven-development-toolkit"&gt;Spec Kit: Spec-Driven Development Toolkit&lt;a class="anchor" href="#spec-kit-spec-driven-development-toolkit"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/github/spec-kit"&gt;Original&lt;/a&gt; | &lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/spec-kit-readme.md"&gt;Raw&lt;/a&gt;&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt; is an open-source toolkit by GitHub for Spec-Driven Development (SDD) — a methodology where specifications become executable, directly generating working implementations rather than just guiding them. Provides a CLI (&lt;code&gt;specify&lt;/code&gt;) and a set of slash commands that work across 30+ AI agents. MIT licensed.&lt;/p&gt;
&lt;h2 id="key-takeaways"&gt;Key Takeaways&lt;a class="anchor" href="#key-takeaways"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Specifications as the primary artifact&lt;/strong&gt;: SDD flips the script — specs define the &amp;ldquo;what&amp;rdquo; before the &amp;ldquo;how.&amp;rdquo; Code is generated from specs, not the other way around. This is the opposite of &amp;ldquo;vibe coding.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Seven-step workflow&lt;/strong&gt;: Constitution (principles) → Specify (requirements) → Clarify → Plan (tech stack) → Tasks (breakdown) → Implement → Verify. Each step produces a persistent artifact.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Agent-agnostic&lt;/strong&gt;: Supports 30+ AI agents including &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;, Codex, Gemini CLI, Cursor, Kiro CLI, Copilot, and many more. Uses slash commands (&lt;code&gt;/speckit.specify&lt;/code&gt;) or agent skills (&lt;code&gt;$speckit-specify&lt;/code&gt;).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Extension ecosystem&lt;/strong&gt;: 50+ community extensions covering docs, code review, process orchestration, integrations (Jira, Azure DevOps, Confluence, GitHub Projects), visibility, and security. Categories: docs, code, process, integration, visibility.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Presets for customization&lt;/strong&gt;: Override templates and commands without changing tooling. Stack multiple presets with priority ordering. Examples: pirate speak, compliance-oriented formats, domain-specific terminology.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Constitution as governance&lt;/strong&gt;: A &lt;code&gt;constitution.md&lt;/code&gt; file establishes project principles that guide all subsequent development — similar to &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&amp;rsquo;s CLAUDE.md but focused on project governance rather than agent behavior.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Feature branching built in&lt;/strong&gt;: Each feature gets its own branch (&lt;code&gt;001-create-taskify&lt;/code&gt;), spec directory, plan, and task breakdown. Git-native workflow.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Clarification before planning&lt;/strong&gt;: &lt;code&gt;/speckit.clarify&lt;/code&gt; runs structured questioning to reduce rework downstream. &amp;ldquo;Do not treat the first attempt as final.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Task breakdown with dependency management&lt;/strong&gt;: Tasks ordered by dependency, parallel execution markers (&lt;code&gt;[P]&lt;/code&gt;), file path specifications, TDD structure, checkpoint validation.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Skills integration&lt;/strong&gt;: Installs as Claude Code skills (&lt;code&gt;.claude/skills/&lt;/code&gt;) or Codex skills (&lt;code&gt;.agents/skills/&lt;/code&gt;). Follows the &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt; pattern.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="connections"&gt;Connections&lt;a class="anchor" href="#connections"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/strong&gt;: Spec Kit installs its commands as skills following the standard. Each &lt;code&gt;/speckit.*&lt;/code&gt; command is effectively a skill with a specific workflow.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/"&gt;ten-pillars-agentic-skill-design&lt;/a&gt;&lt;/strong&gt;: SDD operationalizes several pillars — Architecture (Pillar 1: structured artifacts), Documentation (Pillar 2: specs as documentation), Scope (Pillar 3: one feature per branch), Testing (Pillar 7: verify step), Version Control (Pillar 8: feature branching).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/strong&gt;: First-class integration. Skills installed in &lt;code&gt;.claude/skills/&lt;/code&gt;. Constitution parallels CLAUDE.md.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt;&lt;/strong&gt;: Agent-agnostic like &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, but at the methodology layer. The workflow is the same regardless of which agent executes it.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/strong&gt;: The seven-step workflow is progressive disclosure applied to development — each step produces a focused artifact that feeds the next, rather than dumping everything into one prompt.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/skill-evaluation/"&gt;skill-evaluation&lt;/a&gt;&lt;/strong&gt;: The &lt;code&gt;/speckit.analyze&lt;/code&gt; command (cross-artifact consistency check) and &lt;code&gt;/speckit.checklist&lt;/code&gt; (quality validation) are built-in evaluation mechanisms — &amp;ldquo;unit tests for English.&amp;rdquo;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/paperclip/"&gt;paperclip&lt;/a&gt;&lt;/strong&gt;: Both address the &amp;ldquo;above the agent&amp;rdquo; layer, but differently. Paperclip = organizational orchestration (org charts, budgets). Spec Kit = methodological orchestration (specs, plans, tasks).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/fabric/"&gt;fabric&lt;/a&gt;&lt;/strong&gt;: Spec Kit&amp;rsquo;s slash commands are like Fabric&amp;rsquo;s patterns — reusable, named workflows. But Spec Kit&amp;rsquo;s are sequential (a pipeline) while Fabric&amp;rsquo;s are independent.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="see-also"&gt;See Also&lt;a class="anchor" href="#see-also"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/spec-kit/"&gt;spec-kit&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-skills-standard/"&gt;agent-skills-standard&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/claude-code/"&gt;claude-code&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/context-management/"&gt;context-management&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;</description></item><item><title/><link>https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/sources/ten-pillars-agentic-skill-design/</guid><description>&lt;h1 id="the-ten-pillars-of-agentic-skill-design"&gt;The Ten Pillars of Agentic Skill Design&lt;a class="anchor" href="#the-ten-pillars-of-agentic-skill-design"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;p&gt;&lt;a href="https://github.com/imforster/llm-wiki/blob/main/raw/llm/ten-pillars-agentic-skill-design.md"&gt;Original&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;Author: Ian Forster (with support of Kiro)
Version: 2.0 | November 2024&lt;/p&gt;
&lt;h2 id="summary"&gt;Summary&lt;a class="anchor" href="#summary"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;A research paper proposing a comprehensive ten-pillar framework for designing agentic skills files — the modular extensions that encapsulate domain knowledge, workflows, and tool integrations for AI agents. Synthesizes software engineering principles, prompt engineering best practices, and analysis of 4,476+ GitHub repositories to address the lack of standardized design methodologies.&lt;/p&gt;</description></item><item><title>Activity Log</title><link>https://blog.imfsoftware.com/llm-wiki/docs/meta/log/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://blog.imfsoftware.com/llm-wiki/docs/meta/log/</guid><description>&lt;h1 id="wiki-log"&gt;Wiki Log&lt;a class="anchor" href="#wiki-log"&gt;#&lt;/a&gt;&lt;/h1&gt;
&lt;h2 id="2026-04-07-create--wiki-initialized"&gt;[2026-04-07] create | Wiki initialized&lt;a class="anchor" href="#2026-04-07-create--wiki-initialized"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Wiki structure created: &lt;code&gt;CLAUDE.md&lt;/code&gt; schema, &lt;code&gt;wiki/index.md&lt;/code&gt;, &lt;code&gt;wiki/log.md&lt;/code&gt;, directory scaffolding. Ready for first ingest.&lt;/p&gt;
&lt;h2 id="2026-04-07-ingest--scion-documentation"&gt;[2026-04-07] ingest | Scion Documentation&lt;a class="anchor" href="#2026-04-07-ingest--scion-documentation"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Source: &lt;a href="https://googlecloudplatform.github.io/scion"&gt;https://googlecloudplatform.github.io/scion&lt;/a&gt; — multi-agent orchestration testbed by Google Cloud Platform. Created 13 wiki pages:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Source: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/scion-docs/"&gt;scion-docs&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Entities: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/scion/"&gt;scion&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/google-cloud-platform/"&gt;google-cloud-platform&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Concepts: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent/"&gt;agent&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/agent-state-model/"&gt;agent-state-model&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/grove/"&gt;grove&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/harness/"&gt;harness&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/hub/"&gt;hub&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/template/"&gt;template&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime/"&gt;runtime&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/runtime-broker/"&gt;runtime-broker&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/plugin-system/"&gt;plugin-system&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="2026-04-07-ingest--kiro-autonomous-agent"&gt;[2026-04-07] ingest | Kiro Autonomous Agent&lt;a class="anchor" href="#2026-04-07-ingest--kiro-autonomous-agent"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Source: &lt;a href="https://kiro.dev/autonomous-agent/"&gt;https://kiro.dev/autonomous-agent/&lt;/a&gt; — AWS&amp;rsquo;s frontier agent for autonomous development tasks. Created 6 wiki pages, updated 1:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Source: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/sources/kiro-autonomous-agent/"&gt;kiro-autonomous-agent&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Entities: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/kiro/"&gt;kiro&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/entities/aws/"&gt;aws&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Concepts: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/frontier-agent/"&gt;frontier-agent&lt;/a&gt;, &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/kiro-powers/"&gt;kiro-powers&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Updated: &lt;a href="https://blog.imfsoftware.com/llm-wiki/docs/concepts/multi-agent-orchestration/"&gt;multi-agent-orchestration&lt;/a&gt; — added Kiro as second approach, comparison with Scion&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id="2026-04-07-ingest--claude-code-documentation"&gt;[2026-04-07] ingest | Claude Code Documentation&lt;a class="anchor" href="#2026-04-07-ingest--claude-code-documentation"&gt;#&lt;/a&gt;&lt;/h2&gt;
&lt;p&gt;Source: &lt;a href="https://code.claude.com/docs/en"&gt;https://code.claude.com/docs/en&lt;/a&gt; — Anthropic&amp;rsquo;s agentic coding tool. Created 4 wiki pages, updated 2:&lt;/p&gt;</description></item></channel></rss>