The coding agents tracker

One bookmark for coding agents, LLMs, and MCP.

This pillar collects everything on the site about AI coding agents, LLMs, and the Model Context Protocol: Claude Code, Cursor, Aider, GitHub Copilot’s agent mode, the Microsoft Agent Framework, MCP servers and skills, and the patterns for wiring LLM-driven automation into real workflows.

What to read first

If you’re new to this track, start with Microsoft Agent Framework 1.0: Building AI Agents in Pure C# for the .NET side of the story, and Wiring Model Context Protocol servers from C# on .NET 10 for the MCP plumbing most of the 2026 posts build on. Then How to Build a Custom MCP Server in C# on .NET 11 walks you through wiring one end-to-end.

For Copilot specifically, Agent Skills land in Visual Studio 2026 18.5 and What 878 Copilot Coding Agent PRs in dotnet/runtime actually look like cover both the feature surface and the real-world results. For Claude Code specifically, How to Write a CLAUDE.md That Actually Changes Model Behaviour is the highest-leverage thing to read, and How to Structure a Monorepo So Claude Code’s Context Stays Small is the natural follow-up once a CLAUDE.md is in place.

What’s on this page

The table below is auto-generated from posts tagged with any of: ai-agents, llm, mcp, claude-code, cursor, github-copilot, agent-skills, microsoft-agent-framework. A weekly job re-sorts it so the newest coverage bubbles up.

If you want the .NET-centric view instead, the companion .NET 11 tracker pillar collects every post about the .NET 11 release cycle; a lot of those posts overlap with this one via the ai and mcp tags.

Index (66 posts)

2026 / 06

MCP vs OpenAPI Plugins vs Custom Tool Calling for AI Agents: Which Should You Pick in 2026?
Use custom tool calling for one app you own end to end, MCP for any integration you want reused across Claude Code, Cursor, and ChatGPT, and OpenAPI plugins only as a bridge when you already have a spec. MCP is the cross-client standard in 2026; plugins are legacy.
Claude Code vs Cursor vs Copilot Agent Mode: Where Each Wins in 2026
Claude Code wins on raw agent quality and predictable flat pricing, Cursor wins as the editor you live in with the best multi-model routing, and GitHub Copilot wins when the work starts and ends on GitHub. The June 2026 token-billing shift is the tiebreaker for cost.
Claude Code vs Cursor vs Aider for a .NET 11 Repo in 2026
For a large .NET 11 / C# 14 solution, Claude Code wins on agent quality and runs anywhere a terminal does. Cursor wins if your team wants a GUI and tab completion, as long as you can live without C# Dev Kit. Aider wins on cost and openness.
Fix: MCP Servers Stopped Working After a Claude Desktop Update on Windows
After an MSIX update, Claude Desktop on Windows reads claude_desktop_config.json from a virtualized path while Edit Config opens the old %APPDATA% one. Edit the file under AppData\Local\Packages\Claude_pzs8sxrjxfjjc and restart.

2026 / 05

Claude Code's Dynamic Workflows Fan a Single Prompt Out to Up to 1,000 Subagents
Anthropic shipped Opus 4.8 on May 28, 2026 with Dynamic Workflows, a research preview in Claude Code that writes a JavaScript orchestration script to run tens to hundreds of subagents in parallel, capped at 1,000 per run.
Fix: GitHub MCP server tool calls fail silently when the PAT isn't passed
GitHub MCP tools show up but every call returns empty or 'Bad credentials'? The token is unset. Put the PAT in the env block, verify with curl, done.
Fix: "Claude reached its tool-use limit for this turn"
This banner is a pause, not a failure. It maps to the API's pause_turn stop reason (default 10 server-tool iterations). Click Continue, or in code re-send the assistant response. Full fix here.
The Dart and Flutter MCP server: one command to give Claude Code your running Flutter app
Dart 3.12 ships dart mcp-server, the official MCP bridge into the Dart and Flutter toolchain. Register it once in Claude Code, Cursor, or Codex CLI and your agent gets hot reload, pub.dev search, and live widget introspection without copy-pasting a DTD URI.
Fix: HTTP MCP Server URL Won't Connect in Claude Desktop (stdio vs HTTP Transport)
Claude Desktop's claude_desktop_config.json only validates stdio servers. Drop a 'url' field in and it silently strips the entry, crashes on startup, or boots with zero tools. Use a custom connector for remote URLs and mcp-remote as a stdio bridge for local HTTP servers.
How to Assign a Jira Ticket to a Cursor Cloud Agent and Get a PR Back
Step-by-step guide to the Cursor in Jira integration (launched May 19, 2026): install the Marketplace app, assign a ticket to @Cursor, wire repository access and automation rules, and read the PR the cloud agent links back.
How to Let Aider Edit Files Outside the Git Repository
Aider is git-first by default. To edit files outside the repo, pass them on the command line, use /read for reference docs, drop --no-git for unversioned dirs, or symlink. Five working patterns with exact flags.
How to Reduce the Number of MCP Tools Claude Loads to Avoid the Tool-Use Limit
Five MCP servers can burn 55k tokens and wreck tool-selection accuracy before Claude does any work. How to fix it in Claude Code 2.1.x with ENABLE_TOOL_SEARCH and alwaysLoad, on the raw API with the tool search tool and defer_loading, and on the MCP connector with mcp_toolset. Anchored to claude-opus-4-7, claude-sonnet-4-6, and the 20251119 tool search variants.
How to Author a Function Tool in the Microsoft Agent Framework: Inline, Method, or Class
If you came from Semantic Kernel looking for 'skills,' the Microsoft Agent Framework 1.0 calls them function tools and builds every one from AIFunctionFactory.Create. Here are the three ways to author one in C# -- an inline lambda, a named method, or a class of related tools -- plus what the declarative YAML 'file' approach can and cannot do.
Fix: Extra Inputs Are Not Permitted on a Tool Call With a Structured Argument
Your agent's tool call fails with Pydantic's 'Extra inputs are not permitted'. The model added a field your structured argument forbids. Turn on strict tool use so the grammar blocks it, relax the validator, or recover in the loop.
Fix: `ECONNREFUSED` When a Local MCP Server Starts Before the Client Is Ready
ECONNREFUSED from an MCP client means nothing was listening on that host:port yet. Fix the startup race, the localhost IPv4/IPv6 trap, and wrong-port mistakes for HTTP MCP servers.
Cursor 3.4 Adds Multi-Repo Environments and Faster Dockerfile Builds for Cloud Agents
Cursor 3.4 (May 13, 2026) lets one cloud agent environment include multiple repositories, adds Dockerfile build secrets, layer-cached rebuilds that run 70% faster, and an agent-led setup step that validates credentials before the first run.
Fix: GitHub Copilot Ignores Repository Custom Instructions in VS Code
Copilot Chat reads your .github/copilot-instructions.md fine, but Agent mode and *.instructions.md files often look ignored. Here is the actual checklist: settings, file locations, applyTo globs, and how to prove the file made it into context.
Fix: Cursor's Apply Button Does Nothing on a Large Diff
Cursor's Apply silently cancels on big files because the Fast Apply model has an implicit size ceiling. Shrink the edit scope, clean git state, or fall back to a unified diff. Six ranked fixes for Cursor 3.3.
Fix: Claude Code Reports `MCP server disconnected` Inside WSL
Why Claude Code shows 'MCP server disconnected' or 'Connection closed' for a working MCP server when launched from WSL2. Covers stdio spawning across the Windows/Linux boundary, npx and node ENOENT, mirrored networking for HTTP servers, VMMem reaping, and how to diagnose the right one with claude --debug.
GPT-5.3-Codex Becomes the Copilot Business and Enterprise Base Model
On May 17, 2026 GitHub flipped the default Copilot model on Business and Enterprise plans from GPT-4.1 to GPT-5.3-Codex. GPT-4.1 stays free until June 1, then it falls under usage-based billing. Here is what changes for pinned models in your repo and CI.
Fix: rate_limit_error on Claude Sonnet 4.6 in a Long Agent Loop
A 429 rate_limit_error on claude-sonnet-4-6 in a long agent loop is almost always ITPM, not RPM. Read retry-after, cache the system prompt, and gate on anthropic-ratelimit-input-tokens-remaining. Step-by-step fix with code.
dotnet new mcpserver Now Ships in the .NET 11 Preview 4 SDK
.NET 11 Preview 4 bundles the mcpserver project template directly into the SDK. No separate Microsoft.McpServer.ProjectTemplates install, no preview feed dance. Pick stdio or HTTP transport, opt into Native AOT, and dotnet new mcpserver -o MyServer is the whole setup.
Fix: Context Window Exceeded During an Aider Refactor
Aider 0.86 hit the token limit mid-refactor. The fix is /clear, /drop, --map-tokens, and switching to architect mode with Claude Sonnet 4.6's 1M window. Step-by-step repro, error breakdown, and config.
Fix: Tool Call Arguments Did Not Match Schema in Anthropic Tool Use
Why your Claude tool call fails schema validation, in both flavours: the API rejecting your tool definition at request time, and Claude returning a tool_use block your runner cannot accept. Concrete fixes for each, with strict mode, oneOf/anyOf, additionalProperties, and the retry loop pattern.
Fix: MCP Server stdio Hang When Launched From Claude Code
Why your Model Context Protocol server gets stuck in 'connecting' from Claude Code 2.x and never registers any tools. Covers stdout pollution, the npx install prompt, MCP_TIMEOUT, buffered output, and WSL pitfalls, with verifiable repros.
How to Add Retrieval-Augmented Generation to a Claude Code Session
A 2026 walkthrough for wiring RAG into Claude Code 2.1.x: when agentic grep stops scaling, how to attach a hybrid BM25 + dense vector MCP server, how to wrap a retrieval CLI in a custom skill, and how Anthropic's contextual embeddings technique pushes recall above 92%. Anchored to claude-sonnet-4-6, claude-opus-4-7, and Claude Context 0.x.
How to Structure a Monorepo So Claude Code's Context Stays Small
A 2026 playbook for keeping Claude Code's 200k token context lean in a monorepo: launch from the subtree you are touching, split CLAUDE.md into nested files that load on demand, push path-scoped rules into .claude/rules/, use skills and subagents for the noisy reads, and exclude other teams' files with claudeMdExcludes. Anchored to Claude Code 2.1.x, claude-sonnet-4-6, and claude-opus-4-7.
Cursor Bugbot Adds Default, High, and Custom Effort Levels
On May 11, 2026, Cursor shipped effort levels for Bugbot. Default finds 0.7 bugs per review, High pushes it to 0.95, and Custom lets you describe in plain English when each mode should kick in.
How to Cache Multi-Turn Claude Conversations Across API Calls
Place rolling cache_control breakpoints across messages, respect the 20-block lookback, and refresh the 5-minute TTL automatically so a 50-turn agent loop pays the prefix once, not fifty times. Verified against anthropic 0.42 (Python) and @anthropic-ai/sdk 0.30 (Node) in May 2026.
Cursor 3.3 Adds Build in Parallel, Split PRs, and a Unified PR Review
Cursor 3.3 (May 7, 2026) ships async subagents that work on independent steps of a plan at the same time, a quick action that splits one chat into multiple pull requests, and a redesigned PR workflow that keeps reviews, commits, and changes in one place.
How to Set Up an LLM-as-Judge Eval Harness for a Coding Agent
Build a working LLM-as-judge eval harness for a coding agent in Python: golden tasks, deterministic checks, a rubric judge on Claude Sonnet 4.6, calibration against human labels, and a CI gate that fails the build when scores regress.
GitHub Copilot Drops Claude Sonnet 4 From Every Surface
GitHub deprecated claude-sonnet-4 on May 6, 2026 across Copilot Chat, inline edits, ask and agent modes, and code completions. Recommended migration target is Claude Sonnet 4.6. What to grep for in your repo before the next pinned model selection silently breaks.
How to Pipe Cursor's Context to an Aider Session for Multi-Agent Refactors
Cursor is the best place to plan a refactor. Aider is the best place to execute it from a terminal with cheap models and atomic git commits. This guide shows the exact pipe: dump the Cursor chat to markdown, hand it to Aider as a read-only context file, and run an architect/editor split that finishes the work.
How to Write a Claude Code Subagent That Runs Browser Tests
Build a project-scoped Claude Code subagent that drives Playwright in a real browser, scoped to its own MCP server so the main session never sees the 25 browser_* tools. Covers the .claude/agents/browser-tester.md frontmatter, mcpServers inline definition, allowed tool list, isolation: worktree, the Playwright Test Agents init flow, and the Sonnet-vs-Haiku model choice.
Microsoft Agent Framework workflows now survive process restarts via the Durable Task stack
Wrap an Agent Framework Workflow in Microsoft.Agents.AI.DurableTask and each executor step is checkpointed. Crash, redeploy, restart - the run continues where it stopped.
How to Give a Copilot Agent Skill Access to Your Repo Conventions
Turn the unwritten rules in your repo into a SKILL.md that GitHub Copilot loads on demand. Frontmatter, descriptions that route, file references, and how to verify it actually fires.
Microsoft Agent Framework gates risky tool calls behind FunctionApprovalRequestContent
Wrap an AIFunction in ApprovalRequiredAIFunction and the agent stops mid-run to ask permission. Here is how the request and response flow works in C#.
How to Run a Semantic Kernel Plugin From a BackgroundService
Wire a Microsoft.SemanticKernel 1.75.0 plugin into a hosted BackgroundService on .NET 11 and invoke its KernelFunctions on a PeriodicTimer schedule. Covers DI scopes, [KernelFunction] resolution, prompt-cache-friendly invocation, cancellation, and the lifetime gotchas that bite when you move a plugin off the request path.
Claude Code 2.1.128 Loads Plugins From .zip Archives and Stops Dropping Unpushed Commits
Claude Code v2.1.128 (May 4, 2026) ships --plugin-dir support for .zip archives, makes EnterWorktree branch from local HEAD, and stops the CLI from leaking its own OTLP endpoint into Bash subprocesses.
How to Expose an EF Core Database to an AI Agent via MCP
Wire an EF Core 10 DbContext into a Model Context Protocol server so Claude Code, Cursor, or any compliant client can run safe, scoped queries against your database. Covers IDbContextFactory lifetime, read-only projections, schema discovery tools, AsNoTracking, parameterised filters, row-level scoping, and the destructive-tool gates you need before letting an agent touch UPDATE.
Cursor Ships a TypeScript SDK That Turns Its Coding Agent Into a Library
Cursor's new @cursor/sdk public beta exposes the same runtime, harness, and models that drive the desktop app, CLI, and web UI as a TypeScript package. You get sandboxed cloud VMs, subagents, hooks, MCP, and token-based pricing in a few lines of code.
Claude Code 2.1.126 Adds `claude project purge` to Wipe All State for a Repo
Claude Code v2.1.126 ships claude project purge, a new CLI subcommand that deletes every transcript, task, file-history entry, and config block tied to a project path in a single shot. Includes --dry-run, --yes, --interactive, and --all.
How to Add Tool Calling to a Microsoft.Extensions.AI Chat Client
Wire AIFunctionFactory.Create, ChatOptions.Tools, and ChatClientBuilder.UseFunctionInvocation in Microsoft.Extensions.AI 10.5 so an IChatClient can call your .NET methods automatically. Covers OpenAI and Azure OpenAI providers, the FunctionInvokingChatClient knobs that actually matter (iteration limits, concurrent calls, approval prompts, error handling), and streaming responses with tools.
Agent Governance Toolkit puts a YAML policy in front of every MCP tool call from .NET
Microsoft's new Microsoft.AgentGovernance package wraps MCP tool calls with a policy kernel, a security scanner, and a response sanitizer. Here is what each piece does and how the wiring looks in C#.
How to Migrate a Semantic Kernel Plugin to an MCP Server
Take an existing Semantic Kernel plugin with [KernelFunction] methods and turn it into a Model Context Protocol server other agents can call. Covers the drop-in WithTools(kernel) bridge, the native [McpServerTool] rewrite, parameter binding, dependency injection, and the gotchas that bite during the cutover.
How to Run Claude Code in a GitHub Action for Autonomous PR Review
Wire up anthropics/claude-code-action@v1 so every pull request gets an autonomous Claude Code review with no @claude trigger. Includes the v1 YAML, claude_args for claude-sonnet-4-6 vs claude-opus-4-7, inline-comment tooling, path filters, REVIEW.md, and the choice between the self-hosted action and the managed Code Review research preview.

2026 / 04

Claude Code 2.1.122 Lets You Pick a Bedrock Service Tier From an Env Var
Claude Code v2.1.122 adds the ANTHROPIC_BEDROCK_SERVICE_TIER environment variable, sent as the X-Amzn-Bedrock-Service-Tier header. Set it to flex for a 50 percent discount on agent calls or priority for faster responses, without touching SDK code.
How to Call the Claude API from a .NET 11 Minimal API with Streaming
Stream Claude responses from an ASP.NET Core 11 minimal API end-to-end: the official Anthropic .NET SDK, TypedResults.ServerSentEvents, SseItem, IAsyncEnumerable, cancellation flow, and the gotchas that buffer your tokens silently. With Claude Sonnet 4.6 and Opus 4.7 examples.
cowork-terminal-mcp: Host Terminal Access for Claude Cowork in One MCP Server
cowork-terminal-mcp v0.4.1 bridges Claude Cowork's sandboxed VM to your host shell. One tool, stdio transport, hard-pinned Git Bash on Windows.
Export Claude Code Conversations to PDF With jsonl-to-pdf
A practical guide to turning the JSONL files Claude Code writes under ~/.claude/projects/ into shareable PDFs using jsonl-to-pdf, with sub-agent nesting, secret redaction, compact and dark themes, and CI-friendly batch recipes.
How to Add Prompt Caching to an Anthropic SDK App and Measure the Hit Rate
Add prompt caching to a Python or TypeScript Anthropic SDK app, place cache_control breakpoints correctly, and read cache_read_input_tokens and cache_creation_input_tokens to compute a real hit rate. With pricing math for Claude Sonnet 4.6 and Opus 4.7.
How to Write a CLAUDE.md That Actually Changes Model Behaviour
A 2026 playbook for CLAUDE.md files that Claude Code actually follows: the 200-line target, when to use path-scoped rules in .claude/rules/, @import hierarchy and 5-hop max depth, the user-message vs system-prompt gap, the line between CLAUDE.md and auto memory, and when to give up and write a hook instead. Anchored to Claude Code 2.1.x and verified against the official memory docs.
Claude Code 2.1.119 Pulls PRs From GitLab, Bitbucket, and GitHub Enterprise
Claude Code v2.1.119 expands --from-pr beyond github.com. The CLI now accepts GitLab merge-request, Bitbucket pull-request, and GitHub Enterprise PR URLs, and a new prUrlTemplate setting points the footer PR badge at the right code-review host.
How to Schedule a Recurring Claude Code Task That Triages GitHub Issues
Three ways to put Claude Code on a schedule that triages GitHub issues unattended in 2026: cloud Routines (the new /schedule), the claude-code-action v1 with cron + issues.opened, and the session-scoped /loop. Includes a runnable Routine prompt, a complete GitHub Actions YAML, jitter and identity gotchas, and when to pick which.
GitHub Copilot Chat BYOK Goes GA in VS Code: Anthropic, Ollama, Foundry Local
GitHub Copilot for VS Code shipped Bring Your Own Key on April 22, 2026. Wire your own Anthropic, OpenAI, Gemini, OpenRouter, or Azure account into Chat, or point at a local Ollama or Foundry Local model. Billing skips the Copilot quota and goes straight to the provider.
How to Build a Custom MCP Server in C# on .NET 11
Build a working Model Context Protocol server in C# 14 / .NET 11 using the official ModelContextProtocol 1.2 SDK. Covers stdio transport, [McpServerTool] attributes, dependency injection, the stderr logging trap, and registration with Claude Code, Claude Desktop, and VS Code.
How to Build a Custom MCP Server in Python with the Official SDK
Build a working Model Context Protocol server in Python using the official mcp 1.27 SDK and FastMCP. Covers Pydantic schemas, the stdio stdout trap, mcp dev / mcp install, and registration with Claude Desktop and Claude Code.
How to Build a Custom MCP Server in TypeScript That Wraps a CLI
Step-by-step guide to wrapping any command-line tool as a Model Context Protocol server using the TypeScript SDK 1.29. Covers the stdout trap, child_process patterns, error propagation, and a full working git server.
Azure MCP Server Ships Inside Visual Studio 2022 17.14.30, No Extension Required
Visual Studio 2022 17.14.30 bundles the Azure MCP Server into the Azure development workload. Copilot Chat can hit 230+ Azure tools across 45 services without installing a thing.
Visual Studio 18.5's Debugger Agent Turns Copilot Into a Live Bug-Hunting Partner
Visual Studio 18.5 GA ships a guided Debugger Agent workflow in Copilot Chat that forms a hypothesis, sets breakpoints, rides along through a repro, validates against runtime state, and proposes a fix.
Agent Skills Land in Visual Studio 2026 18.5: Copilot Auto-Discovers SKILL.md From Your Repo
Visual Studio 2026 18.5.0 lets GitHub Copilot load Agent Skills from .github/skills, .claude/skills, and ~/.copilot/skills. Reusable SKILL.md instruction packs travel with your repo.
GitHub Copilot Modernization: The Assessment Report Is the Actual Product
GitHub Copilot Modernization is pitched as an Assess, Plan, Execute loop for migrating legacy .NET apps. The assessment phase is where the value lives: an inventory report, categorized blockers, and file-level remediation guidance you can diff like code.
Microsoft Agent Framework 1.0: Building AI Agents in Pure C#
Microsoft Agent Framework hits 1.0 with stable APIs, multi-provider connectors, multi-agent orchestration, and A2A/MCP interop. Here is what it looks like in practice on .NET 10.

2026 / 03

What 878 Copilot Coding Agent PRs in dotnet/runtime Actually Look Like
The .NET team shares ten months of real data on running GitHub's Copilot Coding Agent in dotnet/runtime: 878 PRs, a 67.9% merge rate, and clear lessons on where AI-assisted development helps and where it still falls short.
Generative AI for Beginners .NET v2: Rebuilt for .NET 10 with Microsoft.Extensions.AI
Microsoft's free generative AI course for .NET developers ships Version 2, rebuilt for .NET 10 and migrated from Semantic Kernel to Microsoft.Extensions.AI's IChatClient pattern.

2026 / 01

Microsoft `mcp`: Wiring Model Context Protocol Servers from C# on .NET 10
How to wire Model Context Protocol (MCP) servers in C# on .NET 10 using microsoft/mcp. Covers tool contracts, input validation, auth, observability, and production-readiness patterns.