Start Debugging

The coding agents tracker

One bookmark for coding agents, LLMs, and MCP.

This pillar collects everything on the site about AI coding agents, LLMs, and the Model Context Protocol: Claude Code, Cursor, Aider, GitHub Copilot’s agent mode, the Microsoft Agent Framework, MCP servers and skills, and the patterns for wiring LLM-driven automation into real workflows.

What to read first

If you’re new to this track, start with Microsoft Agent Framework 1.0: Building AI Agents in Pure C# for the .NET side of the story, and Wiring Model Context Protocol servers from C# on .NET 10 for the MCP plumbing most of the 2026 posts build on. Then How to Build a Custom MCP Server in C# on .NET 11 walks you through wiring one end-to-end.

For Copilot specifically, Agent Skills land in Visual Studio 2026 18.5 and What 878 Copilot Coding Agent PRs in dotnet/runtime actually look like cover both the feature surface and the real-world results. For Claude Code specifically, How to Write a CLAUDE.md That Actually Changes Model Behaviour is the highest-leverage thing to read.

What’s on this page

The table below is auto-generated from posts tagged with any of: ai-agents, llm, mcp, claude-code, cursor, github-copilot, agent-skills, microsoft-agent-framework. A weekly job re-sorts it so the newest coverage bubbles up.

If you want the .NET-centric view instead, the companion .NET 11 tracker pillar collects every post about the .NET 11 release cycle; a lot of those posts overlap with this one via the ai and mcp tags.

Index (42 posts)

2026 / 05

  • Fix: MCP Server stdio Hang When Launched From Claude Code

    Why your Model Context Protocol server gets stuck in 'connecting' from Claude Code 2.x and never registers any tools. Covers stdout pollution, the npx install prompt, MCP_TIMEOUT, buffered output, and WSL pitfalls, with verifiable repros.

  • How to Add Retrieval-Augmented Generation to a Claude Code Session

    A 2026 walkthrough for wiring RAG into Claude Code 2.1.x: when agentic grep stops scaling, how to attach a hybrid BM25 + dense vector MCP server, how to wrap a retrieval CLI in a custom skill, and how Anthropic's contextual embeddings technique pushes recall above 92%. Anchored to claude-sonnet-4-6, claude-opus-4-7, and Claude Context 0.x.

  • How to Structure a Monorepo So Claude Code's Context Stays Small

    A 2026 playbook for keeping Claude Code's 200k token context lean in a monorepo: launch from the subtree you are touching, split CLAUDE.md into nested files that load on demand, push path-scoped rules into .claude/rules/, use skills and subagents for the noisy reads, and exclude other teams' files with claudeMdExcludes. Anchored to Claude Code 2.1.x, claude-sonnet-4-6, and claude-opus-4-7.

  • Cursor Bugbot Adds Default, High, and Custom Effort Levels

    On May 11, 2026, Cursor shipped effort levels for Bugbot. Default finds 0.7 bugs per review, High pushes it to 0.95, and Custom lets you describe in plain English when each mode should kick in.

  • How to Cache Multi-Turn Claude Conversations Across API Calls

    Place rolling cache_control breakpoints across messages, respect the 20-block lookback, and refresh the 5-minute TTL automatically so a 50-turn agent loop pays the prefix once, not fifty times. Verified against anthropic 0.42 (Python) and @anthropic-ai/sdk 0.30 (Node) in May 2026.

  • Cursor 3.3 Adds Build in Parallel, Split PRs, and a Unified PR Review

    Cursor 3.3 (May 7, 2026) ships async subagents that work on independent steps of a plan at the same time, a quick action that splits one chat into multiple pull requests, and a redesigned PR workflow that keeps reviews, commits, and changes in one place.

  • How to Set Up an LLM-as-Judge Eval Harness for a Coding Agent

    Build a working LLM-as-judge eval harness for a coding agent in Python: golden tasks, deterministic checks, a rubric judge on Claude Sonnet 4.6, calibration against human labels, and a CI gate that fails the build when scores regress.

  • GitHub Copilot Drops Claude Sonnet 4 From Every Surface

    GitHub deprecated claude-sonnet-4 on May 6, 2026 across Copilot Chat, inline edits, ask and agent modes, and code completions. Recommended migration target is Claude Sonnet 4.6. What to grep for in your repo before the next pinned model selection silently breaks.

  • How to Pipe Cursor's Context to an Aider Session for Multi-Agent Refactors

    Cursor is the best place to plan a refactor. Aider is the best place to execute it from a terminal with cheap models and atomic git commits. This guide shows the exact pipe: dump the Cursor chat to markdown, hand it to Aider as a read-only context file, and run an architect/editor split that finishes the work.

  • How to Write a Claude Code Subagent That Runs Browser Tests

    Build a project-scoped Claude Code subagent that drives Playwright in a real browser, scoped to its own MCP server so the main session never sees the 25 browser_* tools. Covers the .claude/agents/browser-tester.md frontmatter, mcpServers inline definition, allowed tool list, isolation: worktree, the Playwright Test Agents init flow, and the Sonnet-vs-Haiku model choice.

  • Microsoft Agent Framework workflows now survive process restarts via the Durable Task stack

    Wrap an Agent Framework Workflow in Microsoft.Agents.AI.DurableTask and each executor step is checkpointed. Crash, redeploy, restart - the run continues where it stopped.

  • How to Give a Copilot Agent Skill Access to Your Repo Conventions

    Turn the unwritten rules in your repo into a SKILL.md that GitHub Copilot loads on demand. Frontmatter, descriptions that route, file references, and how to verify it actually fires.

  • Microsoft Agent Framework gates risky tool calls behind FunctionApprovalRequestContent

    Wrap an AIFunction in ApprovalRequiredAIFunction and the agent stops mid-run to ask permission. Here is how the request and response flow works in C#.

  • How to Run a Semantic Kernel Plugin From a BackgroundService

    Wire a Microsoft.SemanticKernel 1.75.0 plugin into a hosted BackgroundService on .NET 11 and invoke its KernelFunctions on a PeriodicTimer schedule. Covers DI scopes, [KernelFunction] resolution, prompt-cache-friendly invocation, cancellation, and the lifetime gotchas that bite when you move a plugin off the request path.

  • Claude Code 2.1.128 Loads Plugins From .zip Archives and Stops Dropping Unpushed Commits

    Claude Code v2.1.128 (May 4, 2026) ships --plugin-dir support for .zip archives, makes EnterWorktree branch from local HEAD, and stops the CLI from leaking its own OTLP endpoint into Bash subprocesses.

  • How to Expose an EF Core Database to an AI Agent via MCP

    Wire an EF Core 10 DbContext into a Model Context Protocol server so Claude Code, Cursor, or any compliant client can run safe, scoped queries against your database. Covers IDbContextFactory lifetime, read-only projections, schema discovery tools, AsNoTracking, parameterised filters, row-level scoping, and the destructive-tool gates you need before letting an agent touch UPDATE.

  • Cursor Ships a TypeScript SDK That Turns Its Coding Agent Into a Library

    Cursor's new @cursor/sdk public beta exposes the same runtime, harness, and models that drive the desktop app, CLI, and web UI as a TypeScript package. You get sandboxed cloud VMs, subagents, hooks, MCP, and token-based pricing in a few lines of code.

  • Claude Code 2.1.126 Adds `claude project purge` to Wipe All State for a Repo

    Claude Code v2.1.126 ships claude project purge, a new CLI subcommand that deletes every transcript, task, file-history entry, and config block tied to a project path in a single shot. Includes --dry-run, --yes, --interactive, and --all.

  • How to Add Tool Calling to a Microsoft.Extensions.AI Chat Client

    Wire AIFunctionFactory.Create, ChatOptions.Tools, and ChatClientBuilder.UseFunctionInvocation in Microsoft.Extensions.AI 10.5 so an IChatClient can call your .NET methods automatically. Covers OpenAI and Azure OpenAI providers, the FunctionInvokingChatClient knobs that actually matter (iteration limits, concurrent calls, approval prompts, error handling), and streaming responses with tools.

  • Agent Governance Toolkit puts a YAML policy in front of every MCP tool call from .NET

    Microsoft's new Microsoft.AgentGovernance package wraps MCP tool calls with a policy kernel, a security scanner, and a response sanitizer. Here is what each piece does and how the wiring looks in C#.

  • How to Migrate a Semantic Kernel Plugin to an MCP Server

    Take an existing Semantic Kernel plugin with [KernelFunction] methods and turn it into a Model Context Protocol server other agents can call. Covers the drop-in WithTools(kernel) bridge, the native [McpServerTool] rewrite, parameter binding, dependency injection, and the gotchas that bite during the cutover.

  • How to Run Claude Code in a GitHub Action for Autonomous PR Review

    Wire up anthropics/claude-code-action@v1 so every pull request gets an autonomous Claude Code review with no @claude trigger. Includes the v1 YAML, claude_args for claude-sonnet-4-6 vs claude-opus-4-7, inline-comment tooling, path filters, REVIEW.md, and the choice between the self-hosted action and the managed Code Review research preview.

2026 / 04

2026 / 03

2026 / 01

All pillars Home