Blog

Make your to-do list do the work

Field notes on AI to-do lists, MCP, and getting coding agents to clear your tasks for you.

·7 min read

I run Claude and Codex on a loop on my Mac mini — they clear my to-do list while I sleep

How I set up Claude Code and OpenAI Codex to poll my Lume task list on a Mac mini, pick up anything I assign them, do the work, and hand it back for review.

Read →
·4 min read

Things 3 still has no AI. Here’s what I moved to.

Things 3 is still the most beautiful to-do app — and still has zero AI. After years of waiting, here’s why I switched and what I was looking for.

Read →
·5 min read

The best task managers with an MCP server in 2026

MCP lets your AI read and write your real tasks. Here are the task managers that actually support it — TickTick, Todoist (community), Linear, and Lume — and how they differ.

Read →
·5 min read

How to manage your to-do list from Claude

Connect your tasks to Claude over MCP so you can add, find, and reschedule to-dos from a conversation — instead of describing your list by hand every time.

Read →
·4 min read

Your to-do list should delegate, not just remind

Every productivity app helps you do tasks faster. The bigger unlock is not doing some of them yourself — handing scoped work to AI agents and keeping the review.

Read →
·4 min read

I trust AI to write my code, not my email

An engineer’s agent once sent a real reply by accident. The lesson isn’t “don’t use agents” — it’s that trust should scale with blast radius, and your task list should enforce it.

Read →
·5 min read

Why your agent’s to-do list keeps rotting

Markdown task lists bitrot because agents don’t update them. Steve Yegge’s fix — and why an AI to-do list needs to be a structured, queryable store, not a text file.

Read →
·5 min read

857 sessions on a 2015 MacBook: what overnight agents really need

One developer ran Claude Code 24/7 for six months on an old laptop. The intelligence was never the hard part — the queue, the handoff, and the guardrails were.

Read →
·4 min read

The $1,800 cron job: when “run it overnight” goes wrong

Stories of overnight agent runs racking up four-figure bills. The fix isn’t fear — it’s a bounded, reviewed queue instead of an open-ended loop with no off switch.

Read →
·4 min read

“Are you done?” “Yes.” (The build is broken.)

Agents reliably over-report success. Why a separate verification step — tests, a second model, or a human review gate — is the difference between “done” and actually done.

Read →
·4 min read

1,600 emails in one conversation — and the one a human had to catch

An engineer cleared 1,600 alerts with an AI agent in a day. The lesson isn’t “let it run” — it’s that agents do the mechanics and humans keep the judgment.

Read →
·4 min read

Loop engineering: the skill isn’t prompting anymore

The shift from writing clever prompts to designing self-correcting loops — and why every good loop needs a queue to draw from and a stop condition to end on.

Read →