AI UNDERDOGSDAILY PICK
AI UNDERDOGS
别信 AI 说「完成」
Stop trusting AI's 'done'
LeoStehlik/proof-loop
你让 AI 写完代码,它说搞定了
但你真敢直接跑吗?
Proof Loop 的思路很简单——不信任
只验证
Your coding agent says 'done.' But do you
actually trust that?
Proof Loop flips the script: don't trust, verify
验证者 ≠ 建造者
Verifier ≠ Builder
核心改动:写代码的 AI 和检查代码的 AI
不能是同一个
Proof Loop 给每个任务配一个独立验证者角色
专门挑毛病
这直接瞄准了 AI agent 最大的软肋——自己给自己验收
The key move: the agent that writes code
and the agent that verifies it are two
separate roles. Proof Loop pairs every task with
a dedicated verifier. This hits AI agents' core
weakness — self-verification
★ SIGNAL 1
不是嘴上说说
Not just vibes
每个任务都定义了明确的验收标准
agent 不能光说「我写完了」
得交出 proof artifacts——结构化的证据
你能直接翻看验证
不是 prompt 工程的话术
是实实在在的东西
Every task has explicit acceptance criteria. The agent
can't just wave its hands — it must
produce proof artifacts: structured evidence you can actually
inspect. Not prompt-engineered hand-waving, real deliverables
★ SIGNAL 2
验收标准写进协议
Criteria baked into protocol
很多人做 agent 框架
验证那一步就是个摆设
Proof Loop 不一样——验收标准是任务定义的一部分
验证者拿到的是原封不动的 criteria
不是二次翻译的 prompt
这才是真正能跑起来的闭环
Most agent frameworks treat verification as a checkbox
Here, acceptance criteria are part of the task
definition itself — the verifier gets the original
criteria, not a re-prompted summary. That's how you
build a loop that actually closes
作者原话说得很直白:「I make my coding agents prove they finished the task
」没有吹什么革命性框架
就是从自己被坑的经历出发
做了一个朴素但有效的机制
The author's own words: 'I make my coding
agents prove they finished the task.' No revolutionary
claims — just a straightforward mechanism born from
getting burned one too many times
AI UNDERDOGS
让它证明给你看
Make it prove it
LeoStehlik/proof-loop
关注 · 每天发现更多 AI 神作
github.com/LeoStehlik/proof-loop