Tag: llm-as-judge
All the articles with the tag "llm-as-judge".
-
Streaming Journals: Kafka Meets LLMs
The automated journal entries were fiction. Every single one. Here's how a broken pipeline got replaced in 10 days — Kafka, vLLM workers, nine code iterations, a context preservation fix, and an LLM-as-Judge quality gate — all tracked through git commits and vault evidence.
-
From Interview to Implementation: Perfect 100/100 Anthropic Alignment
BT SRE interview Oct 17th morning (strong networking/Linux, exposed K8s gaps), followed by 5.5 hours implementing Anthropic patterns after work. Achieved PERFECT 100/100 alignment (66→94→100) via parallel execution, enhanced MCP docs, LLM-as-judge, bash tools, subagent pattern, resumable execution. Skills migration Day 5-7 complete (103 tests, 99.4% token reduction). The honest gap - AI-assisted execution masked foundational knowledge. Claude Code rebuilt K3s cluster, but I couldn't explain control plane in interview. Manual rebuild plan to close the gap.