Project history

Studio and AI products

blogeval

An AI eval that grades how well AI systems generate blog posts.

Current studio workNo demo listedDeveloper toolAI evaluationLLM evalsAI gradingAutomation

Writeup

What this project explored

An AI eval that grades how well AI systems generate blog posts.

This sits in the studio/product side of the portfolio: practical AI systems, interfaces, and tools aimed at turning rough ideas into usable workflows.

The project is part of a broader pattern in Kyle's work: build the smallest useful version, learn from the behavior of the system, then use that prototype to decide whether the idea deserves more time, polish, or a totally different direction.