All Projects

2025

Claude Daemon

Tutorial, 2025

Running Claude Code as an isolated macOS user with browser access

Visual acuity test chart with letters of decreasing size for AI model evaluation

Visual Acuity Test for LLMs

Research, 2025

Evaluating AI models' ability to read fine details in high-resolution images

Performance comparison chart showing tokens per second between flex-nano-vllm and vLLM

flex-nano-vllm

Code, 2025

A minimal vLLM inference engine built from scratch using PyTorch FlexAttention

MIST robot prototype showing expressive face

Project MIST

Experiment, 2025

Building the MIST robot prototype with AI-assisted design

Agent architecture diagram showing LLM and Environment interaction loop with middleware

Agent-Environment Middleware

Research, 2025

A plugin architecture for composing LLM agents with middleware patterns

Voyager MCP

Tool, 2025

A Model Context Protocol (MCP) server for Claude Code integration

Claude Code System Prompts

Web App, 2025

A website that shows Claude Code's system prompts across different versions

WikiMCP

Tool, 2025

An MCP server that lets Claude explore random Wikipedia pages

LLMCP

Tool, 2025

A minimal MCP server that lets an LLM query other LLMs via LiteLLM

LLMProc

Experiment, 2025

A Unix-inspired framework for building robust, scalable LLM applications

Forking an AI Agent

Experiment, 2025

An MVP exploring the fork() pattern for AI agents

Kodx

Tool, 2025

Docker-based AI coding agent inspired by GitHub Copilot Codex

2024

Santa Hat AI

Web App, 2024

A web app that uses MediaPipe face detection to automatically place festive Santa hats on profile pictures

vFLUX

Code, 2024

An optimized FLUX model inference engine

T5 FlexAttention

Code, 2024

T5 model optimized with FlexAttention

AI Shell

Tool, 2024

A transparent shell wrapper for building context-aware AI tools

Visualization of Additive Rotary Embedding technique for position encoding in LLMs

Additive Rotary Embedding

Research, 2024

A competitive variant of rotary position embedding (RoPE)

Multi-head Latent Attention

Implementation, 2024

I implemented Multi-head Latent Attention from DeepSeek-V2

Mixture of Depths

Implementation, 2024

I implemented Mixture of Depths from Google DeepMind's paper

2023

minLoRA

Code, 2023

A minimal library for LoRA (200 LoC!), supports any model in PyTorch

Flex Diffusion

Model, 2023

I fine-tuned Stable Diffusion 2 for dynamic aspect ratio generation

2022

DDIM inversion notebook

Demo, 2022

My popular notebook demonstrating DDIM inversion using Stable Diffusion

Anim·E

Model, 2022

A state-of-the-art anime image generator at the time of its release, predating fine-tuned Stable Diffusion models
