Question 1

What is LLMProc and how does it differ from traditional agent frameworks?

Accepted Answer

LLMProc reframes LLM applications as Unix-like processes rather than agents. It provides powerful abstractions like fork() for parallel exploration, goto() for context management and time-travel debugging, and a file descriptor system for handling large tool outputs. This process-based approach makes scaling and managing LLM execution more intuitive using familiar computing paradigms.

Question 2

How does minLoRA achieve such parameter efficiency?

Accepted Answer

minLoRA implements Low-Rank Adaptation (LoRA) in just ~100 lines of code. It freezes pre-trained model weights and injects trainable low-rank decomposition matrices, dramatically reducing trainable parameters from millions to thousands. Built on PyTorch's native parametrization system, it works seamlessly with any torch.nn.Module without modifying model definitions.

Question 3

What makes Additive Rotary Embedding different from standard RoPE?

Accepted Answer

Additive Rotary Embedding (AddRoPE) modifies RoPE by making position encoding additive rather than multiplicative, with learnable weights and phase offsets. This allows models to selectively ignore certain frequencies and provides more natural attention patterns. In experiments, it matches or slightly outperforms RoPE while being computationally faster.

Question 4

What programming languages and frameworks do you primarily work with?

Accepted Answer

I primarily work with Python for machine learning projects, using frameworks like PyTorch, Transformers, and FastAPI. For web applications, I use modern JavaScript/TypeScript with tools like Bun and Tailwind CSS. I also have experience with shell scripting and Unix-based systems, which influences my approach to building tools like LLMProc.

Question 5

Are your projects open source and how can I contribute?

Accepted Answer

Yes, all my projects are open source and available on GitHub. I welcome contributions! Each repository has its own contribution guidelines, but generally I appreciate bug reports, feature suggestions, documentation improvements, and code contributions. Feel free to open issues or pull requests on any project that interests you.

Jonathan Chang

Projects

flex-nano-vllm

Agent-Environment Middleware

Additive Rotary Embedding

Claude Code System Prompts

LLMProc

Kodx

vFLUX

AI Shell

minLoRA

Voyager MCP

WikiMCP

LLMCP

Forking an AI Agent

Santa Hat AI

T5 FlexAttention

Multi-head Latent Attention

Mixture of Depths

Anim·E

Flex Diffusion

DDIM inversion notebook

Publications

vLLM from scratch with FlexAttention

Agent-Environment Middleware

LLMProc: Thinking in Processes, Not Agents

Maximizing PyTorch Throughput with FastAPI

Exploring the Effective Rank of Projection Weights in Attention

Additive Rotary Embedding

Timeline

2022 - 2024 · Taboola

2021-2022 · BigScience Project

2021-2022 · ASUS AICS

2020-2021 · NTU MiuLab

2020 · Google

2017-2021 · National Taiwan University

Jonathan Chang - Personal Website

Jonathan Chang

Projects

flex-nano-vllm

Agent-Environment Middleware

Additive Rotary Embedding

Claude Code System Prompts

LLMProc

Kodx

vFLUX

AI Shell

minLoRA

Voyager MCP

WikiMCP

LLMCP

Forking an AI Agent

Santa Hat AI

T5 FlexAttention

Multi-head Latent Attention

Mixture of Depths

Anim·E

Flex Diffusion

DDIM inversion notebook

Publications

vLLM from scratch with FlexAttention

Agent-Environment Middleware

LLMProc: Thinking in Processes, Not Agents

Maximizing PyTorch Throughput with FastAPI

Exploring the Effective Rank of Projection Weights in Attention

Additive Rotary Embedding

Timeline

2022 - 2024 · Taboola

2021-2022 · BigScience Project

2021-2022 · ASUS AICS

2020-2021 · NTU MiuLab

2020 · Google

2017-2021 · National Taiwan University

Frequently Asked Questions

What is LLMProc and how does it differ from traditional agent frameworks?

How does minLoRA achieve such parameter efficiency?

What makes Additive Rotary Embedding different from standard RoPE?

What programming languages and frameworks do you primarily work with?

Are your projects open source and how can I contribute?