Documentation

How MACU Studio actually works

This is the deep dive - the reference behind the friendly guide. The guide shows you which buttons to press; these pages explain what happens when you do. They cover the render pipeline stage by stage, every model that gets loaded and where its weights sit on disk, where the prompts live and how to rewrite them, how to point Studio at your own models, how to put a Claude Code agent in the driver’s seat, and how an episode gets from your GPU to YouTube to this site.

The whole idea

A pipeline built to be driven by AI

MACU Studio is an unapologetically AI-native tool. The pictures are made by a diffusion video model. The voices are cloned and synthesized by a neural TTS. The subtitles are aligned by a speech-recognition model. The shot lists, the title-card copy, the sound-effect spotting, and the 48-language dubs are all proposed by a local large language model running on your own GPU. Even the show you make with it is, at its core, a stack of model outputs stitched together by a deterministic pipeline you can read top to bottom.

We built it this way on purpose, and we built it this way too - Studio itself was written hand-in-hand with Claude. So the most important page here isn’t the pipeline or the model list; it’s Connect an agent. Wiring a Claude Code agent into Studio turns “write the script, click eight times, fix the four things that came out wrong” into a conversation: “tighten the cold open, regenerate Ron’s second line slower, re-render the lemonade-stand b-roll, and ship it.” Everything in these docs - every prompt file, every config knob, every endpoint - is something an agent can read and operate on your behalf. That’s the payoff: the more of this you hand to the agent, the less of it you have to do yourself.

Everything here runs on your machine.

Studio is free and open source, and the whole pipeline runs locally on your own NVIDIA GPU - no per-render fees, no cloud TTS, no API key for the core workflow. The only things that ever leave your box are the ones you explicitly send out: a YouTube upload, or a publish to mayorawesome.com.

Start here

The reference, page by page

Install & updates

What you need, the one-command installer, running on boot, and the in-app self-updater.

The pipeline

The eight render stages, end to end - what each one does, what it reads, what it writes, and how caching keeps re-renders cheap.

Models

Every ML model Studio loads - voices, video, interpolation, transcription, the local LLM - where the weights live, and how to swap your own in.

Cloud & engines

Local by default - but route stills, video, or lipsync to Higgsfield in the cloud when you want bigger models or are running the light install. Engine routing, the character library, and billing-safe caching that never pays twice.

Prompts

Where the generation prompts live, the house style strings, the character bibles, and how to edit any of them.

Connect an agent

Studio is an MCP server - point any agent at /mcp and the whole pipeline becomes callable tools. Plus the chat tile, writers' room, and a live terminal. The single biggest efficiency unlock in the stack.

YouTube & publishing

Render → upload → caption → publish. The full workflow, plus the Google API setup an agent needs to do it for you.
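
To make "callable tools" in the Connect an agent entry a little more concrete: MCP messages are JSON-RPC 2.0, and a client discovers a server's tools with the `tools/list` method. The sketch below only builds that message; it does not talk to Studio. The `/mcp` path comes from the entry above, but the host and port in `STUDIO_BASE_URL` are placeholders, not documented values, and the helper names are made up for illustration. A real client also performs the MCP `initialize` handshake before listing tools, which an MCP client library or agent normally handles for you.

```python
import json

# Hypothetical base URL: host and port are placeholders, not documented values.
STUDIO_BASE_URL = "http://localhost:8000"
# The /mcp path is the one named in the "Connect an agent" entry.
MCP_ENDPOINT = STUDIO_BASE_URL + "/mcp"


def build_tools_list_request(request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 message asking an MCP server to list its tools."""
    return {"jsonrpc": "2.0", "id": request_id, "method": "tools/list"}


def encode(message: dict) -> bytes:
    """Serialize a message as UTF-8 JSON, ready to use as a request body."""
    return json.dumps(message).encode("utf-8")


if __name__ == "__main__":
    # Show where the message would go and what it looks like on the wire.
    print(MCP_ENDPOINT)
    print(encode(build_tools_list_request()).decode("utf-8"))
```

Which tools actually come back, and what arguments they take, is covered on the Connect an agent page rather than guessed at here.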

New to Studio entirely? Read the friendly guide first, or click through the live demo - a real, fully-rendered episode where every button works and nothing you do is saved.