All work
13Production (Mac mini)

SDCCD Captioning System

Whisper hears it, a local LLM polishes it, a pattern-flagger catches the names — shipped as a real Mac app.

  • FastAPI
  • whisper.cpp
  • Ollama/Qwen
  • ffmpeg
  • SQLite
  • macOS .app
FIG.13waveform

A production captioning service: whisper.cpp transcription, a local LLM polish pass, and a four-layer vocabulary pipeline for names and acronyms — packaged as a signed macOS app and used in production on a Mac mini.

Role
Architect & engineer
Status
Production (Mac mini)
Access
Private — architecture & status only
Problem

Accurate captioning of institutional media — especially names and acronyms — is where automatic transcription fails. The institution needed reliable captions with a tight human-review loop, running locally.

Architecture

whisper.cpp (large-v3-turbo) handles transcription, Ollama/Qwen2.5 polishes it, and ffmpeg manages media. A four-layer vocabulary brain — a Whisper hint window, an LLM polish, a pattern flagger, and human review — catches the hard names. Captions render to the DCMP standard. The whole service is FastAPI packaged as a signed macOS app with closed-onboarding auth.

Role

Architect and engineer — the transcription pipeline, the vocabulary brain, the caption renderer, and the macOS packaging.

Outcome

Running in production on a Mac mini. (Private system — architecture and status only.)

What it took

Technical proof.

  • whisper.cpp large-v3-turbo + Ollama/Qwen2.5 + ffmpeg transcription pipeline.
  • 4-layer vocab brain: hint window → LLM polish → pattern flagger → human review.
  • DCMP-standard caption rendering.
  • FastAPI packaged as a signed macOS .app with closed-onboarding auth.
Private system — no live link. Architecture and status only; never data, positions, or credentials.Back to all work