AI Efficiency Toolbox

Local Lab

Experiments, tool comparisons, speed notes, prompt tests, and useful failures.

Latest public items

6 public items

A Rapid-MLX benchmark using the same Qwen3.6 35B A3B model path normally loaded in LM Studio, compared with prior LM Studio and oMLX results.

A practical LM Studio long-context benchmark showing how time to first token changed as context length grew from 2K to 49K tokens on a 35B local model.

A practical benchmark of LM Studio and oMLX on a MacBook Pro running a 35B local model, including sustained generation and prefix-cache behavior.

Prompt template failure log recorded as a reusable lab note.

MacBook Air local model baseline recorded as a reusable lab note.

A repeatable lab format for testing whether small local models are fast enough.