Home | Mitchell's Log

March 11, 2026 Poem Experiment Log: Prompt Framing for Divergent Thinking

Poem experiment on prompt framing with best-of-3 quality bars, rubric hit rates, and word-level behavior across 90 generations.

experiments writing poetry prompt-design slm evaluation

February 27, 2026 Writing Experiment Rerun: gemma3:1b with 3 Samples per Story

Comparison of 2-sample vs 3-sample runs, with focus on all-bad story rate versus at-least-one fair/good story rate.

experiments writing storytelling slm evaluation

February 25, 2026 Writing Experiment Rerun: gemma3:1b at 82.0% Fair+Good

Updated gemma3:1b writing experiment with 100 outputs, 82.0% fair+good, case-level usability rates, and the full rerun quality rubric.

experiments writing storytelling slm evaluation

February 23, 2026 Writing Experiment: gemma3:1b vs qwen3:1.7b Quality Mix

Head-to-head writing quality distribution with stacked percentage columns and request-compliance summary.

experiments writing storytelling slm evaluation

February 22, 2026 Paper Notes: Narrative Reversals for Voice-Stable Context Retrieval

Notes on narrative reversals, plus a design for reversal-aware segmentation and retrieval that is optimized for maintaining voice and pacing during drafting.

context-compiler retrieval writing storytelling

February 21, 2026 REALM Combined Loop: Multi-Size Results Toward Context Compiler

Combined Read + Edit/Analyze experiment showing strong multi-size token behavior for the context compiler path.

experiments context-curation context-compiler slm

February 18, 2026 Context Compiling: ctxc and vectorless builds

I’m building a ‘context compiler’ that walks large docs like a book and emits a tight, testable context packet—without embeddings—so even tiny models can rel...

architecture context-compiler retrieval writing

February 4, 2026 REALM Continuation: Local SLM Evaluation with gemma3:1b

Continuation experiment evaluating gemma3:1b as a local SLM for REALM context curation.

experiments context-curation slm

February 3, 2026 Context Curation: Preliminary REALM Tests

First experiment write-up validating baseline navigation behavior and context scaling.

experiments context-curation

February 1, 2026 REALM: Read Edit/Analyze Loop Monitor

A modular architecture that curates context from large documents at low cost, with a path to small on-device models.

architecture context-curation

Recent Entries