Training Recipes

v0.77 adds src/closed_world_lm/training_recipe.py and two required run artifacts for self-improvement answer cycles and transformer answer-training runs:

training_recipe.json
constraint_first_promotion.json

The recipe records how to reproduce a run. The constraint-first report records whether a run is even allowed to consider quality metrics for promotion.

Recipe Contents

training_recipe.json records:

recipe id and version;
component and run id;
model configuration;
tokenizer provenance;
data sources;
objective settings;
optimizer settings;
replay status;
planned artifacts;
required gates;
rerun surface.

This keeps recipe identity out of hidden argparse memory. A future screen should be reconstructable from the recipe artifact and the admitted project state.

Constraint-First Promotion

constraint_first_promotion.json separates constraints from quality checks.

Constraints must pass before quality metrics are considered. For transformer runs, those constraints include closed-world verifier approval, closed-world training data, no pretrained weights, no pretrained tokenizer, no external embeddings, direct-answer evidence, branch-context coverage, branch diversity, and target-coverage preservation.

Only after those constraints pass can exact direct-answer quality checks affect promotion. Loss, NLL, rank, and top-k movement remain advisory until the constraint report says quality metrics are eligible.

Status

Self-improvement runs can still promote through their existing exact-eval discipline, now with a constraint-first report attached. Transformer runs now have a promotion gate, but current evidence is still expected to reject until the model satisfies the branch-context, diversity, coverage, and exact-answer quality checks.

Recipe Contents​

Constraint-First Promotion​

Status​

Recipe Contents

Constraint-First Promotion

Status