Simula Research Validation

This repository is for validating a Simula-style synthetic data framework before production integration. The goal is to reproduce the core mechanism-design ideas from the Simula research line: treat dataset construction as a controllable system across independent axes of coverage, complexity, and quality.

Scope

This phase is research-first and validation-focused.

Build and evaluate the generation mechanism, not a full production platform.
Verify that decomposition into independent control axes produces measurable gains.
Establish reproducible experimentation and decision gates for promotion.

System overview

The validation pipeline is designed around four generation stages and one evaluation stage:

Global diversification: build hierarchical taxonomies to map domain coverage.
Local diversification: produce diverse instantiations within each taxonomy concept.
Complexification: raise difficulty for a controlled fraction of samples.
Dual-critic quality checks: independently verify correctness and reject low-quality samples.
Evaluation: compute coverage, complexity calibration, and quality metrics for run decisions.

flowchart TD
  domainObjective[DomainObjective] --> globalDiversification[GlobalDiversificationTaxonomy]
  globalDiversification --> localDiversification[LocalDiversificationMetaPrompts]
  localDiversification --> complexification[ComplexificationStage]
  complexification --> dualCritic[DualCriticQualityChecks]
  dualCritic --> curatedDataset[CuratedSyntheticDataset]
  curatedDataset --> metricsEval[CoverageComplexityQualityEvaluation]
  metricsEval --> validationDecision[ValidationGateDecision]
  validationDecision --> iterationLoop[IterationOrPromotion]

Documentation map

Use the structured docs index first:

docs/README.md

Domain language anchors:

First validation quickstart

The exact scripts and commands will be added as code lands. Use this staged flow for the first end-to-end validation cycle:

Define target domain and taxonomy depth/branching policy.
Generate taxonomy and inspect node coverage map.
Generate local instantiations from taxonomy nodes.
Apply complexification policy to the configured sample fraction.
Run dual-critic checks and regenerate rejected samples.
Compute evaluation metrics and compare against baseline/ablations.
Fill run report template and decide: iterate or promote.

Definition of done for initial validation

Initial validation is complete when all of the following are true:

Coverage, complexity, and quality are each measurable with explicit metrics.
At least one full baseline and one ablation matrix are executed and reported.
Run artifacts are reproducible from stored config, seed, and model metadata.
A validation gate decision is made with documented evidence and trade-offs.

Source guide

The primary research reference for this repository is:

docs/research_paper.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
.cursor/rules		.cursor/rules
artifacts		artifacts
contexts		contexts
docs		docs
scripts		scripts
src/simula_research		src/simula_research
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CONTEXT-MAP.md		CONTEXT-MAP.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simula Research Validation

Scope

System overview

Documentation map

First validation quickstart

Definition of done for initial validation

Source guide

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Simula Research Validation

Scope

System overview

Documentation map

First validation quickstart

Definition of done for initial validation

Source guide

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages