Simulation: LLMs

Analytical training forecast for language models. Runs SAE gradient decomposition, LiSSA influence scoring, NTK-linearised weight prediction, and dataset quality analysis without real optimizer steps. Produces a synthetic checkpoint you can diff and inspect. After simulate saves a run under ~/.aquin/runs/, use list simulation to find IDs, replay simulation to reopen one result on the web, and compare simulation to diff two runs (loss, SAE features, influence, attack-surface). Requires LLM mode.

Prerequisiteaquin login · aquin load model gpt2-small

5 commands

aquin check dataset

agent tool: analyze_training_dataset

Dataset quality report without loading the model: harmful content scan, AI-fingerprint detection (assistant-speak phrases), length distribution, instruction diversity, and sequence-length violations. Run this before committing GPU time to simulation.

Flag	Description
--dataset	Path to .jsonl / .json / .csv file (default: most recent in cwd).

example

aquin simulate

agent tool: run_simulation

Full LLM simulation pipeline: Pass 0 dataset quality, Pass 1 SAE baseline activations, Pass 2 gradient landscape + SAE gradient decomposition, Pass 2b LiSSA influence, Pass 3 NTK-linearised weight delta. Saves a synthetic checkpoint locally. Takes 2–10 minutes on larger models.

Flag	Description
--dataset	Path to training dataset (.jsonl).
--algo	Path to LoRA/training config YAML.
--topic	Quick probe topic when no dataset yet (auto-generates).
--rank / --lr / --epochs	LoRA hyperparameters (override algo file).
--use_rlhf / --rlhf_beta	Enable RLHF simulation pass.

example

Saved to ~/.aquin/runs/<id>/. End of run includes SAE diff (base vs synthetic checkpoint) in stream and result. For real checkpoint diff after training, see Checkpoint SAE (/docs/checkpoint-sae). For external training metrics, see Training watch (/docs/watch).

aquin list simulation

agent tool: list_simulation_runs

Lists all saved simulation run IDs on local disk with metadata (topic, model, timestamp).

example

Run IDs are printed at the end of aquin simulate and stored under ~/.aquin/runs/<id>/. Legacy alias: aquin list-runs.

aquin replay simulation

agent tool: load_simulation_run

Reopens one saved simulation run by ID and renders the full result card on the web.

Flag	Description
--run_id*	Run ID from list simulation.

example

Legacy: aquin load simulation / aquin load-run.

aquin compare simulation

agent tool: compare_simulations

Side-by-side comparison of two saved runs: predicted loss delta, SAE feature diffs, influence score diffs, and attack-surface metrics (consistency, suppression, robustness) from model diff.

Flag	Description
--run_id_a*	First run (before).
--run_id_b*	Second run (after).
--label_a / --label_b	Display labels for the comparison table.

example

Legacy alias: aquin compare-runs. Cannot compare LLM runs with embedding runs.