Simulation: Embedding
Contrastive training forecast for embedding models. Audits pair/triplet geometry, predicts contrastive loss trajectory, and flags hard negatives without updating weights. Requires embedding mode.
7 commands
aquin embed-pairs-generate
agent tool: embed_pairs_generate
Generates contrastive training data for a topic: pairs (anchor/positive), triplets (anchor/positive/negative), or SimCSE-style duplicates. Writes JSON to cwd.
| Flag | Description |
|---|---|
| --topic* | Subject for generated examples. |
| --mode | pairs, triplet, or simcse (default: pairs). |
| --count | Number of examples (default: 8). |
| --output | Output filename. |
aquin embed-dataset-analyze
agent tool: analyze_embed_training_dataset
Pair quality report for contrastive data: margin violations, duplicate anchors, negative hardness, and mode-specific checks (triplet margin, SimCSE duplicate ratio).
| Flag | Description |
|---|---|
| --pairs | Path to pairs/triplets JSON. |
| --mode | pairs, triplet, or simcse. |
| --margin / --temperature / --batch_size | Training hyperparameters for audit thresholds. |
aquin embed-simulate
agent tool: run_embed_simulation
Contrastive fine-tuning forecast: geometry audit, predicted loss trajectory, hard-negative analysis. Pass pairs file + config, or a full payload JSON. No weight updates.
| Flag | Description |
|---|---|
| --pairs | Path to pairs/triplets JSON. |
| --config | YAML/JSON with mode, lr, epochs, margin, temperature. |
| --payload | Full simulate request JSON (alternative to pairs+flags). |
| --mode | pairs, triplet, or simcse. |
| --lr / --epochs / --batch_size / --margin / --temperature | Inline hyperparameters. |
Use --payload for pre-built JSON fixtures (e.g. simulate-triplets.gte-small.json).
aquin list simulations
agent tool: list_simulation_runs
Lists saved embed simulation runs (same command as LLM, filtered by mode on disk).
aquin load simulation
agent tool: load_simulation_run
Load a saved embed simulation by run ID.
| Flag | Description |
|---|---|
| --run_id* | Run ID. |
aquin compare simulation
agent tool: compare_simulations
Compare two embed simulation runs: loss trajectory, geometry metrics, triplet margin satisfaction.
| Flag | Description |
|---|---|
| --run_id_a* | First run. |
| --run_id_b* | Second run. |
aquin embed-train
agent tool: embed_train_start
Start real contrastive fine-tuning on the loaded embedding model. Coming soon; simulation forecast runs first via embed-simulate.
Not yet implemented in CLI. Use embed-simulate for analytical forecast.
