Aquin LogoAquinLabs
Login

Simulation: Embedding

Contrastive training forecast for embedding models. Audits pair/triplet geometry, predicts contrastive loss trajectory, and flags hard negatives without updating weights. Requires embedding mode.

Prerequisiteaquin load --model gte-small

7 commands

aquin embed-pairs-generate

agent tool: embed_pairs_generate

Generates contrastive training data for a topic: pairs (anchor/positive), triplets (anchor/positive/negative), or SimCSE-style duplicates. Writes JSON to cwd.

FlagDescription
--topic*Subject for generated examples.
--modepairs, triplet, or simcse (default: pairs).
--countNumber of examples (default: 8).
--outputOutput filename.
example

aquin embed-dataset-analyze

agent tool: analyze_embed_training_dataset

Pair quality report for contrastive data: margin violations, duplicate anchors, negative hardness, and mode-specific checks (triplet margin, SimCSE duplicate ratio).

FlagDescription
--pairsPath to pairs/triplets JSON.
--modepairs, triplet, or simcse.
--margin / --temperature / --batch_sizeTraining hyperparameters for audit thresholds.
example

aquin embed-simulate

agent tool: run_embed_simulation

Contrastive fine-tuning forecast: geometry audit, predicted loss trajectory, hard-negative analysis. Pass pairs file + config, or a full payload JSON. No weight updates.

FlagDescription
--pairsPath to pairs/triplets JSON.
--configYAML/JSON with mode, lr, epochs, margin, temperature.
--payloadFull simulate request JSON (alternative to pairs+flags).
--modepairs, triplet, or simcse.
--lr / --epochs / --batch_size / --margin / --temperatureInline hyperparameters.
example

Use --payload for pre-built JSON fixtures (e.g. simulate-triplets.gte-small.json).

aquin list simulations

agent tool: list_simulation_runs

Lists saved embed simulation runs (same command as LLM, filtered by mode on disk).

example

aquin load simulation

agent tool: load_simulation_run

Load a saved embed simulation by run ID.

FlagDescription
--run_id*Run ID.
example

aquin compare simulation

agent tool: compare_simulations

Compare two embed simulation runs: loss trajectory, geometry metrics, triplet margin satisfaction.

FlagDescription
--run_id_a*First run.
--run_id_b*Second run.
example

aquin embed-train

agent tool: embed_train_start

Start real contrastive fine-tuning on the loaded embedding model. Coming soon; simulation forecast runs first via embed-simulate.

example

Not yet implemented in CLI. Use embed-simulate for analytical forecast.