designing intelligence
Full-stack AI observability: trace training data provenance, inspect model weights to find where specific behaviors and knowledge are stored, and edit them directly — no fine-tuning or retraining.
attribution
Every response token traces back to the prompt tokens that caused it. Watch the signal flow through each layer until the answer locks in.
At L1 the model guesses "the". By L8 it's converging on "city". At L16, Paris is locked at 97% — the exact moment the answer forms.
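The layer-by-layer convergence described above can be sketched in the style of a logit lens. This is a minimal illustration, not Aquin's actual API: `lock_in_layer` and the probability trajectory are hypothetical, standing in for the per-layer probability a real model assigns the answer token.

```python
# Hypothetical sketch of layer-wise answer convergence (logit-lens style).
# The function name and numbers are illustrative, not Aquin's real pipeline.

def lock_in_layer(layer_probs, threshold=0.9):
    """Return the first layer index at which the target token's probability
    reaches `threshold` and stays there for every remaining layer."""
    for i in range(len(layer_probs)):
        if all(p >= threshold for p in layer_probs[i:]):
            return i
    return None

# Toy trajectory: "Paris" slowly wins the logit race across 17 layers.
probs = [0.01, 0.02, 0.03, 0.05, 0.08, 0.10, 0.14, 0.20,
         0.35, 0.45, 0.55, 0.62, 0.70, 0.80, 0.88, 0.93, 0.97]
print(lock_in_layer(probs, threshold=0.9))  # → 15
```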
training inspection
Watch a LoRA fine-tune live. Loss, gradients, weight norms, dead layers — all streamed step by step. When training ends, see exactly how the model changed.
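The per-step telemetry above — loss, gradient norms, weight norms, dead-layer detection — amounts to one record per optimizer step. A minimal sketch, assuming flat lists of gradients and weights (`step_metrics` is hypothetical, not Aquin's streaming format):

```python
import math

def step_metrics(step, loss, grads, weights):
    """One telemetry record per optimizer step: loss, global gradient norm,
    weight norm, and a dead-layer flag (near-zero gradient signal)."""
    grad_norm = math.sqrt(sum(g * g for g in grads))
    weight_norm = math.sqrt(sum(w * w for w in weights))
    return {
        "step": step,
        "loss": loss,
        "grad_norm": round(grad_norm, 4),
        "weight_norm": round(weight_norm, 4),
        "dead": grad_norm < 1e-6,  # layer receives no learning signal
    }

print(step_metrics(1, 2.31, [0.3, -0.4], [1.0, 2.0]))
```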
human readability
Model internals are not inherently unreadable. Every activation, weight, and layer state is translated into language — with examples showing exactly when each feature fires.
| weight | raw value | label |
|---|---|---|
| L14 · MLP W_out [2048,11] | 0.847 | capital city associations |
| L8 · attn head 3 · V | -0.312 | geographic suppression |
| L12 · MLP W_in [512,2048] | 0.601 | factual recall trigger |
| L6 · attn head 7 · Q | 0.229 | question parsing |
factual checks
Most models ship as black boxes. You have no way to know what they learned to suppress, amplify, or distort. Aquin surfaces it.
Trace which features consistently skew outputs along political, demographic, or cultural lines. See the weight, not just the symptom.
Find what the model refuses to say and why. Identify suppression circuits. See whether refusals are weight-level decisions or surface-level RLHF patches.
benchmarks
Three metrics for every SAE feature. InterpScore, FeaturePurityScore, and MUI together tell you whether a feature is interpretable, monosemantic, and causally load-bearing.
Does the label predict where the feature fires?
Does it encode one concept or several?
Does ablating it actually change the output?
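The third question — causal load — reduces to a before/after comparison. A toy sketch of the idea, with hypothetical numbers (not InterpScore, FeaturePurityScore, or MUI themselves):

```python
def ablation_effect(p_with, p_without):
    """Drop in target-token probability when a feature is zeroed out.
    A larger drop means the feature is more causally load-bearing."""
    return round(p_with - p_without, 4)

# Hypothetical: P("Paris") with the feature active vs. ablated.
print(ablation_effect(0.97, 0.12))  # → 0.85
```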
evals
Three behavioral evals — no SAE required. Consistency measures output stability across phrasings. Suppression detects topic softening. Boundary probes how much the model actually knows vs pattern-matches.
Consistency: 7 paraphrase templates. Same output distribution = robust knowledge.
Suppression: length + hedging density vs. a neutral baseline. Medical dosage: 0.71.
Boundary: 4 prompt corruptions. Confidence drop distinguishes knowledge from pattern-matching.
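The consistency eval's core idea can be sketched in a few lines: run the same question through every paraphrase template and score agreement with the modal answer. `consistency_score` and the toy answers are illustrative, not the product's scoring function:

```python
from collections import Counter

def consistency_score(answers):
    """Fraction of paraphrase runs agreeing with the modal answer.
    1.0 = identical answers across all phrasings (robust knowledge)."""
    top_count = Counter(answers).most_common(1)[0][1]
    return top_count / len(answers)

# Hypothetical answers from 7 paraphrases of the same question.
runs = ["Paris", "Paris", "Paris", "Paris", "Paris", "Paris", "Lyon"]
print(round(consistency_score(runs), 3))  # → 0.857
```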
agentic system
An autonomous interpretability copilot. Tell it what you want to understand. It runs the full pipeline, chains tools, and explains what the UI is showing — in real time.
data inspection
Load any HuggingFace dataset or upload a CSV, JSONL, or Parquet file. The system runs toxicity, PII, synthetic detection, provenance chains, and bias analysis — down to specific rows.
367 rows flagged across 3 columns
row-level scoring · verdict: mixed
2 deep chains · avg liability 0.61
759 entities · overall risk: critical
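Row-level flagging like the above boils down to scanning each row against a battery of detectors. A deliberately tiny sketch using one email regex as a stand-in for the full PII pass (the function and data are hypothetical):

```python
import re

EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")

def flag_pii_rows(rows):
    """Return indices of rows containing an email-like string —
    a toy stand-in for a full PII detection pass."""
    return [i for i, row in enumerate(rows)
            if any(EMAIL.search(str(cell)) for cell in row)]

data = [("alice", "alice@example.com"), ("bob", "n/a"), ("eve", "eve@corp.io")]
print(flag_pii_rows(data))  # → [0, 2]
```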
read the methodology
weight editing
Locate the exact MLP layer encoding a fact. Overwrite it with a rank-one update. No retraining. We're building the editor — this is the live experiment.
L12 carries 90.4% of the causal recovery signal. Red rings = above the 40% threshold.
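A rank-one update is the mathematical core of this kind of edit: add an outer product u vᵀ to the located weight matrix, rewriting one key→value association while leaving the rest of the matrix untouched. A minimal sketch on a 2×2 matrix — the vectors are hypothetical, not a real fact edit:

```python
def rank_one_edit(W, u, v):
    """Apply W' = W + u v^T: a rank-one update that rewrites one
    key->value association without retraining the matrix."""
    return [[W[i][j] + u[i] * v[j] for j in range(len(v))]
            for i in range(len(u))]

W = [[1.0, 0.0], [0.0, 1.0]]
u = [0.5, -0.5]   # write direction (hypothetical)
v = [2.0, 0.0]    # key direction (hypothetical)
print(rank_one_edit(W, u, v))  # → [[2.0, 0.0], [-1.0, 1.0]]
```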
Not sure if Aquin is right for you?
Aquin
