Vibe Code LLMs

Extremely easy, fast, and low-cost, with complete privacy and safety. Build your own top-tier models with just a few clicks, a few minutes, and a few prompts.
New Tab
Project Discussion
Research Notes
I need to create a training dataset for a general-purpose AI assistant. Something with diverse knowledge across different topics.
Great choice! For a general-purpose model, you'll want diverse question-answer pairs covering everyday knowledge. Start with 200-500 high-quality examples across categories like science, history, practical how-tos, creative tasks, and general facts. What's your target audience?
General users, nothing too technical. Mix of factual questions, creative prompts, and practical advice. Should I use a specific format?
Keep it simple - alternating question/answer format works best. Mix short factual answers with longer explanations. For training, you can export as TXT, JSON, JSONL, or CSV depending on your framework. The model will learn the natural conversational pattern from your examples.
001 | [INFO] Loading Llama-2-7B base model...
002 | [INFO] Applying LoRA config: rank=8, alpha=16, dropout=0.1
003 | [INFO] Dataset loaded: "general-qa-v1" - 412 examples
004 | [WARN] Found 2 examples with token length > 1024, truncating...
005 | [INFO] Hyperparameters: lr=3e-5, batch_size=8, warmup=30 steps
006 | [INFO] Starting training run...
007 | [EPOCH 1/3] Step 25/52 - Loss: 2.341 | LR: 2.5e-5 | grad_norm: 0.68
008 | [EPOCH 1/3] Step 50/52 - Loss: 1.687 | LR: 3e-5 | grad_norm: 0.92
009 | [EPOCH 1/3] Complete - Avg Loss: 1.823 | Time: 3m 18s
010 | [EVAL] Val Loss: 1.645 | Perplexity: 5.18
011 | [EPOCH 2/3] Step 25/52 - Loss: 1.234 | LR: 3e-5 | grad_norm: 0.71
012 | [EPOCH 2/3] Step 50/52 - Loss: 0.956 | LR: 3e-5 | grad_norm: 0.83
013 | [EPOCH 2/3] Complete - Avg Loss: 1.089 | Time: 3m 15s
014 | [EVAL] Val Loss: 1.012 | Perplexity: 2.75
015 | [SUCCESS] Training converged successfully!
016 | [SUCCESS] Checkpoint saved: models/general-qa-lora-epoch2.safetensors
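
For readers who want to reproduce a run like this outside the app, here is a minimal sketch of an equivalent configuration using the Hugging Face transformers, peft, and datasets libraries. The base-model id and the dataset file name are illustrative assumptions, not the product's internals:

import torch
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

base = "meta-llama/Llama-2-7b-hf"  # illustrative Llama-2-7B checkpoint id
tokenizer = AutoTokenizer.from_pretrained(base)
tokenizer.pad_token = tokenizer.eos_token  # Llama has no pad token by default
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.bfloat16)

# LoRA config from the log: rank=8, alpha=16, dropout=0.1
model = get_peft_model(model, LoraConfig(
    r=8, lora_alpha=16, lora_dropout=0.1, task_type="CAUSAL_LM"))

# "general-qa-v1.jsonl" is a hypothetical JSONL export of the dataset
data = load_dataset("json", data_files="general-qa-v1.jsonl")["train"]
data = data.map(
    lambda ex: tokenizer(f"Q: {ex['question']}\nA: {ex['answer']}",
                         truncation=True, max_length=1024),  # matches the WARN line
    remove_columns=data.column_names)

# Hyperparameters from the log: lr=3e-5, batch_size=8, warmup=30 steps
args = TrainingArguments(output_dir="models", learning_rate=3e-5,
                         per_device_train_batch_size=8, warmup_steps=30,
                         num_train_epochs=3, logging_steps=25)
Trainer(model=model, args=args, train_dataset=data,
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False)).train()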

easily customize all specs

Device: RTX 4090
VRAM: 24 GB
RAM: 32 GB
CPU: 16 Cores
microsoft/phi-2

use ai to make ai


editor with rich integrations

training in minutes


no lagging


advanced ml methods
simplified training

fine-tuning

Full fine-tuning updates all model weights: the original weight matrix W is replaced by an updated W'.

LoRA

Low-Rank Adaptation: the base weights W stay frozen while a low-rank update ΔW = B × A is trained, where B is d×r, A is r×k, and ΔW is d×k, with rank r much smaller than d and k.
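
As a quick numerical illustration (a toy sketch with hypothetical shapes, not the app's internals), the parameter savings fall directly out of the factorization:

import numpy as np

d, k, r = 512, 512, 8              # adapter rank r << min(d, k)

W = np.random.randn(d, k) * 0.02   # frozen base weight, never updated
B = np.zeros((d, r))               # trainable, zero-initialized so ΔW starts at 0
A = np.random.randn(r, k) * 0.01   # trainable

delta_W = B @ A                    # low-rank update, shape d x k
W_adapted = W + delta_W            # effective weight used at inference

# Only B and A are trained: d*r + r*k parameters instead of d*k
print(f"full fine-tuning: {d*k:,} params, LoRA: {d*r + r*k:,} params")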

QLoRA

Quantized Low-Rank Adaptation: the base model weights W are quantized to 4-bit and kept frozen; only the small adapter matrices B and A are trained on top.
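
In the Hugging Face ecosystem this combination is typically expressed with bitsandbytes 4-bit quantization plus a peft adapter; a minimal sketch (the model id is an illustrative assumption):

import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Quantize the frozen base weights to 4-bit NF4, compute in bfloat16
bnb = BitsAndBytesConfig(load_in_4bit=True,
                         bnb_4bit_quant_type="nf4",
                         bnb_4bit_compute_dtype=torch.bfloat16)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", quantization_config=bnb)

# Only the small LoRA matrices B and A are trained on the 4-bit base
model = get_peft_model(model, LoraConfig(
    r=8, lora_alpha=16, lora_dropout=0.1, task_type="CAUSAL_LM"))
model.print_trainable_parameters()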

RAG

Retrieval-Augmented Generation (works with third-party LLMs as well): a query is matched against a knowledge base of documents, the retrieved context is combined with the query, and the LLM generates a response grounded in that context.
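
A toy sketch of that pipeline (embed() is a stand-in for a real sentence-embedding model, which you would need in practice since random vectors retrieve arbitrarily):

import numpy as np

def embed(text: str) -> np.ndarray:
    # Placeholder embedding: hash-seeded random unit vector.
    rng = np.random.default_rng(abs(hash(text)) % 2**32)
    v = rng.standard_normal(384)
    return v / np.linalg.norm(v)

documents = [
    "Paris is the capital of France.",
    "Photosynthesis converts sunlight into glucose.",
    "LoRA trains small low-rank adapter matrices.",
]
doc_vectors = np.stack([embed(d) for d in documents])

query = "What does LoRA train?"
scores = doc_vectors @ embed(query)          # cosine similarity (unit vectors)
context = documents[int(np.argmax(scores))]  # retrieve best-matching document

# The retrieved context is prepended to the query and sent to any LLM
prompt = f"Context: {context}\n\nQuestion: {query}\nAnswer:"
print(prompt)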

train in any format
automatic formatting with ai

JSON

Export your training data as structured JSON arrays with full metadata and nested properties for maximum flexibility.

JSONL

Export as JSON Lines format - one complete JSON object per line, ideal for streaming and processing large datasets efficiently.

CSV

Export as comma-separated values with headers - simple, lightweight, and universally compatible with spreadsheets and databases.

TXT

Export as plain text with custom delimiters - perfect for simple question-answer pairs and human-readable training data.

[
  {
    "question": "What is the capital of France?",
    "answer": "The capital of France is Paris, a major European city and a global center for art, fashion, gastronomy and culture.",
    "category": "geography",
    "difficulty": "easy",
    "source": "world_facts_v2"
  },
  {
    "question": "How does photosynthesis work?",
    "answer": "Photosynthesis is the process by which plants use sunlight, water, and carbon dioxide to create glucose and oxygen. This occurs primarily in the chloroplasts of plant cells.",
    "category": "science",
    "difficulty": "medium",
    "source": "biology_basics"
  },
  {
    "question": "Write a haiku about coding",
    "answer": "Code flows like water\nDebugging through the night\nCoffee keeps me alive",
    "category": "creative",
    "difficulty": "medium",
    "source": "creative_writing"
  },
  {
    "question": "What is machine learning?",
    "answer": "Machine learning is a subset of artificial intelligence that enables systems to learn and improve from experience without being explicitly programmed. It uses algorithms to parse data, learn from it, and make predictions.",
    "category": "technology",
    "difficulty": "medium",
    "source": "ai_fundamentals"
  },
  {
    "question": "Explain the theory of relativity in simple terms",
    "answer": "Einstein's theory of relativity shows that space and time are interwoven and relative to the observer. The faster you move through space, the slower you move through time. Massive objects also bend spacetime, creating what we experience as gravity.",
    "category": "science",
    "difficulty": "hard",
    "source": "physics_explained"
  },
  {
    "question": "What are the main differences between Python and JavaScript?",
    "answer": "Python is primarily used for backend development, data science, and automation, with indentation-based syntax. JavaScript is mainly for web development, running in browsers, with curly-brace syntax. Python is interpreted and strongly typed, while JavaScript is also interpreted but weakly typed.",
    "category": "programming",
    "difficulty": "medium",
    "source": "coding_comparisons"
  },
  {
    "question": "Describe the water cycle",
    "answer": "The water cycle is the continuous movement of water on, above, and below Earth's surface. It includes evaporation, condensation, precipitation, and collection. Water evaporates from surfaces, forms clouds, falls as rain or snow, and collects in bodies of water.",
    "category": "science",
    "difficulty": "easy",
    "source": "earth_science"
  },
  {
    "question": "What is blockchain technology?",
    "answer": "Blockchain is a distributed ledger technology that records transactions across multiple computers in a way that makes them nearly impossible to alter retroactively. Each block contains transaction data, a timestamp, and is cryptographically linked to the previous block.",
    "category": "technology",
    "difficulty": "hard",
    "source": "fintech_guide"
  }
]
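
Converting between the formats above is mechanical; a small sketch using only the Python standard library (the file names are illustrative):

import csv
import json

with open("dataset.json") as f:          # the JSON array shown above
    examples = json.load(f)

with open("dataset.jsonl", "w") as f:    # JSONL: one object per line
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

with open("dataset.csv", "w", newline="") as f:  # CSV with headers
    writer = csv.DictWriter(f, fieldnames=examples[0].keys())
    writer.writeheader()
    writer.writerows(examples)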

Strong memory, context and dataset handling

Hugging Face

Browse, download, use, and train any model from Hugging Face with your datasets using fine-tuning, LoRA, or QLoRA, or use the RAG system.

Hugging Face mockup
Anthropic

Select from the provided Claude models. Use your datasets with a strong RAG system for long-term memory and large context.

Anthropic mockup
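
With the official anthropic Python SDK, feeding retrieved context to a Claude model looks roughly like this (the model id and context string are illustrative):

import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
context = "Paris is the capital of France."  # e.g. retrieved by the RAG step

message = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=256,
    messages=[{"role": "user",
               "content": f"Context: {context}\n\nQuestion: What is the capital of France?"}],
)
print(message.content[0].text)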
Ollama

Connect to and use any model downloaded through Ollama. Use your datasets with a strong RAG system for long-term memory and large context.

Ollama mockup
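
Ollama exposes a local HTTP API on port 11434 by default; a minimal sketch that assumes the llama2 model has already been pulled:

import json
import urllib.request

payload = {"model": "llama2",
           "prompt": "Why is the sky blue?",
           "stream": False}
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])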

Write datasets in an editor with rich integrations: Google services, files, URLs, browser, audio, and screen context.

Upload Any File
Upload sheets, docs, PDFs, images, code, or any other file type, even zips.
pitch.pdf
picnic.png
math.zip
presentation.pptx
budget.xlsx
notes.txt
README.md
app.js
styles.css
main.py
demo.mp4
contract.docx
Browser Tabs & URLs
Attach open browser tabs, URLs, and YouTube links as context.
aquin.app
github.com
claude.ai
youtube.com
Record Audio
Record your mic and your system audio for speech-to-text.

System, Browser & Google Connections

File Manager MCP

Let Aquin fully manage file paths and use them as context for datasets or RAG.

File Manager MCP mockup

Clipboard

Use your clipboard history from your system as context for datasets or RAG.

Clipboard mockup

Browser History

Use the browsing history from your connected browser as context for datasets or RAG.

Browser History mockup
Gmail

Use emails from your connected Gmail account as context for datasets or RAG.

Gmail mockup
Google Calendar

Use your events from your connected calendar as context for datasets or RAG.

Google Calendar mockup
Google Drive

Use imported files from your connected Google Drive as context for datasets or RAG.

Google Drive mockup

Not sure if Aquin is right for you?