RL Glossary
Contact
Home
  • What is this?
  • Training +
  • Data +
  • Rewards +
  • Optimization +
  • Agents +
  • Inference +
  • Evaluation +
  • RLOps +
  • GPU cost calculator
  • FAQ
Home
  • What is this?
  • Training +
  • Data +
  • Rewards +
  • Optimization +
  • Agents +
  • Inference +
  • Evaluation +
  • RLOps +
  • GPU cost calculator
  • FAQ
← Home Data

Data

What models learn from. Different training phases need different data.

Pretraining consumes raw text at massive scale. SFT needs structured input-output pairs. Models can also generate their own training data.

Talk to an RL expert
← Previous Reinforcement learning Next → Training data