Model Training

25 building blocks and models in the model training category.

Train/Test Split

Split data into training, validation, and test sets

Model Training

Train ML model with hyperparameter tuning

Hyperparameter Tuning

Grid search, random search, or Bayesian optimization

Cross-Validation

K-fold, stratified, or time-series cross-validation

Full Fine-tuning

Update all model parameters on task-specific data

LoRA

Low-Rank Adaptation - add small trainable matrices to attention layers

QLoRA

Quantized LoRA - 4-bit quantization + LoRA for memory efficiency

Adapter Layers

Insert small trainable modules between frozen transformer layers

Prefix Tuning

Learn continuous soft prompts prepended to each layer

Prompt Tuning

Learn task-specific prompt embeddings (input layer only)

IA³

Infused Adapter - learn rescaling vectors for activations

Instruction Tuning

Fine-tune on instruction-following datasets (e.g., Alpaca, ShareGPT)

RLHF

Reinforcement Learning from Human Feedback with reward model

Reward Modeling

Train reward model from human preference comparisons

DPO

Direct Preference Optimization - simplified RLHF without reward model

ORPO

Odds Ratio Preference Optimization - single-stage SFT + preference

Constitutional AI

Self-improvement via AI feedback based on constitutional principles

Feature Extraction

Freeze base model, train only the classification head

Domain Adaptation

Adapt pretrained model to a new domain (e.g., medical, legal)

Continued Pretraining

Further pretrain on domain-specific corpus before fine-tuning

Knowledge Distillation

Train smaller student model to mimic larger teacher model

Multi-Task Learning

Train on multiple tasks simultaneously with shared representations

Transfer Learning

Reuse pretrained model knowledge for new tasks via fine-tuning or feature extraction

Active Learning

Iteratively select most informative samples for labeling to minimize annotation cost

Model Quantization

Reduce model precision (FP32→INT8/INT4) for faster inference and smaller footprint

ML System Design Reference · Built by QnA Lab