Model Training
25 building blocks and models in the model training category.
Train/Test Split
Split data into training, validation, and test sets
Model Training
Train ML model with hyperparameter tuning
Hyperparameter Tuning
Grid search, random search, or Bayesian optimization
Cross-Validation
K-fold, stratified, or time-series cross-validation
Full Fine-tuning
Update all model parameters on task-specific data
LoRA
Low-Rank Adaptation - add small trainable matrices to attention layers
QLoRA
Quantized LoRA - 4-bit quantization + LoRA for memory efficiency
Adapter Layers
Insert small trainable modules between frozen transformer layers
Prefix Tuning
Learn continuous soft prompts prepended to each layer
Prompt Tuning
Learn task-specific prompt embeddings (input layer only)
IA³
Infused Adapter - learn rescaling vectors for activations
Instruction Tuning
Fine-tune on instruction-following datasets (e.g., Alpaca, ShareGPT)
RLHF
Reinforcement Learning from Human Feedback with reward model
Reward Modeling
Train reward model from human preference comparisons
DPO
Direct Preference Optimization - simplified RLHF without reward model
ORPO
Odds Ratio Preference Optimization - single-stage SFT + preference
Constitutional AI
Self-improvement via AI feedback based on constitutional principles
Feature Extraction
Freeze base model, train only the classification head
Domain Adaptation
Adapt pretrained model to a new domain (e.g., medical, legal)
Continued Pretraining
Further pretrain on domain-specific corpus before fine-tuning
Knowledge Distillation
Train smaller student model to mimic larger teacher model
Multi-Task Learning
Train on multiple tasks simultaneously with shared representations
Transfer Learning
Reuse pretrained model knowledge for new tasks via fine-tuning or feature extraction
Active Learning
Iteratively select most informative samples for labeling to minimize annotation cost
Model Quantization
Reduce model precision (FP32→INT8/INT4) for faster inference and smaller footprint