LLM Fine-Tuning Engineer Jobs & Internships 2026
LLM fine-tuning engineers adapt pre-trained large language models to specific tasks, domains, and behaviors through targeted training on curated datasets. The discipline spans lightweight parameter-efficient methods like LoRA, full supervised fine-tuning, and post-training alignment techniques like RLHF and DPO. As foundation models have become commodities, the ability to customize them efficiently for specific applications has become a core competitive differentiator. Fine-tuning engineers sit at the intersection of ML research, data engineering, and production ML, requiring deep expertise across all three.
What Does an LLM Fine-Tuning Engineer Do?
LLM fine-tuning engineers design the training recipes that adapt base language models — selecting appropriate fine-tuning methods, training data compositions, and hyperparameter configurations for specific objectives. They curate and prepare instruction-following datasets, preference datasets, and domain-specific corpora that drive model improvement. Evaluation is a perpetual responsibility — building comprehensive benchmark suites that measure fine-tuned model quality across the target task distribution without overfitting to narrow benchmarks. They implement and tune parameter-efficient fine-tuning methods that allow model customization without the cost of full model training. Post-training alignment work — RLHF, DPO, and Constitutional AI training — is an increasingly central part of the role as safety requirements become more demanding.
Required Skills & Qualifications
- ✓ Parameter-efficient fine-tuning: LoRA, QLoRA, and adapter methods with Hugging Face PEFT
- ✓ RLHF implementation: reward model training, PPO fine-tuning pipelines
- ✓ Direct preference optimization (DPO) and its variants for alignment training
- ✓ Instruction tuning dataset curation and quality filtering techniques
- ✓ Distributed fine-tuning with DeepSpeed ZeRO and FSDP for large model training
- ✓ Training stability techniques: learning rate scheduling, gradient clipping, loss monitoring
- ✓ LLM evaluation: benchmarking with MMLU, MT-Bench, and domain-specific custom evals
- ✓ Model merging techniques: SLERP, TIES-merging, and model soup strategies
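The DPO objective listed above is simple enough to sketch in plain Python. This is a minimal, illustrative version for a single preference pair; the argument names and the beta value are assumptions for the example, not a reference implementation.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) preference pair.

    Each argument is the summed token log-probability of the chosen or
    rejected response under the trainable policy or the frozen
    reference model.
    """
    # Implicit reward margins: how much more the policy favors each
    # response than the reference model does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(logits)), computed in a numerically stable way.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))
```

When the policy still matches the reference exactly, the margins cancel and the loss sits at -log(0.5) ≈ 0.693; raising the chosen response's log-probability relative to the reference pushes the loss down, which is the behavior the optimizer exploits.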
A Day in the Life of an LLM Fine-Tuning Engineer
Mornings begin with reviewing training run dashboards — a DPO fine-tuning job finished overnight, and inspection of the evaluation results shows strong improvement on the target task but a slight regression on instruction following. After identifying the cause — the preference data was too narrowly focused — you design a data mixing strategy that balances domain-specific and general instruction-following examples. Mid-morning involves a data curation session: reviewing samples of newly collected instruction data for quality issues, rewriting ambiguous examples, and filtering low-quality demonstrations. After lunch, you present fine-tuning results to the research team, walking through both quantitative benchmark results and qualitative output samples. The afternoon is spent implementing a new evaluation prompt set targeting the specific failure modes identified this week.
Career Path & Salary Progression
ML Research Intern → Fine-Tuning Engineer I → Senior Fine-Tuning Engineer → Staff ML Engineer → Principal Research Engineer
| Level | Base Salary | Total Comp (with equity) | Intern Monthly |
|---|---|---|---|
| Intern | — | — | $9,500–$15,000/mo |
| Entry-Level (0–2 yrs) | $140,000–$200,000 | +20–40% in equity/bonus | — |
| Mid-Level (3–5 yrs) | $200,000–$280,000 | +30–60% in equity/bonus | — |
| Senior (5–8 yrs) | $280,000–$391,000 | +50–100% in equity/bonus | — |
Salary data sourced from Levels.fyi, Glassdoor, and company disclosures. 2026 estimates.
Top Companies Hiring LLM Fine-Tuning Engineers
Apply for LLM Fine-Tuning Engineer Roles
Submit your profile and a PropelGrad recruiter will help you land an interview for LLM fine-tuning engineer internships and entry-level positions at top companies.
LLM Fine-Tuning Engineer — Frequently Asked Questions
When should you fine-tune a model vs. using RAG or prompt engineering?
Fine-tuning is best when you need to teach the model new behaviors, styles, or domain knowledge that is too complex to convey through prompts. RAG is best when the model needs access to specific, frequently updated factual information. Prompt engineering is best for adjusting behavior within the model's existing capabilities. Most production systems use all three in combination.
What is the difference between LoRA and full fine-tuning?
Full fine-tuning updates all model weights — effective but computationally expensive and requires large gradient memory. LoRA injects trainable low-rank matrices into specific weight matrices, training only ~0.1–1% of total parameters while achieving comparable performance. LoRA is preferred for most fine-tuning tasks due to its efficiency and resistance to catastrophic forgetting.
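The ~0.1–1% figure is easy to check with back-of-the-envelope arithmetic. The sketch below uses illustrative numbers for a 7B-class model (32 layers with `q_proj` and `v_proj` targeted, i.e. 64 matrices of 4096×4096, rank 16); the dimensions are assumptions for the example, not a specific model's config.

```python
def lora_param_fraction(d_in, d_out, rank, n_matrices, total_params):
    """Fraction of parameters trained when LoRA adapters of a given
    rank are attached to n_matrices weight matrices.

    Each adapter contributes A (rank x d_in) plus B (d_out x rank)
    trainable parameters; the base weights stay frozen.
    """
    adapter_params = n_matrices * rank * (d_in + d_out)
    return adapter_params / total_params

# Illustrative 7B-class setup: 64 attention matrices of 4096x4096,
# rank-16 adapters.
frac = lora_param_fraction(4096, 4096, 16, 64, 7_000_000_000)
# Roughly 0.12% of parameters end up trainable.
```

Raising the rank or targeting more matrices (e.g. all attention and MLP projections) moves the fraction toward the upper end of the quoted range while still staying far below full fine-tuning.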
How much data do you need to fine-tune an LLM?
For instruction tuning, high-quality datasets of just 1,000–10,000 examples can meaningfully shift model behavior. For domain adaptation, larger datasets (100K+) are typically needed. Quality matters far more than quantity — a carefully curated dataset of 5,000 examples consistently outperforms a noisy dataset of 50,000.
What is catastrophic forgetting in LLM fine-tuning?
Catastrophic forgetting occurs when fine-tuning on a new task causes the model to lose capabilities it had before training. It happens when the fine-tuning data is too narrow or training runs too long. Techniques to mitigate it include: data mixing (including general instruction data alongside task-specific data), replay buffers, and elastic weight consolidation.
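The data-mixing mitigation can be sketched in a few lines. The 30% general-data ratio below is an illustrative assumption, not a recommended value — teams tune it empirically against held-out general benchmarks.

```python
import random

def mix_datasets(task_examples, general_examples, general_ratio=0.3, seed=0):
    """Build a fine-tuning mix in which roughly `general_ratio` of
    examples are general instruction data, guarding against
    catastrophic forgetting of broad capabilities.
    """
    # Number of general examples needed so they form general_ratio
    # of the final mix.
    n_general = round(len(task_examples) * general_ratio / (1 - general_ratio))
    rng = random.Random(seed)
    sampled = rng.sample(general_examples, min(n_general, len(general_examples)))
    mixed = list(task_examples) + sampled
    rng.shuffle(mixed)  # interleave so batches see both distributions
    return mixed
```

A fixed seed keeps the mix reproducible across training runs, which matters when comparing recipes against each other.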
How do you evaluate whether a fine-tuned model is actually better?
Comprehensive evaluation requires both automated benchmarks and human evaluation. Automated metrics (MMLU, MT-Bench, custom task evals) measure capabilities broadly. Human evaluation with side-by-side comparisons between the baseline and fine-tuned model captures quality dimensions that benchmarks miss. Tracking regression on held-out tasks is essential to ensure fine-tuning didn't damage general capabilities.
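Side-by-side human judgments are usually summarized as a win rate. A minimal tally, assuming judgments are recorded from the fine-tuned model's perspective and ties count as half a win (a common convention, not the only one):

```python
from collections import Counter

def win_rate(judgments):
    """Summarize side-by-side judgments of fine-tuned vs. baseline.

    `judgments` is a list of "win", "loss", or "tie" strings from the
    fine-tuned model's perspective. Ties count as half a win.
    """
    counts = Counter(judgments)
    return (counts["win"] + 0.5 * counts["tie"]) / len(judgments)

# e.g. win_rate(["win"] * 6 + ["tie"] * 2 + ["loss"] * 2) -> 0.7
```

A win rate near 0.5 means the fine-tune is indistinguishable from the baseline on the sampled prompts, which is why it is typically reported alongside the held-out regression benchmarks mentioned above.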