RAG vs Fine-Tuning: Knowledge vs Behaviour

← Home
Choosing between retrieval-based systems and model adaptation for real-world decision support

AI delivery decisions start with a diagnostic, not a tool choice.

Core Diagnostic Question:
Is the model failing because it doesn't know the facts, or because it doesn't know how to act?
---

1. The Core Distinction: Knowledge vs Behaviour

Feature RAG (Knowledge) Fine-Tuning (Behaviour)
Operational Role "Open-book test" "Specialised training"
Primary Function Inject external knowledge at runtime Shape internal behaviour and reasoning
Best For Dynamic data, proprietary knowledge, documents Format, tone, domain logic, structured outputs
Auditability High (source traceable) Low (weights are opaque)
Failure Mode Wrong or missing retrieval Confidently wrong, consistently
Maintenance Update index/data Retrain model - Expensive
---

2. RAG = Memory, Not Intelligence

RAG is not a model upgrade—it is a memory system at inference time.

Key Insight:
Most RAG failures are not generation failures—they are retrieval failures.

Critical design levers:

A bad retrieval result is often worse than no context—the model will reason confidently from incorrect premises.

---

3. Fine-Tuning = Behaviour Shaping

Fine-tuning does not reliably inject knowledge—it reshapes how the model behaves.

Key Insight:
If the model knows the answer but expresses it incorrectly → this is a behaviour problem.

Critical design levers:

Fine-tuning creates consistency, not awareness. It is a snapshot, not a living system.

---

4. The Practical Delivery Hierarchy

Complexity must be earned. Avoid the “GPU trap”.

  1. Prompt Engineering (Baseline): System prompts + few-shot examples. Fast, cheap, often sufficient.
  2. RAG (Knowledge Layer): Add when the model lacks factual grounding or access to proprietary data.
  3. Fine-Tuning (Behaviour Layer): Use only when format, tone, or reasoning cannot be stabilised via prompting.
  4. RAG + Fine-Tuning: Reserved for mature, high-value systems.
Common Failure:
Teams fine-tune too early, when a well-designed RAG pipeline would have solved the problem faster and cheaper.
---

5. Implementation: Engineering Reality

Advanced Pattern:

---

6. Operational Heuristics

← Home
The Delivery Manager’s Rule:
If the model fails on your data → it needs memory (RAG).
If it fails on format or reasoning → it needs training (fine-tuning).
The Golden Rule:
Context beats intelligence. A model with the right data at the right time will outperform a smarter model with the wrong information.