Read Me First: TAR Primer

Read Me First: TAR Primer

A high-level primer for understanding what Technology-Assisted Review is, how the workflow moves from goals to validation, and how prediction scores and metrics should be interpreted before relying on TAR in a real matter.

What TAR is, in plain English

What it is

  • Technology-Assisted Review is a defensible review workflow that uses software predictions trained by human coding decisions.
  • The goal is not to replace legal judgment; the goal is to prioritize, classify, validate, and document review decisions at scale.
  • A TAR result is only as useful as the protocol, training signal, quality control, validation sample, and stopping rationale behind it.

Topics covered

  • Computer-assisted review, continuous active learning, prediction scores, relevance calls, validation, proportionality, defensibility

Start with the process, not the product. TAR works when the review team can explain the goal, the coding rules, what the model learned, how the outputs were tested, and why the final stopping point was reasonable for the matter.

TAR Workflow at a Glance

  1. 01Set the goal

    Decide whether TAR will prioritize likely relevant material, reduce review volume, support quality control, or help reach a defensible stopping point.

  2. 02Set the protocol

    Write coding rules for relevance, issues, privilege, families, confidentiality, and reviewer escalation before training signals begin to shape the model.

  3. 03Train with human judgment

    Use expert review, reviewer coding, seed examples, or continuous active learning so the system has examples of what matters.

  4. 04Predict and prioritize

    Let the system score or rank documents so the team can review the most likely useful material earlier and monitor what the model is uncertain about.

  5. 05Review and quality check

    Review high-value queues, resolve conflicts, sample uncertain or low-score areas, and track coding overturns that may signal unclear guidance.

  6. 06Validate the result

    Use sampling, recall, precision, elusion, confidence intervals, and documented QC to test whether the workflow is meeting its goal.

  7. 07Document and move forward

    Record the rationale, metrics, assumptions, limits, and next review phase, such as privilege review, redaction, production, or additional training.

How Prediction Score Bands Guide Review

Prediction scoreWhat it usually meansUseful review action
80-100Prediction score range
The model sees strong similarity to documents reviewers have coded as relevant or important.Review early, check for hot documents, privilege, key issues, and consistent coding.
40-79Prediction score range
The model is less certain or the record has mixed signals.Use targeted review, conflict checks, and additional training to clarify whether the middle is noise or a real issue area.
0-39Prediction score range
The model predicts lower likelihood of relevance based on current training.Do not ignore automatically; validate with sampling and elusion testing before deciding this material can be left behind.

Metrics That Make TAR Defensible

Recallrelevant found / total relevant

Tests whether the workflow is capturing enough of the relevant population.

Precisiontrue relevant reviewed / predicted relevant

Shows how much review effort is being spent on material that is actually useful.

Elusionrelevant sampled from left-behind set / sampled left-behind set

Estimates what relevant material may remain after a proposed stopping point.

Richnessrelevant documents / total population

Sets expectations for how hard it will be to find relevant material and how large samples may need to be.

F1 Score2 * precision * recall / (precision + recall)

Balances precision and recall when comparing model behavior, but should not replace legal judgment.

Confidence Intervalestimate +/- margin of error

Shows uncertainty around sampling-based measurements so the team does not overstate precision.

MetricFormulaPurpose
Recallrelevant found / total relevantTests whether the workflow is capturing enough of the relevant population.
Precisiontrue relevant reviewed / predicted relevantShows how much review effort is being spent on material that is actually useful.
Elusionrelevant sampled from left-behind set / sampled left-behind setEstimates what relevant material may remain after a proposed stopping point.
Richnessrelevant documents / total populationSets expectations for how hard it will be to find relevant material and how large samples may need to be.
F1 Score2 * precision * recall / (precision + recall)Balances precision and recall when comparing model behavior, but should not replace legal judgment.
Confidence Intervalestimate +/- margin of errorShows uncertainty around sampling-based measurements so the team does not overstate precision.

Thresholds

RecallMin 70 · Target 80 · Strong 90
PrecisionMin 40 · Target 60 · Strong 80
Confidence LevelMin 90 · Target 95 · Strong 99
MetricMinimumTargetExcellent
Recall708090
Precision406080
Confidence Level909599

What to Document Before You Stop

QuestionWhy it matters
What population did TAR cover?Scope, exclusions, deduplication, family handling, and date/custodian choices affect what the model could find.
Who coded training and QC documents?Reviewer skill and consistency shape the signal the model learns from.
What score cutoff or stopping rule was used?A cutoff is a decision point, not proof by itself; it needs validation and proportionality support.
What did the validation sample show?Elusion, recall, confidence, and error review are the evidence that the workflow was reasonable.
What risks still need human review?Privilege, confidentiality, issue nuance, redactions, and court/client obligations still need lawyer oversight.

Where to go next

What it is

  • Use the next section, Start Here: Additional Overview Resources, for the primary EDRM, FJC, and Grossman/Cormack source materials behind this primer.
  • Use Validation, Metrics & Sampling when you need the math behind recall, precision, elusion, and confidence intervals.
  • Use Practical Checklists & Protocol Prompts when you are preparing for a Rule 26(f) conference, vendor planning meeting, or internal TAR kickoff.

Topics covered

  • Overview resources, validation, sampling, protocol planning, AI governance, case law, vendor documentation

The rest of this library is the primer's backup material: primary sources first, then legal authority, validation concepts, protocol prompts, and specific tool documentation.

Start Here: Additional Overview Resources