← Back to Pearl
// METHODOLOGY

How we measure the 67% claim.

Pearl claims its AI marking platform, SAM, cuts assessor marking time by 67% on Ofqual-regulated qualifications. This page explains how that number is measured, what counts as marking time, the sample it is drawn from, and the caveats. Written for FE procurement teams who want to verify the maths before signing.

The headline

67%
Mean time saved
8,412
Marked submissions
47
Assessors
4
FE providers

Across AY 2024-25, SAM-assisted marking reduced assessor active marking time by a mean of 67% per submission, with a median of 64% and an interquartile range of 58% to 73%. The sample covered Ofqual-regulated qualifications at Levels 2, 3 and 4, including BTECs, NVQs, Functional Skills and Access to HE units.

What we mean by marking time

Marking time is the active time an assessor spends:

Marking time excludes:

How the baseline is captured

Before SAM is enabled on a qualification, the assessor marks a cohort of submissions in the standard Pearl interface. Active time is measured from submission-open to grade-confirmed, with idle-tab detection pausing the timer after 90 seconds of inactivity. Assessor self-report time logs are reconciled against platform timestamps. The baseline is the mean active marking time per submission across that cohort, computed per assessor per qualification.

How the SAM-assisted time is captured

Once SAM is enabled on the qualification, every submission is processed by SAM on upload. The assessor sees a proposed grade, criterion-level rationale, and suggested feedback. The assessor reviews, edits where needed, and confirms the final mark. Active time is measured on the same submission-open to grade-confirmed basis, with the same idle-tab detection. The SAM-assisted time is the mean active time per submission across the same qualification cohort.

Time saved per submission is computed as:

Time saved % = (Baseline time − SAM-assisted time) / Baseline time × 100

Per-assessor figures are then aggregated to the qualification level, the provider level, and finally the cross-provider mean.

The sample

Caveats and what we do not claim

Three caveats matter when you read the 67% figure:

We do not claim:

How to verify with your own data

Procurement teams can replicate the measurement in a paid pilot:

Pilot pricing and scope are agreed in advance. The pilot is paid because the rubric configuration and assessor training cost real time at our end.

FAQ

Who validated the methodology?

The measurement methodology was reviewed internally by Pearl's product and assessment leads, and externally by an IQA practitioner at one of the four sample providers. It has not been peer-reviewed in an academic journal. We are open to a third-party audit and will publish any subsequent revisions on this page.

Where can I see the raw data?

Anonymised per-assessor, per-qualification time logs are available under NDA for procurement teams in active evaluation. Email sales@epearl.co.uk to request access.

Does the 67% include marking quality?

No. This page only measures time. Mark quality is measured separately, against IQA agreement rates, inter-assessor reliability and learner outcome data. We will publish a separate methodology page on mark quality once the AY 2025-26 dataset is complete.

Run the 67% test on your own provision.

Book a 20-minute call. We will scope a paid pilot against one of your qualifications and agree the measurement protocol up-front.

Book a 20-min demo →
Last updated 26 May 2026. Methodology owner, Pearl product team. Send corrections to sales@epearl.co.uk.