Blog

What the CBSE On-Screen Marking Failure Tells Us About Digital Exam Evaluation

Rishabh Galkar

June 1, 2026

Something happened in May 2026 that every exam administrator in India should study carefully.

The Central Board of Secondary Education rolled out On-Screen Marking for Class 12 board examinations at national scale — nearly one crore answer books, 40 crore scanned pages. The execution collapsed in a way that affected over 17 lakh student households and pushed the national pass percentage to its lowest point in seven years.

This is not about blame. It is about understanding what went wrong — and what it tells us about how digital evaluation systems should be deployed.

Where It Went Wrong

No pilot before full-scale deployment

The most consequential decision was going from zero to national scale in one step. No phased rollout. No small-scale pilot to surface edge cases. Systems that behave perfectly in controlled conditions routinely surface unexpected failure modes at scale. When the stakes are Class 12 results and there is no fallback, every failure mode becomes a crisis.

Scanning quality was not validated

Students who accessed their scanned copies found pages blurred beyond legibility. In some cases, answer sheets were tagged to the wrong roll numbers — a different student's handwriting appearing under someone else's name. Some of these were evaluated and marked before the errors were caught. A basic scan validation step before evaluation begins would have caught this.

The safety net was removed too early

Post-result verification was abolished on the grounds that digital auto-tabulation eliminates totalling errors. True — but it does not account for upstream errors: scanning failures, mismatched files, marks assigned to illegible content. Removing the safety net before the system was proven left no room for correction when failures arrived.

Warning signs were ignored

Weeks before actual evaluation, teachers reported portal failures, slow performance, and data errors during mock sessions. These were early signals of exactly what later caused the rollout to unravel. Pre-deployment warnings are the cheapest version of a failure — they should be taken seriously, not managed as a perception problem.

What Responsible Deployment Looks Like

Pilot before scaling. Run one subject, one exam batch. Compare against manual evaluation. Fix the gaps. Then expand.

Validate inputs before processing. Confirm every scan is legible and correctly tagged before it enters the evaluation queue.

Keep humans in the loop. Automation should reduce the burden on reviewers — not eliminate them. A system that flags uncertain cases for human review is fundamentally more resilient than one that processes everything automatically.

Do not remove safety nets before they are earned. Maintain verification mechanisms until real deployment data — not controlled demos — justifies removing them.

The Broader Lesson

The CBSE OSM rollout did not fail because On-Screen Marking is a bad idea. It failed because the system was deployed in a way that gave it no chance to surface and fix its own problems before they affected real students.

An AI-assisted evaluation system that flags low-confidence answers for human review creates a built-in safety mechanism — the system surfaces its own uncertainty, routes those cases to human judgment, and produces a full audit trail for every answer. That is not just a better evaluation system. It is a defensible one.

Where to Start

One subject. One real exam batch. Full evaluation from scan to result — with quality checks, confidence scoring, and a comparison against manual evaluation. That is what a pilot tells you that a demo never can.

ParikshAI is an AI-assisted exam evaluation platform built for Indian universities. Our hybrid model — AI grading with human review for low-confidence answers — is designed for institutions that want faster results without compromising on accuracy or oversight.

Request your free pilot at pariksha.tech