When the run goes sideways — the hidden cracks in everyday workflows
I was pouring a cuppa in the lab kitchen when the scheduler lit up: a 10x Visium slide run at 09:00, Trinity College lab, March 2022 — and we lost nearly 12% of reads to barcode mismatch. That small scene frames a larger trouble with a transcriptomics dataset: scenario (routine Visium run) + data (12% drop in usable reads) + question (how many more runs are quietly failing in the background?).

I have over 15 years running core facilities and I’ll be blunt — the spatial omics resource center is judged by its quiet failures as much as its successes. I see the same traditional flaws: brittle metadata practices, ad-hoc library preparation notes scribbled in lab books, and mismatched barcoding schemas that only reveal themselves after expensive sequencing. Spatial transcriptomics, UMI collapse and barcode integrity are not abstract terms here; they are the daily battles. We once reprocessed a run and found that a single misapplied index caused a 0.8× coverage drop across key loci — grand, but costly. (It stings.) These are not flashy problems but they erode trust — and trust runs your publications, grants, and collaborations. — moving on to how we fix it.
What’s the unseen snag?
Fixing the foundation — pragmatic steps and a forward view
We changed tack after that March morning. I began treating each transcriptomics dataset like a legal document: provenance, exact protocol versions, scanner serials, and library preparation lot numbers logged in a searchable registry. Direct action matters: enforce a standard barcode dictionary, automate basic QC to flag UMI saturation, and require paired metadata with every sample. The difference was measurable — a subsequent batch in June 2022 showed a 9% improvement in usable reads and a quarter fewer re-runs. I learned to prefer systems that give clear audit trails rather than opaque dashboards. Short sentence. Clear wins.

Looking ahead, I urge labs to compare platforms not on features alone but on reproducibility metrics. You want traceability, resilient metadata schemas, and predictable library prep outcomes. Integrate slide scanner logs with the LIMS; don’t let data live in isolated spreadsheets. We found that simple automation of barcode checks saved a week of troubleshooting across three projects. Quick aside — there are humane gains here too: fewer late-night calls, fewer ruined samples. What’s next? Build the chain from sample to report. — And keep the human in the loop.
What’s Next
Evaluation metrics and closing counsel
I’ll close with three practical evaluation metrics I use when choosing infrastructure or partners for spatial omics work: 1) Proven reproducibility — measured as percent variance across replicate slides and documented across at least two independent runs; 2) Metadata completeness — proportion of samples with full protocol, instrument, and reagent traceability (aim for >95%); 3) Recovery efficiency — usable read fraction after QC (benchmarks vary, but track changes run-to-run). I insist on these because they turn vague promises into numbers you can weigh. We once dropped a supplier because their recovery efficiency slipped by 6% over four months — decision made, time saved, relationships preserved. Wait — one more thing: check how they handle interrupted runs (real life!), because that’s the moment they show you what’s under the hood.
Evaluate, insist on metrics, and then keep watching the dashboard. I speak from experience — I’ve shepherded dozens of datasets from messy starts to publishable cohorts, and the details matter: exact reagent lot, scanner firmware, timestamped metadata. If you want a partner who treats those details as sacred, consider the people behind stomics.