By Emilio Rios in Reproducibility — 31 Jan 2026

SciOps: the missing operational layer for modern research teams

I came into research from industry, after years in software factories, telecoms and a global consultancy firm where DevOps/DevSecOps were daily tools. I lived the "works on my machine" failures and the whole-weekend deployments they triggered. Operational thinking mattered there because it made delivery reliable, handoffs safe, and iteration fast. The same patterns show up in modern, data-intensive research teams.

Definition: SciOps

SciOps is the operational layer for reliable, scalable, data-intensive research. It is the set of practices that make scientific work runnable, reviewable, and transferable by a team.

Scope (not tool-specific):

workflow design and orchestration
automation and environments
quality gates and validation
deployment of results and artifacts
collaboration norms and ownership

Recurring pain patterns

These are common failure modes, not moral failings:

fragile, person-dependent pipelines that only run on one laptop
non-repeatable results because parameters, versions, or inputs are missing
slow iteration because every change triggers manual rework
poor handoffs between wet lab and computational teams
hidden decisions that can't be traced back to a run

A DevOps analogy, translated carefully

In software, development and operations split because "writing code" and "running code reliably" are different jobs. Continuous delivery captured the idea that code only has value once it is deployed and used.

For research, the translation is straightforward: results are only (or greatly more) valuable when they are rerunnable, reviewable, and transferable to another person or environment. SciOps is the operational layer that makes that possible. The goal is not to turn labs into software companies; it is to reduce operational risk while making iteration faster and handoffs safer.

Origin story: where the word came from

I knew DevOps/ChatOps/DevSecOps from industry long before I used the term SciOps. Later I came across a preprint that uses "SciOps"; it resonated and gave me a useful vocabulary for an operational layer I was already trying to describe. I do not know who coined the term, and I'm not implying endorsement or ownership. (Reference: https://arxiv.org/abs/2401.00077)

A lightweight maturity framing

This is progress-focused, not a grading system:

Level 1 - Ad hoc runs: individual scripts and manual steps; reruns are possible but fragile.
Level 2 - Shared workflow: basic documentation and shared scripts; some repeatability across people.
Level 3 - Operational baseline: versioned code, pinned environments, smoke tests, and run manifests.
Level 4 - Team-ready delivery: releases, QC gates, and clear ownership support stable handoffs.
Level 5 - Continuous improvement: regular reviews, small releases, and learning from failures.

Minimum viable SciOps operational layer

Each item includes a concrete artifact and a Definition of Done (DoD).

Versioned code + environments
Artifact: Git repo with tagged releases plus environment.yml, requirements.lock, or a container image.
DoD: a fresh clone can run the smoke test using pinned versions without manual steps.
Testable pipelines (smoke tests)
Artifact: tiny test dataset + a scripted smoke test command.
DoD: smoke test completes in <10 minutes and validates expected outputs.
Provenance / run manifest
Artifact: machine-readable run manifest (JSON/YAML) saved with outputs.
DoD: manifest captures input IDs, parameters, tool versions, code revision, and timestamps for every run.
Decision log
Artifact: decisions.md or ADRs in the repo.
DoD: any change to thresholds, cohorts, or definitions has a linked entry.
Ownership / DRI
Artifact: OWNERS.md or a README section naming a DRI and backup.
DoD: OWNERS.md lists a DRI and backup for each pipeline with contact handles.
Release tags + published artifacts
Artifact: Git tags + release notes; built artifacts (containers, bundles, or reports) stored in a registry.
DoD: run manifests include a release tag; artifacts are retrievable by tag.
Basic QC gates
Artifact: automated QC checks with thresholds and a pass/fail report.
DoD: runs fail or flag when QC is out-of-bounds; QC report is stored with outputs.

What each level buys you

Use this as motivation, not as a scorecard:

Level 1 - Ad hoc runs: you can rerun your own work without guesswork and explain what changed.
Level 2 - Shared workflow: a teammate can run it with minimal back-and-forth; onboarding gets lighter.
Level 3 - Operational baseline: runs are reviewable and comparable; failures are diagnosable.
Level 4 - Team-ready delivery: handoffs stabilize; releases and QC make results defensible.
Level 5 - Continuous improvement: feedback loops reduce repeated mistakes and make progress steadier.

Tooling and Level 0: team culture

SciOps, like DevOps, benefits from using several tools for each task: version control systems, change management systems, project management tools, and even team gadgets (bots, dashboards, and shared status views). The tooling matters, but even before Level 1 the hardest part is what I would call Level 0: team culture.

Without shared vision, common responsibility, continuous improvement, proactive communication and feedback, and effective leadership, teams struggle to progress through the levels.

I will talk about culture and tools in the next post. For now, if you're interested but not completely acquainted with the terms used here, the references below are a good starting point.

References and primers

The Turing Way handbook: https://book.the-turing-way.org/
Good enough practices in scientific computing (PLOS Comp Bio): https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005510
Best practices for scientific computing (PLOS Biol): https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1001745
Ten simple rules for reproducible computational research (PLOS Comp Bio): https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1003285
FAIR principles overview: https://www.go-fair.org/fair-principles/
Software Carpentry - Version Control with Git: https://swcarpentry.github.io/git-novice/
Nextflow documentation: https://www.nextflow.io/docs/latest/
Snakemake overview: https://snakemake.github.io/

Call to action

If you adopt only three things this week, start here: pin your environment, add a <10-minute smoke test with a tiny dataset, and write a machine-readable run manifest for every run. Then assign a DRI for the pipeline and cut a first tagged release once it runs end-to-end.