Data Stewards vs. Data Scientists: Why Biotech Needs Both (But Hires Only One)

November 03, 2025

You need both data stewards and data scientists.

You canโ€™t hire your way out of a data stewardship problem. ๐Ÿ—๏ธ

I keep seeing the same pattern at small biotech companies:

โ†’ Team doubles after funding

โ†’ Data requests pile up

โ†’ Leadership posts a Data Scientist role

โ†’ New hire gets buried in โ€œquick reportsโ€

โ†’ Six months later, same bottleneck exists

Hereโ€™s whatโ€™s actually happening: Theyโ€™re hiring a Ferrari to fix their roads. ๐ŸŽ๏ธ

๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐˜๐—ถ๐˜€๐˜๐˜€ ๐—ด๐—ฒ๐—ป๐—ฒ๐—ฟ๐—ฎ๐˜๐—ฒ ๐—ป๐—ฒ๐˜„ ๐—ถ๐—ป๐˜€๐—ถ๐—ด๐—ต๐˜๐˜€

โ†’ Build ML models

โ†’ Design novel analyses

โ†’ Answer questions nobodyโ€™s asked yet

โ†’ Create competitive advantage through discovery

๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐˜๐—ฒ๐˜„๐—ฎ๐—ฟ๐—ฑ๐˜€ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ๐˜€ ๐—ณ๐—ผ๐—ฟ ๐—ถ๐—ป๐˜€๐—ถ๐—ด๐—ต๐˜๐˜€

โ†’ Create self-service tools

โ†’ Build data pipelines

โ†’ Answer the same questions 100 times (so nobody has to again)

โ†’ Create competitive advantage through efficiency

Both roles are critical. But timing matters.

When scientists spend 8+ hours weekly on routine reporting, thatโ€™s not a data science problem. Thatโ€™s a data stewardship problem.

A real example: One client was hiring their third data scientist while their team waited days for basic QC reports. We built a self-service dashboard instead. Reporting time dropped from 8 hours to 45 minutes. The data scientists? They finally got to do actual data science.

๐—ง๐—ต๐—ฒ ๐˜๐—ฒ๐˜€๐˜ ๐—ณ๐—ผ๐—ฟ ๐˜„๐—ต๐—ฎ๐˜ ๐˜†๐—ผ๐˜‚ ๐—ป๐—ฒ๐—ฒ๐—ฑ:

If someone asks โ€œCan you pull this report?โ€ for the tenth time this month โ†’ You need a steward

If someone asks โ€œCan you discover something we donโ€™t know?โ€ โ†’ You need a scientist

Most early-stage biotechs need stewardship infrastructure before advanced analytics. Build the roads before you buy the Ferrari.

The good news? Data stewardship directly enables better data science. Once scientists can self-serve routine analyses, your data science team can focus on the questions that actually require their expertise.

Whatโ€™s been your experience? Have you seen teams struggle with this distinction?