Data Governance and Integrity: Pharma, Biotech, and the Data Deluge, Part 2

Good rules make good drivers: Why drug development can’t seem to get anywhere with their data

Here’s a thought experiment for you: Imagine what it would be like to drive with no lanes or laws. 

Madness, right?

If you’ve been lucky enough to visit some of South Asia’s magnificent, ancient cities, you might say you know exactly what that’s like. Millions of people eventually get from A to B, but the process is… let’s say, more organic than efficient. One way or another, everyone figures out how to make it work. But it’s far from an optimized process.

Even if you’ve never traveled that far, though, you may still know exactly what it’s like to see thousands upon millions of things moving through a space with no rules, no standards, and only the guidance they create for themselves. And you may be all too familiar with the delays, confusion, and inefficiency that that can cause.

Credit: VN Express International

If you’re like many drug development professionals, you don’t need to hop on a plane to experience that. You can just crack open your SharePoint. 

And with that quick thought experiment, we’re back with the next chapter in our ongoing series on data governance and integrity in the drug development industry. In our kickoff post, we looked at why drug developers are still struggling with the many penalties and handicaps of unstructured data – and why that problem is rapidly intensifying with every fresh exabyte of data they produce. Now, in the second chapter of our story, we’ll look at how our industry can begin to structure its way out of that challenge.

Welcome back! 

Tackling the “sum total of arrangements” challenge

As we saw in the first post in this series, pharma and biotech companies often have a sharp gradient between their scientific rigor and their data management discipline. Many are still struggling with information channels so tangled, inconsistent, and choked with heterogeneous files and formats, a Mumbai khaali peeli would feel right at home in them. 

But as we also saw, and as we’ll dig into here, effective data governance depends on clear consistent rules – just like drivers need lanes, speed limits, signals, signs, and more. For knowledge and information to move efficiently through a business system, that business needs a similarly robust governance structure for its data. 

Which begs the question: what should we look for in that structure? What kind of framework best supports effective governance of drug developers’ unique and uniquely complex data?

In our kickoff post, we looked at why drug developers are still struggling with the many penalties and handicaps of unstructured data – and why that problem is rapidly intensifying with every fresh exabyte of data. Now, in the second chapter of our story, we’ll look at how our industry can begin to structure its way out of that challenge.

It’s a question that our industry has struggled to answer for some time – and sometimes answered in ways that produce as much confusion as clarity. 

For example, one common definition of data governance in GMP environments describes it as “The sum total of arrangements to ensure that data, irrespective of the format in which it is generated, recorded, processed, and retained will ensure an attributable, legible, contemporaneous, original, accurate, complete, consistent, enduring, and available record throughout the data lifecycle.”

The intentions are positive and the goals are mandatory… but at the same time, complex ecosystems ultimately demand much more rigorous, precise, and holistic frameworks for navigating them. Imagine if the DMV presented the rules of the road as “the sum total of driving conventions.” Charge your phone, it’s going to take a while for everyone to figure this out!

No, modern drug developers need data structures – rules for the digital roads their data flow through – that are specific, comprehensive, implementation-ready, and mapped to relevant requirements and concepts. But at the same time, they also need to make sure that their digital processes, systems, and frameworks are… well, actually digital.

“We already have a SharePoint”: Getting to the root of a digital divide

Stop me if you’ve heard this before: “We’re a digital-first organization. We’re all on OneDrive.”

Credit where it’s due, spreadsheets, word documents, PDFs, PPTs, and emails are all a small eon ahead of the paper documents that are still all-too-common in our industry – functionally and ecologically. But “rendered with pixels” is still a far cry from true digital information. 

Why? Because out of our industry’s three essential  information formats – paper documents, electronic documents, and digital data – two share the same base unit. Whether their document lives in manila folders or OneDrive folders, paper- and e___-based systems both have the same inescapable limitation: their default is to index their documents rather than the data within them.

Documents that obscure, restrict, and fragment the data they contain. Documents that put physical and electronic limits on the accessibility and interconnection of all the knowledge they record. 

To truly structure and integrate their data, drug developers need to escape not one but both of those limitations. And that starts by atomizing every one of those documents: breaking them down to all the discreet, individual pieces of information they contain. The actual data.

Spreadsheets, word documents, PDFs, PPTs, and emails are all a small eon ahead of the paper documents that are still all-too-common in our industry – functionally and ecologically. But “rendered with pixels” is still far cry from true digital information.

Data that are relational, connectable, and easily referenceable independent of any document. Data that can be linked, integrated, and, yes, structured in ways that facilitate solutions with exponential impact – from powerful visualizations, to automated processes, and streamlined knowledge exchange. 

Once we’ve done that work, and documents – paper or electronic – have been fully decomposed into structure-ready data, then we can get back to the big question. What kind of structure should drug developers look for? 

Well, that’s a FAIR question indeed. Sorry. Couldn’t resist.

The FAIR data paradigm: A promising framework for drug development data

In my recent collaboration with Paul Denny-Gouldson, Sujeegar Jeevanandam, and John F. Conway, we took a detailed look at why the FAIR data framework is especially attractive and advantageous for drug developers. Simply put, it’s a holistic, comprehensive, and practical paradigm that provides an optimal working schema for CMC data. 

That framework has four simple, universal principles. Data should be: 

With just those four principles – and a system of identifiers, labels, and metadata that facilitate them – it’s easy to see how data can be created, stored, and used in a format that’s perfectly suited to the many powerful applications we all now associate with modern digital architecture and information systems. Adopting this framework can be an equally powerful first step toward unleashing the power of those technologies within the drug development process. 

Is it a replacement for ALCOA+? Absolutely not. In fact, it’s a valuable complement to that established data governance paradigm. While ALCOA precisely defines the target outcome of data integrity efforts – how data should be once data governance processes are applied – FAIR represents functional states and actions that produce those outcomes. Just like targeted health management strategies support target health outcomes, following FAIR principles can help drug developers pave a confident path to the level of data integrity they need to achieve.

Outcome
Functional strategy
Health integrity
Weight loss, reduced hypertension, lower cholesterol, lower HbA1c
Increased exercise, stress reduction, diet adaptation, insulin therapy
Data integrity
Attributable, Legible, Contemporaneous, Original, Accurate, +Complete, +Traceable
Unifying data paradigm (like FAIR), rich (meta)data with industry-relevant taxonomies & ontologies, vertically integrated knowledge base with structured foundation, digital (not electronic) technology that supports automated compliance, organizational commitment to modern data management principles

For more on how the FAIR paradigm can be applied to drug development data, I eagerly invite you to explore my collaborative post with Paul, Sujeegar, and John. Suffice it here to say that these four principles represent an optimal framework for drug development data across a wide range of domains and processes – from process development, to risk analyses, to tech transfer and beyond. They can be a powerful tool for establishing a robust and consistent structure across an organization’s data resources, starting from the most discreet base unit of data. 

That’s fabulous for the data, though. But what about the people who create and manage it?

Building a “data quality pyramid” that supports consistent data principle implementation

As any scientist or engineer can tell you, data doesn’t simply appear – in a FAIR framework or any other. Data is a generated product, the output of countless processes, experiments, assays, analyses and more. 

And for drug developers – for businesses in any industry, really – that means that implementation of any data governance strategy has to start with the people who create and manage what goes into that ecosystem. 

I like to think of this reality as a pyramid. Data integrity, the ultimate target outcome, is supported by an effectively operationalized data governance strategy. But that strategy is itself fully dependent on the mindset, behavior, and motivation of multiple personas that span a spectrum between data producers and data consumers – the “data culture” that produces and utilizes the data itself.

And as my colleague John Maguire discussed at the 2022 Digital CMC Summit, that culture takes time, dedication, and persistence to produce. 

How can one be fostered at a drug development organization? It can be challenging in a business where organizational goals can have 10-year time horizons, and where data production and utilization tasks can often seem remote from the outcomes they support. But there are still many things data leaders and transformation stakeholders can do to encourage data-centric thinking and behaviors by the staff who have their hands on that data today. 

Focus on a few key things: 

  • Policies: Set the rules for generating, logging, sharing, and referencing data. Make them clear and make them sticky.
  • Ownership: Clearly define who’s responsible for critical assets like identifiers, taxonomies, and ontological metadata, and ensure that they know how and where those responsibilities fit into their day-to-day workflows. 
  • Processes: Establish clear steps for implementing and managing data infrastructure tasks, whether that’s recording experiment data in a FAIR format, atomizing document data appropriately, or generating reports in a way that complies with your data governance policies. 
  • Incentives: Ensure that data stakeholders feel like following data best practices will benefit them professionally, in whatever way makes sense for your business. 

 

Each of these priorities can make a major contribution to a burgeoning data culture – by both setting clear expectations and showing the value of meeting them. Start there and focus there, and critical constructs like FAIR will take root in an organization that much more quickly and sustainably.

Coming up next: What structured data can mean for CMC processes and outputs

Strong data frameworks are all well and good, of course – but to borrow a phrase, every forward-thinking drug developer has a plan for their data until they get punched in the face by those exabytes of data. So what should a next-gen, truly data-savvy CMC program look like if it’s built with structured information in mind from the ground up? 

Next up in our data governance series, we’ll get down to some of those brassier tacks with a closer look at what CMC workflows and outputs can look like when they’re built on a foundation of structured data. 

Until then, check out some of our team’s other insights on evolving applications, challenges, and best practices for drug development data: 

GET IN TOUCH

Ready to start structuring your product and process data?

No need to wait for the rest of our series. Reach out to our experts today to learn how we can help.

Victor Goetz, Ph.D

Executive Director, TS/MS New Modalities and Data Strategy, Eli Lilly and Company

Victor Goetz, Ph.D. is the Executive Director of Technical Services New Modalities and Data Strategy at Eli Lilly and Company. He has over 35 years of industry experience in developing and commercializing nine novel medicines to enhance the exchange of knowledge needed to speed the delivery of new medicines to patients. Dr. Goetz holds a BS in chemical engineering from Stanford University and a PhD in chemical and biochemical engineering from the University of Pennsylvania.

Rachelle Howard

Director of Manufacturing Systems Automation and Digital Strategy, Vertex Pharmaceuticals

Rachelle is the Director of Manufacturing Systems Automation and Digital Strategy for Vertex’s Small Molecule Manufacturing Center. She oversees the site Automation Engineering function and has co-led Vertex’s global Digital Manufacturing Transformation program since 2019. She leads several initiatives related to data integrity, data management, and employee education. Rachelle is a graduate of Tufts University and the University of Connecticut where she has degrees in Chemical Engineering and a PhD in Process Control.

Vijay Raju

Vice President, CMC Management, Flagship Pioneering

Vijay currently leads CMC activities to deliver on Pioneering Medicines portfolio. The portfolio is built on Flagship Pioneering’s bio-platforms covering multiple modalities (small molecules, biologics, cell & gene therapies). Vijay was previously in technical leadership roles at Novartis.

Greg Troiano

Head of cGMP Strategic Supply & Operations, mRNA Center of Excellence, Sanofi

Greg serves as Head of cGMP Strategic Supply and Operations at the mRNA Center of Excellence at Sanofi, where he is responsible for all aspects of clinical production and raw material supply chain. He joined Sanofi via acquisition of Translate Bio, where he was Chief Manufacturing Officer and responsible for Technical Operations. Over his 20+ year career in the drug delivery field, Greg had various roles leading the pharmaceutical development of complex formulations, including numerous nano- and microparticle based systems. Greg received his MSE and BS in Biomedical Engineering from The Johns Hopkins University and was elected and inducted into the American Institute for Medical and Biological Engineering (AIMBE) College of Fellows in 2020 for recognition of his accomplishments in drug delivery.

Pat Sacco

Senior Vice President Manufacturing, Quality, and Operations, SalioGen

Pat is a Biotechnology technical operations executive with 30+ years of experience leading and managing technical operations functions at numerous innovative companies in the biotech and life sciences industries. He has a passion for advancing and implementing best practices in pharmaceutical manufacturing.

Diana Bowley

Associate Director, Data & Digital Strategy, AbbVie

Diana is the Associate Director, Data & Digital Strategy in S&T-Biologics Development and Launch leading the organization’s Digital Transformation since October 2021. She joined AbbVie in 2012 in the R&D-Discovery Biologics group focused on antibody and multi-specific protein screening and engineering, leading multiple programs to the cell line development stage. In 2017 she joined Information Research and led a team of IT professionals who supported AbbVie’s Discovery Scientists in Biotherapeutics, Chemistry, Immunology and Neuroscience. She has a PhD in Molecular Biology from The Scripps Research Institute and Bachelor of Science in Chemistry from The University of Northern Iowa.

Robert Dimitri, M.S., M.B.A.

Director Digital Quality Systems, Thermo Fisher Scientific

Robert Dimitri is a Director of Digital Quality Systems in Thermofisher’s Pharma Services Group. Previously he was a Digital Transformation and Innovation Lead in Takeda’s Business Excellence for the Biologics Operating Unit while leading Digital and Data Sciences groups in Manufacturing Sciences at Takeda’s Massachusetts Biologics Site.

Devendra Deshmukh

Global Head, Digital Science Business Operations, Thermo Fisher Scientific

Devendra Deshmukh currently leads Global Business Operations for Digital Science Solutions at Thermo Fisher Scientific. In this role he oversees operations broadly for the business across its product portfolio and leads the global professional services, technical support, and product education teams.

Grant Henderson

Sr. Dir. Manufacturing Science and Technology, VernalBio

Grant Henderson is the Senior Director of Manufacturing Science and Technology at Vernal Biosciences. He has years of expertise in pharmaceutical manufacturing process development/characterization, advanced design of experiments, and principles of operational excellence.

Ryan Nielsen

Life Sciences Global Sales Director, Rockwell Automation

Ryan Nielsen is the Life Sciences Global Sales Director at Rockwell Automation. He has over 17 years of industry experience and a passion for collaboration in solving complex problems and adding value to the life sciences space.

Shameek Ray

Head of Quality Manufacturing Informatics, Zifo

Shameek Ray is the Head of Quality Manufacturing Informatics and Zifo and has extensive experience in implementing laboratory informatics and automation for life sciences, forensics, consumer goods, chemicals, food and beverage, and crop science industries. With his background in services, consulting, and product management, he has helped numerous labs embark on their digital transformation journey.

Max Peterson​

Lab Data Automation Practice Manager, Zifo

Max Petersen is the Lab Data Automation Practice Manager at Zifo responsible for developing strategy for their Lab Data Automation Solution (LDAS) offerings. He has over 20 years of experience in informatics and simulation technologies in life sciences, chemicals, and materials applications.

Michael Stapleton

Board Director, QbDVision

Michael Stapleton is a life sciences leader with success spanning leadership roles in software, consumables, instruments, services, consulting, and pharmaceuticals. He is a constant innovator, optimist, influencer, and digital thought leader identifying the next strategic challenge in life sciences, executing and operationalizing on high impact strategic plans to drive growth.

Matthew Schulze

Head of Digital Pioneering Medicines & Regulatory Systems, Flagship Pioneering

Matt Schulze is a Senior Director in the Flagship Digital, IT, and Informatics team, where he leads and manages the digital evolution for Pioneering Medicines. His role is pivotal in ensuring that digital strategies align with the overall goals and objectives of the Flagship Pioneering initiative.

His robust background in digital life sciences includes expertise in applications, informatics, data management, and IT/OT management. He previously spearheaded Digital Biomanufacturing Applications at Resilience, a CDMO start-up backed by Arch, where he established a team responsible for implementing global manufacturing automation systems, Quality Assurance applications, laboratory systems, and data management applications.

Matt holds a B.S. in Biology and Biotechnology from Worcester Polytechnic Institute and an M.B.A. from the Boston University Questrom School of Business, where he focused on Strategy and Innovation.

Daniel R. Matlis

Founder and President, Axendia

Daniel R. Matlis is the Founder and President of Axendia, an analyst firm providing trusted advice to life science executives on business, technology, and regulatory issues. He has three decades of industry experience spanning all life science and is an active contributor to FDA’s Case for Quality Initiative. Dan is also a member of the FDA’s advisory council on modeling, simulation, and in-silico clinical trials and co-chaired the Product Quality Outcomes Analytics initiative with agency officials.

Kir Henrici

CEO, The Henrici Group

Kir is a life science consultant working domestically and internationally for over 12 years in support of quality and compliance for pharma and biotech. Her deep belief in adopting digital technology and data analytics as the foundation for business excellence and life science innovation has made her a key member of PDA and ISPE – she currently serves on the PDA Regulatory Affairs/Quality Advisory Board

Oliver Hesse

VP & Head of Biotech Data Science & Digitalization, Bayer Pharmaceuticals

Oliver Hesse is the current VP & Head of Biotech Data Science & Digitalization for Bayer, based in Berkeley, California. He has a degree in Biotechnology from TU Berlin and started his career in a Biotech start-up in Germany before joining Bayer in 2008 to work on automation, digitalization, and the application of data science in the biopharmaceutical industry.

John Maguire

Director of Manufacturing Sciences, Sanofi mRNA Center of Excellence

With over 18 years of process engineering experience, John is an expert in the application of process engineering and operational technology in support of the production of life science therapeutics. His work includes plant capability analysis, functional specification development, and the start-up of drug substance manufacturing facilities in Ireland and the United States.

Chris Kopinski

Business Development Executive, Life Sciences and Healthcare at AWS

As a Business Development Executive at Amazon Web Services, Chris leads teams focused on tackling customer problems through digital transformation. This experience includes leading business process intelligence and data science programs within the global technology organizations and improving outcomes through data-driven development practices.

Tim Adkins

Digital Life Science Operations, ZÆTHER

Tim Adkins is a Director of Digital Life Sciences Operations at ZÆTHER, serving the life science industry by assisting companies reach their desired business outcomes through digital IT/OT solutions. He has 30 years of industry experience as an IT/OT leader in global operational improvements and support, manufacturing system design, and implementation programs.

Blake Hotz

Manufacturing Sciences Data Manager, Sanofi

At Sanofi’s mRNA Center of Excellence, Blake Hotz focuses on developing data ingestion and cleaning workflows using digital tools. He has over 5 years of experience in biotech and holds degrees in Chemical Engineering (B.S.) and Biomedical Engineering (M.S.) from Tufts University.

Anthony DeBiase

Offering Manager, Rockwell Automation

Anthony has over 14 years of experience in the life science industry focusing on process development, operational technology (OT) implementation, technology transfer, CMC and cGMP manufacturing in biologics, cell therapies, and regenerative medicine.

Andy Zheng

Data Solution Architect, ZÆTHER

Andy Zheng is a Data Solution Architect at ZÆTHER who strives to grow and develop cutting-edge solutions in industrial automation and life science. His years of experience within the software automation field focused on bringing innovative solutions to customers which improve process efficiency.

Sue Plant

Phorum Director, Regulatory CMC, Biophorum

Sue Plant is the Phorum Director of Regulatory CMC at BioPhorum, a leading network of biopharmaceutical organizations that aims to connect, collaborate, and accelerate innovation. With over 20 years of experience in life sciences, regulatory, and technology, she focuses on improving access to medicines through innovation in the regulatory ecosystem.

Yash Sabharwal​

President & CEO, QbDVision

Yash Sabharwal is an accomplished inventor, entrepreneur, and executive specializing in the funding and growth of early-stage technology companies focused on life science applications. He has started 3 companies and successfully exited his last two, bringing a wealth of strategic and tactical experience to the team.

Joschka Buyel

Senior MSAT Scientist at Viralgen, Process and Knowledge Management Scientist at Bayer AG

Joschka is responsible for the rollout and integration of QbDVision at Bayer Pharmaceuticals. He previously worked on various late-stage projects as a Quality-by-Design Expert for Product & Process Characterization, Process Validation, and Transfers. Joschka has a Ph.D. in Drug Sciences from Bonn University and a M.S. and B.S. in Molecular and Applied Biotechnology from the RWTH University.

Luke Guerrero

COO, QbDVision

A veteran technologist and company leader with a global CV, Luke currently oversees the core business operations across QbDVision and its teams. Before joining QbDVision, he developed, grew, and led key practices for international agency Brand Networks, and spent six years deploying technology and business strategies for PricewaterhouseCoopers’ CIO Advisory consulting unit.

Gloria Gadea Lopez

Head of Global Consultancy, Business Platforms | Ph.D., Biosystems Engineering

Gloria Gadea-Lopez is the Head of Global Consultancy at Business Platforms. Using her prior extensive experience in the biopharmaceutical industry, she supports companies in developing strategies and delivering digital systems for successful operations. She holds degrees in Chemical Engineering, Food Science (M.S.), and Biosystems Engineering (Ph.D.)

Speaker Name

Speaker’s Pretty Long Title, Specialty, and Business

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Aliquam dignissim velit et est ultricies, a commodo mauris interdum. Etiam sed ante mi. Aliquam vestibulum faucibus nisi vel lacinia. Nam suscipit felis sed erat varius mollis. Mauris diam diam, viverra nec dolor et, sodales volutpat nulla. Nam in euismod orci.