AI-Powered Drug Combination Discovery: Use Cases, Breakthroughs, and What Pharma Leaders Must Know in 2026
A Senior Medical Science Liaison's Evidence-Based Guide to How AI Is Reshaping Combination Therapy R&D
Updated March 2026 | 19-minute read | Sourced from Nature, PMC/NIH, WEF, and peer-reviewed clinical data
📋 What's in This Article
- Why Drug Combination Discovery Needed a Different Kind of Help
- Where Traditional Methods Hit a Wall — and Why
- Head-to-Head: Traditional HTS vs. AI-Driven Discovery in 2026
- Graph Neural Networks: The Architecture Built for Biology
- Multimodal AI: When Genomics, Proteomics, and Clinical Data Merge
- Quantum-Classical Hybrids: Tackling the Hardest Molecular Problems
- Digital Twins: Testing on the Patient Before Testing on the Patient
- Real-World Case Studies: Where These Tools Are Already Working
- Drug Repurposing at AI Speed: The Underappreciated Shortcut
- The Regulatory Tightrope: Explainability, Validation, and the FDA in 2026
- What Pharma Leaders Should Be Doing Differently Right Now
- The 2027–2030 Horizon: Where This Goes Next
- Frequently Asked Questions
Why Drug Combination Discovery Needed a Different Kind of Help
Let me start with a number that should make any pharma R&D leader uncomfortable: if you have just 100 approved drugs and you want to systematically test every possible two-drug combination, that's 4,950 pairs. Add a third drug into the mix, and you're looking at over 161,000 possible triplets. Factor in dosing ratios, sequencing, and cell line specificity, and you're working in the millions. No laboratory on earth has the throughput to test all of them, not at any meaningful depth.
And yet, combination therapy is often the only thing that works. HIV is controlled by cocktails, not monotherapy. Most solid tumors require multi-agent regimens. Drug-resistant tuberculosis demands combinations precisely because single drugs cannot overcome entrenched resistance mechanisms. Autoimmune conditions, neurological disorders, and now even obesity are increasingly managed with layered approaches that target multiple pathways simultaneously. The science has known this for decades. The bottleneck has always been finding the right combinations — and finding them at a pace and cost that makes clinical translation viable.
That bottleneck is cracking. Not through a single technology, but through a convergence of approaches — graph neural networks that model biological interaction networks, multimodal AI that synthesizes genomic and proteomic signals simultaneously, quantum-classical hybrid pipelines that tackle molecular problems classical approximations handle poorly, and patient digital twins that allow researchers to simulate drug responses before a clinical trial begins. By 2026, AI drug combination discovery has moved well past proof of concept. It is reshaping R&D investment decisions at major pharma companies, driving a record number of IND filings for AI-originated molecules, and generating peer-reviewed results that are hard to dismiss.
This article breaks down exactly what's happening, how each technology works in practical terms, where the evidence actually is, and what drug development leaders and pharma scientists should be prioritizing right now.
📊 The 2026 Landscape at a Glance
- Developing a new drug still takes 10+ years and costs over $2.5 billion, with a 90% failure rate in clinical phases (World Economic Forum, 2025)
- 2025 saw the highest single-year jump in IND filings for AI-originated molecules in pharmaceutical history (Drug Discovery Online)
- Over 6,000 companies and research groups globally are engaged in drug discovery, increasingly using AI platforms (WEF, citing Citeline 2025)
- An NCATS/MIT/UNC collaboration used AI to evaluate 1.6 million possible combinations of 32 anticancer drugs — verifying 307 synergistic combinations with an 83% predictive accuracy (PMC/NIH)
- Insilico Medicine's quantum-enhanced pipeline screened 100 million molecules and synthesized 15 promising KRAS-targeting candidates in 2025 (Model Medicines)
- Novartis computationally designed 15 million potential compounds using generative AI and narrowed to approximately 60 for lab synthesis — a fraction of conventional screening volume (WEF Annual Meeting 2026)
Where Traditional Methods Hit a Wall — and Why
High-throughput screening (HTS) transformed drug discovery when it emerged in the 1990s. Robotic platforms could test thousands of compounds against a biological target in days. For its time, it was remarkable. But when the problem shifts from single compounds to combinations — and from single targets to multi-pathway diseases — HTS runs into fundamental limits that better robots cannot solve.
The core issue is combinatorial explosion. The number of possible drug pairs, triplets, and dosing combinations grows so rapidly with the size of the compound library that exhaustive experimental screening becomes practically infeasible at any realistic scale. A lab running 10,000 compound combinations per day would take centuries to cover the full search space of drugs currently on the market.
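The arithmetic behind that claim is easy to check. A minimal sketch (the library sizes and daily throughput below are illustrative, matching the figures used in this article):

```python
import math

# Pairs and triplets from a 100-drug library, as in the opening example.
library_size = 100
pairs = math.comb(library_size, 2)     # unordered two-drug combinations
triplets = math.comb(library_size, 3)  # unordered three-drug combinations
print(pairs, triplets)                 # 4950 161700

# Exhaustive triplet screening across a rough approved-drug count
# (2,000 is an assumed round number) at 10,000 tests per day:
full_market = 2_000
per_day = 10_000
print(f"{math.comb(full_market, 3) / per_day / 365:.0f} years for triplets alone")
```

Even before dosing ratios and sequencing enter the picture, triplets alone put exhaustive screening in century territory.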
Beyond scale, there's a conceptual problem: traditional methods are largely target-centric. The classical drug discovery paradigm says: find a protein that's involved in the disease, design a molecule that binds to it, and hope that binding produces the desired biological effect. This logic was never designed for diseases that are fundamentally network-level problems — cancers driven by multiple compensatory pathway activations, antibiotic-resistant infections that evolve resistance faster than single agents can suppress them, or neurodegeneration driven by complex interactions between misfolded proteins, neuroinflammation, and metabolic dysfunction. These aren't single-target problems. They require multi-target solutions. And finding those solutions requires a computational approach that understands biological networks, not just individual proteins.
Head-to-Head: Traditional HTS vs. AI-Driven Discovery in 2026
Before diving into the specific technologies, here's a direct comparison of the two paradigms across dimensions that matter to pharma R&D leaders:
| Dimension | Traditional High-Throughput Screening (HTS) | 2026 AI-Driven Combination Discovery |
|---|---|---|
| Search Space Coverage | Thousands to low millions of combinations (robotics-limited) | Hundreds of millions evaluated computationally; ~90% reduction in required wet-lab experiments |
| Primary Data Input | Phenotypic or single-target binding assays | Multi-omics (genomic, proteomic, transcriptomic), clinical trial data, molecular structure, pathway networks |
| Biological Reasoning | Single-target focus; network effects largely invisible | Graph neural networks map full protein–protein interaction networks and multi-pathway dynamics |
| Synergy vs. Antagonism Detection | Determined experimentally after screening; slow and costly | Predicted computationally before any experiment; models like SynerGNet, DeepDDS achieve 80–85%+ accuracy |
| Patient Specificity | Population-level, rarely stratified by molecular subtype | Digital twins simulate responses in silico using individual patient genomic/proteomic profiles |
| Drug Repurposing Signal | Opportunistic and largely accidental | Systematic — AI mines approved drug databases for new combination signals across disease areas |
| Timeline to Lead Candidate | 3–5 years (target identification through lead optimization) | 12–18 months demonstrated (Insilico Medicine IPF program, 2021–2022) |
| Cost to Lead Candidate | $300M–$600M (including attrition) | Substantial reduction — exact figures vary; Insilico estimated <$2.6M for early discovery phase |
| Molecular Interaction Modeling | Newtonian force fields; limited quantum accuracy | Quantum-classical hybrids calculate electron-level interactions with greater precision (Insilico KRAS pipeline, 2025) |
| Explainability to Regulators | High — direct experimental evidence at each step | Variable — black-box models create FDA/EMA challenges; XAI frameworks (SHAP, saliency maps) increasingly required |
| Resistance Prediction | Reactive — discovered after clinical failure | Proactive — Drug Resistance Signature (DRS) features in AI models predict resistance-prone combinations before trial |
| Leading Platforms (2026) | Beckman Coulter, PerkinElmer HTS robotic systems | Insilico Chemistry42, Recursion OS, BenevolentAI platform, Novartis Data42, Isomorphic Labs, SandboxAQ+UCSF |
Graph Neural Networks: The Architecture Built for Biology
Of all the AI techniques being applied to drug combination discovery, graph neural networks (GNNs) are the one that deserves the most attention from a mechanistic standpoint — because they solve a problem that traditional machine learning architectures fundamentally cannot.
Most early ML models in drug discovery took drugs as input vectors — lists of numerical features describing molecular properties, fingerprints, or target binding affinities. The problem is that biology isn't a list. Biology is a network. Proteins interact with other proteins. Genes regulate other genes. Drugs modulate pathways that affect dozens of downstream targets, many of which are relevant to a different disease entirely. The combinatorial synergy between two drugs isn't determined by their individual properties in isolation — it's determined by how their respective effects propagate through the same biological network. A flat feature vector can't capture that. A graph can.
How GNNs Model Drug Synergy
A GNN represents drugs, genes, proteins, and disease states as nodes in a graph, connected by edges that represent known biological relationships — protein–protein interactions, drug–target binding, gene regulatory relationships, metabolic pathway connections. The model then learns from this structure through a process called message passing: each node aggregates information from its neighbors, layer by layer, building a representation that reflects not just its own properties but its context within the larger biological network.
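To make message passing concrete, here is a deliberately minimal sketch: a toy protein-interaction graph with fixed averaging weights instead of the learned attention weights real models like SynerGNet and DeepDDS use. Node names and feature values are illustrative, not real biology.

```python
# Adjacency list for a tiny protein-protein interaction graph (illustrative).
graph = {
    "EGFR": ["KRAS", "PI3K"],
    "KRAS": ["EGFR", "RAF1"],
    "PI3K": ["EGFR"],
    "RAF1": ["KRAS"],
}
# One scalar "activity" feature per node; real models use learned embeddings.
features = {"EGFR": 1.0, "KRAS": 0.5, "PI3K": 0.2, "RAF1": 0.0}

def message_pass(graph, feats):
    """One layer: each node averages its neighbors' features,
    then mixes that message with its own state (fixed 50/50 weight)."""
    new = {}
    for node, neighbors in graph.items():
        msg = sum(feats[n] for n in neighbors) / len(neighbors)
        new[node] = 0.5 * feats[node] + 0.5 * msg
    return new

h1 = message_pass(graph, features)
h2 = message_pass(graph, h1)  # stacking layers widens each node's receptive field
print({k: round(v, 3) for k, v in h2.items()})
```

After two layers, RAF1's representation already carries information from EGFR two hops away — exactly the network context a flat feature vector cannot encode.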
For drug synergy prediction specifically, models like SynerGNet construct cancer cell line-specific graphs by integrating heterogeneous biological features into human protein–protein interaction networks. The model then predicts, for a given drug pair and a specific cancer cell line, whether the combination will be synergistic or antagonistic. In peer-reviewed validation, SynerGNet strongly outperformed baseline approaches across multiple synergy score metrics.
DeepDDS, published in Scientific Reports (Nature Portfolio), takes two drug molecular graphs and a cancer cell line gene expression profile as inputs — feeding them through a graph attention network — and predicts synergy with ROC AUC scores that consistently exceed traditional ML baselines, including XGBoost, Random Forest, and standard MLP architectures. What makes graph attention specifically valuable is its ability to focus on the most biologically important subgraphs within the molecule — the equivalent of the model teaching itself which structural features actually matter for synergy.
The Antibiotic Resistance Application
GNNs aren't limited to oncology. A Springer Nature review on GNN models for predicting synergistic drug combinations specifically highlights antibiotic combination prediction — quantifying interactions based on pharmacological similarity between drugs and predicting synergistic antibiotic pairings that can overcome resistant bacterial strains. In the context of antimicrobial resistance, which the WHO projects could cause 10 million deaths annually by 2050, the ability to computationally identify synergistic antibiotic combinations before clinical testing is not a convenience. It's potentially one of the most impactful applications of AI in all of medicine.
"Graph neural networks represent a fundamental shift in how we model biological systems. The drug isn't the unit of analysis — the network is. And synergy only makes sense at the network level."
— Senior Computational Biologist, WEF Biopharma R&D Forum, 2026
Multimodal AI: When Genomics, Proteomics, and Clinical Data Merge
A drug combination often fails not because the drugs don't interact, but because the biological context in which they are applied — this patient, this tumor, this resistance profile, this metabolic state — wasn't adequately modeled. That's the argument for multimodal AI: no single data type tells you enough, but combining them gives you something qualitatively different.
In 2025, an NCATS/MIT/UNC collaboration published in Nature Communications demonstrated exactly this. Their multimodal framework for identifying synergistic drug combinations against pancreatic cancer — one of the most lethal and treatment-resistant cancers — tested 496 combinations of 32 anticancer drugs using PANC-1 cells, then trained multiple machine learning algorithms including Random Forest, XGBoost, Deep Neural Networks, and Graph Convolutional Networks on that experimental foundation. The resulting model computationally evaluated 1.6 million possible combinations, achieved 83% predictive accuracy in laboratory validation, and verified 307 synergistic combinations as genuinely effective.
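A note on what "synergistic" means in studies like this: the call is always made relative to an additivity reference. The study's exact scoring metric isn't detailed here, but Bliss independence is one standard choice, and it fits in a few lines (inhibition values below are illustrative):

```python
# Bliss independence: if two drugs act independently, the expected combined
# inhibition (fractions in [0, 1]) is E_ab = E_a + E_b - E_a * E_b.
# Observed inhibition above E_ab suggests synergy; below it, antagonism.

def bliss_excess(e_a, e_b, e_observed):
    expected = e_a + e_b - e_a * e_b
    return e_observed - expected  # > 0: synergy, < 0: antagonism

print(bliss_excess(0.30, 0.40, 0.75))  # positive excess over 0.58 -> synergy
print(bliss_excess(0.30, 0.40, 0.50))  # negative excess -> mild antagonism
```

AI synergy predictors are, in effect, learning to forecast this excess for drug pairs and cell lines that were never screened.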
The multimodal dimension matters here: the models integrated molecular structure, gene expression, proteomic pathway data, and cell line pharmacogenomics. No single modality produced results close to the combined performance. This mirrors what researchers at BMC Biology found in 2025 with their MultiSyn framework — semi-supervised graph neural network learning that integrated protein-protein interaction networks with multi-omics data consistently outperformed single-modality approaches across all evaluated datasets.
What Multimodal Means in Practice for Drug Teams
For a medicinal chemist or translational researcher, multimodal AI changes the inputs you're working with, not just the outputs. Instead of asking "does this compound bind to this target with acceptable affinity?", you're asking "given this patient's RNA sequencing profile, their tumor's copy number variations, the pharmacokinetic properties of these two drugs, and their known drug resistance signatures — what combination is most likely to produce a synergistic response without activating a compensatory resistance pathway?"
That's a fundamentally different question. And until recently, it was unanswerable at any practical scale. Platforms like BenevolentAI, which integrates biomedical knowledge graphs with multi-omics and clinical data, are building exactly this kind of question-answering capability into the drug discovery workflow. Their platform was used to identify baricitinib — an existing approved rheumatoid arthritis drug — as a candidate treatment for COVID-19 during the pandemic. That's multimodal AI in action: combining molecular, pathway, and clinical literature data to surface a repurposing signal that no single-modality search would have found.
Quantum-Classical Hybrids: Tackling the Hardest Molecular Problems
Before getting into the science, a calibration note: quantum computing in drug discovery is not deployed at scale. The hardware is nascent, the noise levels in current quantum processors limit what's computationally achievable, and much of the work in this space is still research-grade. But the direction is clear, the 2025 results are meaningful, and drug leaders who dismiss quantum-classical hybrid approaches as hype risk being caught off guard within 3–5 years.
The reason quantum computing is relevant to drug combination discovery specifically comes down to a problem in chemistry: electron-level molecular interactions. Classical computers model these interactions using approximations — force fields that treat molecular bonds as springs, or density functional theory calculations that simplify quantum mechanics enough to be computationally tractable. These approximations work well for most drug molecules. They break down for the most challenging targets — flexible proteins, covalent binding mechanisms, multi-molecule interactions in a crowded binding pocket — precisely the cases where conventional drug discovery has struggled most.
Quantum computers, in principle, can calculate these interactions from first principles, without approximation. The catch is scale: today's quantum hardware doesn't have enough stable qubits to handle full drug molecules. The hybrid approach — using quantum circuits for the molecular interaction calculation and classical AI for the broader pattern recognition and optimization — sidesteps this limitation.
Insilico Medicine's KRAS Pipeline: A 2025 Proof of Concept
Insilico Medicine's 2025 quantum-enhanced KRAS program is the clearest real-world demonstration of what hybrid approaches can achieve. KRAS is mutated in roughly 25% of all human cancers, and the G12D variant is the single most common oncogenic mutation in pancreatic cancer — making KRAS-G12D one of the most sought-after and historically undruggable targets in oncology.
Insilico combined quantum circuit Born machines (QCBMs) with deep learning to screen 100 million molecules, refining to 1.1 million candidates, then synthesizing 15 promising compounds. One of them — ISM061-018-2 — achieved a 1.4 µM binding affinity to KRAS-G12D. Critically, the hybrid pipeline showed a 21.5% improvement in filtering out non-viable molecules compared to classical AI-only approaches — meaning fewer resources wasted on dead ends during experimental validation. This number is significant: if 21.5% of the molecules that would have consumed lab resources under classical AI can be computationally eliminated, the compounding cost savings over a drug development program are substantial.
Microsoft's Majorana-1 Chip and What It Means for Pharma
Microsoft's 2025 announcement of its Majorana-1 quantum chip — based on topological qubits that are inherently more stable than conventional superconducting qubits — represents a meaningful hardware milestone. More stable qubits mean larger quantum circuits running with fewer errors, which directly expands the molecular complexity that quantum-classical drug discovery pipelines can handle. An Insilico–University of Toronto collaboration also demonstrated how quantum-classical hybrid computing can expand chemical space exploration beyond what classical AI can efficiently sample — an important capability when looking for genuinely novel molecular scaffolds for combination use.
The implication for pharma: monitoring quantum-classical platform developments is no longer just for computational scientists. It belongs in the strategic technology roadmap conversations at the executive and R&D leadership level.
Digital Twins: Testing on the Patient Before Testing on the Patient
The concept of a patient digital twin is one of those ideas that sounds like science fiction until you understand what it actually is — and then it sounds like an obvious idea that should have existed 20 years ago.
A digital twin, in this context, is a computational model of an individual patient's disease biology. It's built from that patient's genomic sequence, tumor transcriptomic profile, plasma proteomic data, metabolomic markers, pharmacokinetic properties derived from prior medication history, and relevant clinical parameters. Once constructed, the digital twin can be simulated — you administer virtual doses of virtual drug combinations and observe the predicted biological response. Which pathways activate? Which resistance mechanisms engage? Does the combination suppress the target pathway adequately, or does a compensatory bypass emerge that would predict clinical failure?
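At its core, a digital twin is a dynamical system you can perturb with virtual doses. The following is a deliberately tiny caricature — a target pathway driving a disease signal, plus a compensatory bypass that ramps up when the primary pathway is suppressed, integrated with a simple Euler loop. All parameters and pathway names are invented for illustration; real twins involve thousands of coupled variables.

```python
# Toy two-pathway "digital twin" with a compensatory escape route.
# All rate constants below are invented for illustration.

def simulate(dose_a, dose_b, steps=200, dt=0.1):
    primary, bypass = 1.0, 0.1  # initial pathway activities
    for _ in range(steps):
        # Drug A suppresses the primary pathway (plus slow recovery term);
        # the bypass activates as the primary pathway falls, unless drug B
        # suppresses it too.
        d_primary = -dose_a * primary + 0.05 * (1 - primary)
        d_bypass = 0.3 * (1 - primary) * (1 - bypass) - dose_b * bypass
        primary += dt * d_primary
        bypass += dt * d_bypass
    return primary + bypass  # residual "disease signal"

mono = simulate(dose_a=0.8, dose_b=0.0)   # monotherapy: the bypass takes over
combo = simulate(dose_a=0.8, dose_b=0.8)  # combination closes the escape route
print(round(mono, 3), round(combo, 3))
```

Even this toy version reproduces the qualitative behavior the text describes: monotherapy suppresses the target yet leaves the disease signal high because a compensatory pathway engages, while the combination holds both down.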
This is not hand-waving. It is high-dimensional systems biology, made computationally tractable by the same deep learning architectures that power the rest of the AI drug discovery ecosystem.
Why This Changes Clinical Trial Design
The practical value of digital twins isn't just "test the drug without the patient." It's that they allow trialists to enrich clinical trial populations before a single patient is enrolled. If a digital twin model can predict with reasonable confidence which genomic profiles will respond to a specific combination, you can design a trial that enrolls only likely responders — dramatically improving the signal-to-noise ratio, reducing the required trial size, and compressing the timeline to a meaningful readout.
This matters enormously given the economics of oncology trials. A Phase III trial for a new cancer combination regimen can cost $300–500 million and take 5–7 years. If digital twin pre-screening reduces the enrolled population by 40% while improving the response rate by 20 percentage points, the downstream cost, timeline, and probability-of-success implications are transformative.
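The sample-size arithmetic behind that claim follows from the standard normal-approximation formula for comparing two response rates. The response rates below are illustrative, not drawn from any specific trial:

```python
import math

def n_per_arm(p1, p2, z_alpha=1.96, z_beta=0.8416):
    """Patients per arm for a two-sided alpha=0.05, 80%-power comparison
    of two response proportions (standard normal approximation)."""
    pbar = (p1 + p2) / 2
    num = (z_alpha * math.sqrt(2 * pbar * (1 - pbar))
           + z_beta * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2))) ** 2
    return math.ceil(num / (p1 - p2) ** 2)

broad = n_per_arm(0.30, 0.40)     # unenriched: +10 pp over a 30% control rate
enriched = n_per_arm(0.30, 0.50)  # twin-enriched responders: +20 pp
print(broad, enriched)            # enrichment shrinks the trial roughly 4x
```

Doubling the effect size from 10 to 20 percentage points cuts the required enrollment by roughly a factor of four — which is why digital twin enrichment maps so directly onto trial cost and timeline.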
Novartis's Data42 — their in-house data lake covering 30+ years of clinical and preclinical studies — is the foundation for their digital twin and AI safety prediction programs. By training on this longitudinal dataset, their models can simulate safety signals for novel combination candidates based on structural and mechanistic similarity to known compounds — flagging potential toxicity risks computationally before animal studies, let alone human trials.
Real-World Case Studies: Where These Tools Are Already Working
Case Study 1 — Pancreatic Cancer: 307 Verified Synergistic Combinations from 1.6 Million Candidates
Pancreatic ductal adenocarcinoma (PDAC) has a 5-year survival rate below 12% — largely because effective combination regimens are desperately needed and desperately hard to find. Traditional screening of all relevant drug pairs is prohibitively expensive. In 2025, a collaboration between NCATS (National Center for Advancing Translational Sciences), MIT, and UNC Chapel Hill published a framework in Nature Communications that changed this calculus.
Starting with 496 tested combinations of 32 anticancer drugs across PANC-1 cells, the team trained multiple AI models — RF, XGBoost, DNN, and GCN — on the experimental results. Those models then evaluated all 1.6 million possible drug combinations computationally. The result: 307 synergistic combinations verified in laboratory validation, with the MIT GCN model achieving the highest accuracy at 83%. A separately published 2025 study from the same PDAC focus area used multi-omics integration with ML and deep learning to identify TNFRSF10A/TRAILR1 as a novel therapeutic target — proposing temsirolimus, ergotamine, and capivasertib as potential combination modulators. Both studies demonstrate a new paradigm: AI-generated hypotheses grounded in multi-omics data, validated computationally before committing lab resources.
Case Study 2 — Insilico Medicine: First AI-Designed Drug to Complete Phase II
Insilico Medicine's ISM001-055 — a novel drug for idiopathic pulmonary fibrosis (IPF) designed entirely by AI — completed Phase II trials in 2024, marking the first time an AI-designed drug has reached that milestone. The discovery phase, which typically takes 4–5 years in conventional pipelines, was completed in 18 months using Insilico's Chemistry42 generative chemistry platform. Their 2025 KRAS program (described above) extended this into the combination therapy space with quantum-classical enhancement — demonstrating that the platform continues to evolve faster than most competitors can match.
Case Study 3 — Novartis + Isomorphic Labs: 15 Million Compounds, 60 Synthesized
At the World Economic Forum Annual Meeting 2026, Novartis publicly described their generative AI-powered compound design program — a collaboration with Isomorphic Labs (DeepMind's drug discovery spinout). Their team computationally designed 15 million potential compounds, built predictive models for key pharmacokinetic properties including brain penetration, and ultimately synthesized roughly 60 molecules in the lab — arriving at a potent, brain-penetrant molecular scaffold now in further optimization. The ratio of compounds designed to compounds synthesized (15 million to 60) captures the efficiency argument precisely: AI narrows the space before the lab even opens.
Case Study 4 — SandboxAQ + UCSF: Physics-Native AI for Neurodegeneration
The collaboration between SandboxAQ and UCSF's Institute for Neurodegenerative Diseases represents a different approach — using physics-native "large quantitative models" that simulate molecular dynamics from first principles rather than learning patterns from historical data. The WEF recognized this collaboration at AMNC 2025 as part of its inaugural MINDS cohort highlighting value-creating AI applications. The implication for Alzheimer's and Parkinson's combination therapy development is significant: conditions with poorly understood biology — where historical data on failed experiments provides limited signal — are precisely where physics-native models offer advantages over purely data-driven approaches.
Case Study 5 — The $6B Dovetree–XtalPi Deal: China's AI Drug Discovery Surge
The strategic dimension of AI drug combination discovery is increasingly geopolitical. In 2025, five of the ten largest pharma R&D licensing deals originated from Chinese companies, with two specifically focused on AI-driven discovery platforms. The $6 billion Dovetree–XtalPi deal centered on AI-based drug design capabilities. GSK's $12 billion agreement with Hengrui Pharma covered a PDE3/4 inhibitor plus 11 additional candidates across respiratory, immunology, and oncology. Western pharma's assessment of these deals as competitive intelligence, not just transaction news, is warranted.
Drug Repurposing at AI Speed: The Underappreciated Shortcut
Among all the applications of AI in combination therapy, drug repurposing may have the best near-term risk-adjusted return for a pharma R&D organization. The logic is compelling: drugs that already have Phase I safety data, known pharmacokinetic profiles, established manufacturing processes, and in many cases existing commercial supply chains — can be tested in combination far faster and cheaper than novel compounds. The challenge has always been identifying which approved drugs might produce clinically meaningful synergy when combined in a new indication.
AI changes this completely. Knowledge graph models can mine the entire pharmacological literature, approved drug databases like DrugBank, clinical trial registries, and adverse event databases to surface mechanistic connections between approved drugs and underexplored diseases. The BenevolentAI baricitinib–COVID-19 identification described above is the paradigm case. More recently, AI platforms are identifying approved oncology drugs with potential synergistic activity in non-oncology indications — cardiovascular disease, neurodegenerative conditions, and rare metabolic disorders — creating an entirely new category of combination therapy opportunities that don't require a new molecular entity.
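The knowledge-graph mining described above is, mechanically, a path search over typed drug–target–disease edges. A minimal sketch, using the baricitinib case as the worked example (the edge list is a hand-built illustration, not an excerpt from BenevolentAI's graph or DrugBank):

```python
from collections import deque

# Tiny hand-built knowledge graph: (head, relation, tail) triples.
edges = [
    ("baricitinib", "inhibits", "JAK1"),
    ("baricitinib", "inhibits", "AAK1"),
    ("AAK1", "regulates", "clathrin-mediated endocytosis"),
    ("clathrin-mediated endocytosis", "required_for", "SARS-CoV-2 entry"),
    ("SARS-CoV-2 entry", "drives", "COVID-19"),
]
graph = {}
for head, rel, tail in edges:
    graph.setdefault(head, []).append((rel, tail))

def find_path(start, goal):
    """Breadth-first search for the shortest mechanistic chain."""
    queue, seen = deque([[start]]), {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for rel, nxt in graph.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [f"--{rel}-->", nxt])
    return None

print(" ".join(find_path("baricitinib", "COVID-19")))
```

Production platforms run this kind of traversal over millions of literature-derived edges with learned edge weights, but the core output is the same: an explicit mechanistic chain a scientist can evaluate, not just a score.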
For biotech startups in particular, this is a strategic opportunity. An AI-identified repurposed combination can move to Phase II trials in 3–4 years rather than 8–10, with a de-risked safety profile and a clearer regulatory pathway. The FDA's Breakthrough Therapy designation has been awarded to repurposed combination candidates precisely because the safety foundation allows acceleration of the clinical program.
The NIH's National Center for Advancing Translational Sciences (NCATS) maintains open-access datasets including the NCI ALMANAC — a comprehensive database of anticancer drug pair screening data across 60 tumor cell lines — that AI researchers are using to train combination synergy prediction models. This is public infrastructure for AI drug discovery, and many pharma companies are not fully utilizing it.
The Regulatory Tightrope: Explainability, Validation, and the FDA in 2026
Every drug developer working with AI needs to understand a fundamental tension that sits at the center of the field right now: the models that perform best at predicting drug synergy — deep neural networks with complex attention mechanisms, multi-layer GNNs, quantum-classical hybrid pipelines — are also the hardest to explain. And the regulatory agencies that must evaluate their outputs require explanation.
The FDA's guidance on AI/ML in drug development has evolved substantially. Their April 2025 roadmap for reducing animal testing requirements — which explicitly opens the door to AI-generated in silico models as primary supporting evidence in IND submissions — is a significant regulatory shift. But it comes with an accountability requirement: the AI model's predictions must be validatable, its training data must be documented, and its limitations must be disclosed. Black-box models that produce predictions without explanation are not acceptable as primary regulatory evidence.
The Explainability Framework That's Becoming Standard
The most practical response to this requirement is the adoption of Explainable AI (XAI) techniques that can surface mechanistic interpretation of model outputs. A PMC 2025 systematic review of AI in drug discovery (spanning 2015–2025) notes that oncology accounts for 72.8% of published AI drug discovery work — and that explainability is an increasingly central concern in all of it. Models like SynergyX, which explicitly prioritize "highlighting the most influential features in synergy prediction," are gaining adoption precisely because they address the explainability requirement while maintaining competitive predictive performance.
Practically speaking, when you submit a combination therapy candidate identified by an AI model to the FDA with an IND, you'll be answering questions like: What training data did the model use? How was overfitting prevented? Was the model externally validated on a held-out dataset? What are the known failure modes? Can you explain why the model predicted synergy for this specific drug pair in this specific cancer type — and can you link that prediction to a testable biological mechanism?
These are answerable questions for well-governed AI programs. They're not answerable for ad hoc AI use. The organizations that will navigate this best are those building systematic AI governance into their R&D processes now — not scrambling to reconstruct it after a regulatory query.
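One concrete way to answer the "why did the model predict synergy here?" question is feature attribution. SHAP is the tool named earlier in the comparison table; a dependency-free stand-in that illustrates the same idea is permutation importance — shuffle one input feature and measure how much the model's error degrades. The "model" below is a toy stand-in scoring function, not a trained synergy predictor:

```python
import random

random.seed(0)

def model(x):
    # Toy synergy score: depends strongly on feature 0, weakly on 1, not on 2.
    return 2.0 * x[0] + 0.3 * x[1] + 0.0 * x[2]

data = [[random.random() for _ in range(3)] for _ in range(200)]
targets = [model(x) for x in data]

def mse(xs, ys):
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def permutation_importance(feature):
    """Error increase when one feature column is shuffled.
    (Baseline error is exactly 0 for this toy setup.)"""
    shuffled = [row[:] for row in data]
    column = [row[feature] for row in shuffled]
    random.shuffle(column)
    for row, v in zip(shuffled, column):
        row[feature] = v
    return mse(shuffled, targets)

scores = [permutation_importance(f) for f in range(3)]
print([round(s, 4) for s in scores])  # feature 0 dominates, feature 2 is inert
```

The regulatory value is exactly this ranking: it turns "the model predicted synergy" into "the prediction hinges on these inputs", which a reviewer can then connect to a testable biological mechanism.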
Complex Trial Design: The Adaptive Trial Evolution
AI-predicted combinations also change how clinical trials need to be designed. A conventional two-arm trial comparing Drug A alone vs. Drug A + Drug B assumes you know the right dose, the right patient population, and the right endpoint before the trial starts. AI-enabled trials increasingly use adaptive designs — statistical frameworks that allow modification of dosing, patient allocation, or even drug selection based on accumulating data during the trial itself, with pre-specified decision rules that maintain statistical integrity. The FDA and EMA both have guidance on adaptive trial designs, and they're increasingly the expected approach for AI-identified combination candidates precisely because AI prediction uncertainty warrants more flexible evaluation frameworks.
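The phrase "pre-specified decision rules" has a very concrete meaning. A minimal sketch of one such rule — an interim futility look that drops underperforming combination arms — with thresholds and counts that are illustrative only, not drawn from any FDA or EMA guidance:

```python
# Interim look at a three-arm adaptive trial (counts are illustrative).
interim = {
    "A+B":     {"responders": 14, "n": 30},
    "A+C":     {"responders": 4,  "n": 30},
    "A alone": {"responders": 7,  "n": 30},
}
FUTILITY_THRESHOLD = 0.20  # pre-specified before the trial starts

def interim_decision(arms, threshold):
    """Drop any arm whose observed response rate is below the
    pre-specified futility threshold; continue the rest."""
    return {
        arm: ("continue" if c["responders"] / c["n"] >= threshold else "drop")
        for arm, c in arms.items()
    }

print(interim_decision(interim, FUTILITY_THRESHOLD))
```

Real adaptive designs wrap rules like this in group-sequential or Bayesian machinery to preserve type I error, but the statistical integrity comes from the same principle: the rule is fixed before the data arrive.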
What Pharma Leaders Should Be Doing Differently Right Now
The field has progressed to the point where "we're evaluating AI" is no longer a strategy — it's a hedge. The organizations making serious competitive moves are making specific structural decisions. Here's what that looks like in practice:
1. Build the Data Infrastructure First
Every AI drug combination model is only as good as its training data. Novartis's Data42 — a 30-year longitudinal clinical and preclinical data lake — didn't happen overnight. Organizations without structured, curated, accessible data assets should treat that as the primary bottleneck and address it before evaluating external AI platforms. This includes: standardizing assay data formats, annotating historical screening results with molecular structure information, and establishing data governance frameworks for sharing and utilizing combination screening data across internal programs.
2. Invest in Wet-Lab Feedback Loops
AI predictions are hypotheses. They need experimental validation, and that validation data needs to flow back into the model. The trifecta for credible AI-driven drug combination discovery — as Drug Discovery Online's analysis makes clear — is data provenance, model governance, and wet-lab feedback loops. Organizations that treat AI predictions as endpoints rather than starting points will generate impressive slides and disappointing clinical results.
3. Prioritize Explainable Models for Regulatory Submissions
Select AI platforms and build internal models with regulatory explainability requirements in mind from the beginning — not as a retrofit. This means choosing model architectures where attention mechanisms and feature importance can be extracted, maintaining full training data documentation, and building external validation studies into the discovery program before IND submission.
4. Engage with Open-Access Resources
The NCI ALMANAC database, NIH PubChem, the DrugCombDB combination database, and the Cancer Cell Line Encyclopedia (CCLE) are all open-access resources that can seed or augment internal AI combination discovery programs. Many organizations — particularly mid-sized biotechs — are underutilizing the same public data infrastructure that the largest companies are building proprietary pipelines on top of.
5. Watch the Drug Resistance Signature Literature
A 2025 peer-reviewed study (indexed in PMC) demonstrated that AI models incorporating Drug Resistance Signatures (DRS) as biologically informed drug representations "consistently outperform traditional approaches across all evaluated algorithms." DRS features encode the transcriptomic response of cancer cells to a drug, capturing resistance-relevant information that chemical structure-based descriptors miss entirely. Any combination prediction program that isn't incorporating resistance modeling is working with an incomplete picture — and potentially generating combination candidates that look synergistic in naive cells and fail in clinically relevant resistant populations.
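The core DRS idea — representing each drug by the transcriptomic response it induces, alongside its chemical structure — reduces to a feature-construction step. The sketch below is generic, not the published method: the gene-panel size, the random placeholder values, and the concatenation scheme are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)
N_GENES = 978  # e.g. an L1000-style landmark gene panel (illustrative choice)

# DRS block: per-drug transcriptomic response (log fold-changes after treatment).
drs = {
    "drug_A": rng.normal(0, 1, N_GENES),
    "drug_B": rng.normal(0, 1, N_GENES),
}
# Structure block: e.g. a 2048-bit fingerprint per drug (random stand-ins here).
fp = {name: rng.integers(0, 2, 2048) for name in drs}

def pair_features(a, b):
    """Feature vector for a drug pair: both DRS profiles + both fingerprints.
    A synergy classifier is trained on top of this joint representation;
    the DRS study's claim is that the transcriptomic block is what adds
    the resistance-relevant signal structure alone misses."""
    return np.concatenate([drs[a], drs[b], fp[a], fp[b]])

x = pair_features("drug_A", "drug_B")
```

The design point: the model never sees "resistance" as a label here — it sees how the cell's expression program actually shifted under each drug, which is where resistance mechanisms leave their footprint.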
A quick reference map of the platforms and models mentioned throughout this article, by category:
- GNN-based synergy prediction: SynerGNet, DeepDDS, HANSynergy, SynergyX, MultiSyn
- Generative drug design: Insilico Chemistry42, Absci, Generate Biomedicines, OpenFold3
- Knowledge graph / repurposing: BenevolentAI platform, Recursion OS, Exscientia
- Quantum-classical hybrid: Insilico (QCBM+DL), SandboxAQ, IonQ partnerships
- Digital twin / patient simulation: Novartis Data42 ecosystem, Siemens Healthineers, Duke University programs
The 2027–2030 Horizon: Where This Goes Next
Continuous Learning Systems That Improve With Every Trial
The next generation of AI drug combination platforms will incorporate clinical trial outcomes in real time — updating their synergy prediction models as Phase I, II, and III data accumulates across the industry. This creates a compounding learning advantage: organizations with the most mature AI combination platforms and the most clinical data will generate predictions of higher accuracy, which leads to better trial outcomes, which generates more training data. The implication for competitive strategy is that early movers in structured AI drug combination programs build durable advantages that are difficult for late adopters to close.
AI-Guided Triple and Quadruple Combination Design
Most current validated AI combination work focuses on drug pairs. The computational complexity of predicting synergy in three- and four-drug combinations — where the interaction space grows combinatorially — is substantially harder. Hypergraph-based models like HypergraphSynergy, which can account for higher-order interactions beyond drug pairs, represent the emerging methodological frontier. By 2028, AI-guided triple combination therapy design for multidrug-resistant cancers and complex autoimmune conditions is likely to move from research to clinical application.
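The reason hypergraphs matter here is representational: a triple combination is one hyperedge connecting all three drugs at once, not three independent pairwise edges. The incidence-matrix construction below is a generic illustration of that encoding, not HypergraphSynergy's actual implementation; the drugs and combinations are toy data.

```python
import numpy as np

drugs = ["A", "B", "C", "D"]
# Each hyperedge connects ALL drugs in one tested combination simultaneously,
# so the 3-drug regimen is a single edge rather than three pairwise edges.
combinations = [("A", "B"), ("B", "C", "D"), ("A", "C")]

# Incidence matrix H: rows = drugs, columns = hyperedges (combinations).
idx = {d: i for i, d in enumerate(drugs)}
H = np.zeros((len(drugs), len(combinations)), dtype=int)
for j, combo in enumerate(combinations):
    for d in combo:
        H[idx[d], j] = 1

# Column sums give each combination's order (2 = pair, 3 = triplet). An
# ordinary pairwise graph has no column like the middle one — it can only
# approximate the triple as three separate pair interactions.
orders = H.sum(axis=0)  # → array([2, 3, 2])
```

Message passing over H lets the model learn from the joint three-way context directly, which is exactly the higher-order signal pairwise models discard.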
Multispecific Antibodies as AI-Designed Combination Entities
As Drug Discovery World's 2026 outlook notes, biologics discovery is shifting toward multispecific antibodies and engineered proteins that address complex biological targets through novel binding mechanisms. An AI-designed bispecific antibody is, in a sense, a combination therapy in a single molecule — targeting two pathways simultaneously with an entity whose binding properties and geometric compatibility can be optimized computationally. AI-guided design tools for these complex modalities are already being deployed at companies including Absci, Genentech, and Pfizer's biologics division. This convergence of combination therapy logic and engineered biologic design is one of the most consequential frontiers in 2026 pharma R&D.
Frequently Asked Questions
❓ What is AI-powered drug combination discovery?
AI-powered drug combination discovery uses machine learning models — including graph neural networks, deep learning, and multimodal AI — to predict which pairs or groups of drugs will produce synergistic therapeutic effects. Instead of manually screening millions of possible combinations in a lab, AI narrows the search computationally, identifying the most promising candidates for experimental validation. The NCATS/MIT/UNC collaboration in 2025 evaluated 1.6 million possible drug combinations computationally and verified 307 synergistic pairs with 83% predictive accuracy.
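The scale of that search space follows directly from basic combinatorics. The snippet below reproduces the counts quoted in this article; note that mapping the 1.6 million figure to a bare pair count over ~1,800 approved drugs is an illustrative back-of-envelope assumption — real screens also multiply in doses, schedules, and cell contexts.

```python
from math import comb

approved = 100
pairs = comb(approved, 2)    # 4,950 two-drug combinations from 100 drugs
triples = comb(approved, 3)  # 161,700 three-drug combinations ("over 161,000")

# Across roughly 1,800 approved drugs, pairs alone reach the millions:
big_pairs = comb(1800, 2)    # 1,619,100
```

This is why computational triage is not optional: the pair count grows quadratically with the drug library, and triplets grow cubically.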
❓ What is a graph neural network (GNN) and how is it used in drug synergy prediction?
A graph neural network (GNN) represents molecules, proteins, and biological interactions as nodes and edges in a graph — ideal for modeling how drugs interact within biological networks. Unlike flat ML models that work with feature vectors, GNNs capture network context: how a drug's effects propagate through protein-protein interaction networks, gene regulatory relationships, and metabolic pathways. Models like SynerGNet, DeepDDS, and HANSynergy use GNNs to classify drug combinations as synergistic or antagonistic, with accuracy that consistently exceeds classical ML baselines in peer-reviewed validation.
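A single message-passing layer — the core GNN operation — fits in a few lines of NumPy. This is a generic GCN-style sketch on a toy drug-protein network, not the architecture of SynerGNet or DeepDDS; the adjacency pattern, feature dimensions, and random weights are all illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy network: 2 drugs + 3 proteins; edges = known targeting / interaction.
A = np.array([
    [0, 0, 1, 1, 0],   # drug 1 targets proteins 1 and 2
    [0, 0, 0, 1, 1],   # drug 2 targets proteins 2 and 3
    [1, 0, 0, 1, 0],
    [1, 1, 1, 0, 1],
    [0, 1, 0, 1, 0],
])
X = rng.normal(size=(5, 8))        # initial 8-dim node features
W = rng.normal(size=(8, 8)) * 0.1  # layer weights (learned in practice)

# One GCN-style layer: add self-loops, degree-normalize, aggregate, transform.
A_hat = A + np.eye(5)
D_inv_sqrt = np.diag(A_hat.sum(1) ** -0.5)
H = np.maximum(D_inv_sqrt @ A_hat @ D_inv_sqrt @ X @ W, 0)  # ReLU

# After one layer each drug's embedding already mixes in features from the
# proteins it targets; stacking layers propagates network context further.
pair_embedding = np.concatenate([H[0], H[1]])  # input to a synergy classifier
```

This is the "network context" advantage in miniature: the drug nodes' representations are shaped by their biological neighborhood, not just their own descriptors.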
❓ Are AI-discovered drug combinations in clinical trials?
Yes. 2025 saw the highest single-year IND filing rate for AI-originated molecules in pharmaceutical history. Insilico Medicine's ISM001-055 — a fully AI-designed drug for idiopathic pulmonary fibrosis — completed Phase II. AI-predicted combination therapy candidates are advancing through Phase I and II oncology trials at Insilico Medicine, Recursion Pharmaceuticals, Exscientia, and BenevolentAI. The FDA's April 2025 guidance on reducing animal testing requirements is further accelerating this pipeline by allowing in silico AI evidence to support IND submissions.
❓ What is a quantum-classical hybrid approach in drug discovery?
A quantum-classical hybrid combines quantum computing for electron-level molecular interaction calculation with classical deep learning for broader pattern recognition. Classical computers approximate these interactions using force fields; quantum circuits calculate them more accurately from first principles. Insilico Medicine's 2025 KRAS pipeline used quantum circuit Born machines (QCBMs) plus deep learning to screen 100 million molecules, achieving a 21.5% improvement in filtering non-viable compounds versus classical AI alone.
❓ How does a digital twin accelerate combination drug development?
A pharmaceutical digital twin is a computational model of an individual patient's disease biology — built from genomic, proteomic, and clinical data — that can simulate how that specific patient will respond to a drug combination before any drug is administered. This enables enriched clinical trial design by identifying likely responders pre-enrollment, potentially reducing required trial sizes by 30–40% while improving response rates. Novartis's Data42 (30+ years of clinical data) and Isomorphic Labs are leaders in applying this approach.
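The trial-size arithmetic behind enrichment is straightforward: for a binary endpoint, required sample size scales inversely with the squared effect size, so enrolling likely responders shrinks the trial. The 30-40% figure is this article's claim; the code below is the standard two-proportion normal-approximation calculation with illustrative response rates chosen by the author.

```python
from math import ceil

Z_ALPHA, Z_BETA = 1.96, 0.84  # two-sided alpha = 0.05, power = 80%

def n_per_arm(p_ctrl, p_trt):
    """Normal-approximation sample size per arm for comparing two proportions."""
    var = p_ctrl * (1 - p_ctrl) + p_trt * (1 - p_trt)
    return ceil((Z_ALPHA + Z_BETA) ** 2 * var / (p_trt - p_ctrl) ** 2)

# Unselected population: combination lifts response rate 20% -> 32%.
n_all = n_per_arm(0.20, 0.32)       # 206 patients per arm
# Digital-twin-enriched population of likely responders: 20% -> 36%.
n_enriched = n_per_arm(0.20, 0.36)  # 120 patients per arm

savings = 1 - n_enriched / n_all    # ~42% fewer patients per arm
```

The enrichment lever is the treatment-arm response rate: even a modest lift in the enrolled population's expected response compounds quadratically into smaller trials.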
❓ What is the biggest regulatory challenge for AI-discovered combinations?
Explainability. The FDA and EMA require not just evidence that a combination works, but mechanistic understanding of why. Black-box AI models that produce predictions without interpretable reasoning are difficult to support in regulatory submissions. The solution is Explainable AI (XAI) — frameworks like SHAP values, attention weight visualization, and models like SynergyX that explicitly surface the biological features driving their predictions. Regulatory strategy and AI model selection should be aligned from the start of a combination discovery program, not reconciled at the IND stage.
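As a concrete, much-simplified illustration of what "surfacing the features driving a prediction" means: permutation importance — a simpler cousin of SHAP — scores each input by how much shuffling it degrades model error. Everything here is a toy: a known linear stand-in model and synthetic features standing in for pathway-level inputs.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 3))  # features: [pathway_overlap, dose_ratio, noise]

def model(X):
    """Stand-in for a trained synergy predictor (here the ground truth itself)."""
    return 2.0 * X[:, 0] + 0.5 * X[:, 1]  # ignores feature 2 entirely

y = model(X)

def permutation_importance(model, X, y):
    base = np.mean((model(X) - y) ** 2)
    scores = []
    for j in range(X.shape[1]):
        Xp = X.copy()
        Xp[:, j] = rng.permutation(Xp[:, j])  # break feature j's link to y
        scores.append(np.mean((model(Xp) - y) ** 2) - base)
    return np.array(scores)

imp = permutation_importance(model, X, y)
# Feature 0 dominates, feature 2 contributes essentially nothing — the kind of
# ranking a regulatory submission is expected to surface for a synergy model.
```

SHAP values and attention visualizations answer the same question with more rigor, but the regulatory principle is identical: a submission must show which biological inputs the prediction actually rests on.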
Closing Perspective: From Hypothesis Generation to Clinical Reality
There's a useful way to think about where AI drug combination discovery sits in 2026. It has definitively proven that it can generate better hypotheses, faster and cheaper, than any human-designed screening protocol. That proof is in the peer-reviewed literature, the IND filings, the Phase II completions, and the deal structures of the largest pharma licensing transactions of the past two years. The debate about whether AI belongs in the combination therapy discovery pipeline is over.
What remains genuinely uncertain is translation. Can AI-predicted synergies survive the rigors of clinical trial biology — the heterogeneity of real patients, the pharmacokinetic variability, the resistance mechanisms that evolve under treatment pressure? The early evidence from AI-designed drugs in human trials is encouraging. But the sample size is small and the follow-up is short. The next five years of clinical readouts from AI-originated combination programs will be the most important data the field has ever seen.
For pharma leaders, the actionable conclusion isn't to wait for that data. It's to build the infrastructure, the governance, the wet-lab feedback loops, and the regulatory strategy that will allow your organization to move from AI prediction to clinical proof efficiently. The organizations doing that now will be the ones whose clinical readouts everyone else is learning from in 2030.
Are you working on AI-driven combination therapy programs at your organization? What's the biggest challenge you've hit — data quality, regulatory explainability, or translational validation? Share your perspective in the comments — this conversation needs more practitioner voices in it.
PMC/NIH — AI in Oncology Drug Discovery (2025) | PMC — AI Drug Discovery Timelines Review | Scientific Reports (Nature) — GNN Drug Interaction Prediction | Springer — GNN Drug Synergy Review | BMC Biology — MultiSyn Framework 2025 | MDPI Biomolecules — SynerGNet | WEF 2026 — Novartis on AI Drug Discovery | WEF — SandboxAQ + UCSF Large Quantitative Models | Model Medicines — Quantum-Classical Hybrid Pipeline | Drug Discovery Online — 2025 Top Highlights | DDW — Drug Discovery in 2026 | FDA — AI/ML in Drug Development | NIH NCATS — Translational Research Resources | NIH PubChem — Open Chemistry Database | Frontiers in Oncology — AI Precision Medicine Review