The Future of Drug Discovery AI Is Retrieval as Much as Prediction

By Blaise AI Team

A lot of AI discussion in chemistry still assumes the most valuable system is the one that predicts best from scratch.

That assumption is too narrow.

In live projects, a surprising amount of useful work is not pure prediction at all. It is retrieval.

Have we seen this substitution pattern before? Did this vector kill permeability in the last series? Is there a patent where somebody already explored the obvious fast-follow chemistry? Which internal project found the same microsomal stability trap? Which route worked on a closely related scaffold? Which analogue looked promising until a late counter-screen broke the story?

Those are retrieval questions.

They matter constantly.

Chemists spend a lot of time remembering

Not remembering in the trivial sense of searching for a paper.

Remembering in the project sense: the failed motif, the matched pair that moved exposure the wrong way, the route that looked elegant and turned out impossible on scale, the series that taught the team not to trust a certain assay pattern.

This memory is part of medicinal chemistry judgement. It is one reason experienced teams make better decisions than a clean structure-only model with no access to the scar tissue of prior work.

When AI systems ignore that, they often feel strangely repetitive. They keep proposing ideas the organisation already half knows are bad. They treat every prompt as a cold start. They mistake amnesia for originality.

Prediction without retrieval repeats old mistakes more efficiently

This is especially obvious in settings where the local project history matters more than any broad global prior.

A model can make a decent potency guess and still miss that the same motif already failed for developability reasons. It can propose an appealing scaffold hop and still ignore that the route burden was unacceptable last time. It can recommend an assay sequence and still forget that the previous batch revealed the real bottleneck somewhere else.

That kind of system is not useless. It is just under-instrumented.

It knows chemistry in the abstract and too little about the project it is supposedly helping.

Retrieval is not a fallback. It is a different kind of intelligence

There is a habit in ML culture of treating retrieval as secondary. If the model needs memory, perhaps the underlying predictor was not good enough.

That is the wrong instinct for this domain.

A lot of project value lies in surfacing the right precedent at the right moment. Sometimes the winning move is not a novel prediction. It is a relevant memory delivered early enough to stop a bad decision.

That memory may be internal. It may live in old project decks, notebook fragments, assay archives, synthesis history, or chemist comments. It may be external, embedded in patents, literature, vendor catalogues, or historical series from adjacent programmes.

The point is the same. The system that remembers well may outperform the system that predicts elegantly in a vacuum.

Retrieval also grounds explanation

This matters because teams rarely want only an answer.

They want footing.

Why does the model think this? Which compounds is it leaning on? What related series show the same liability? Which precedent makes the route look feasible? What historical failure is shaping the warning?

Prediction alone often gives a result. Retrieval gives a reason the team can inspect.

That does not solve everything. Precedent can mislead, and memory can be biased. But in a field where decisions need to be argued, not merely emitted, retrieval changes the quality of the interaction.

This is where internal memory becomes strategically important

Public benchmarks underweight this badly because they are mostly blind to internal context. A live biotech project is not.

The organisation knows things that are rarely captured cleanly in public datasets: route pain points, failed motifs, misleading readouts, borderline liabilities, compounds nobody wants to touch again, chemist objections that never became structured labels.

Most current systems leave that on the floor.

That is a waste.

If useful AI in discovery is supposed to become project-native quickly, then retrieval is one of the fastest routes there. It turns historical fragments into active context instead of waiting for the model to rediscover everything by supervised learning alone.

The future stack probably looks hybrid

This is why the future does not look like pure prediction versus pure search.

It looks more hybrid than that. Retrieve the relevant precedents. Use them to ground ranking, route reasoning, and explanation. Update against local project data. Keep the memory live as the project evolves.

The strongest system may not be the one that claims to know chemistry universally.

It may be the one that remembers what matters, when it matters, and uses prediction to extend that memory rather than replace it.

Memory is not optional in a scarred domain

Drug discovery projects accumulate scars. Failed vectors, dead scaffolds, routes that never scaled, readouts that looked meaningful and were not.

Any AI system that behaves as though the project begins fresh each time is missing too much of the reality it is meant to operate inside.

This is why retrieval deserves to sit much closer to the center of the stack. Not as a product feature for convenience. As a core part of how the system becomes useful.

Prediction matters. Of course it does.

But in a field this stateful, memory may matter just as much.