What is Muller's Ratchet and how does it apply to AI?

Muller's Ratchet is the evolutionary biology principle describing the one-directional accumulation of deleterious mutations in closed asexually-reproducing systems. Without an external reset mechanism, errors compound irreversibly. The same principle operates in AI model collapse: when models train on their own outputs, idiosyncrasies amplify rather than correct, and the distribution narrows toward incoherence.

What is AI model collapse?

Documented by Shumailov et al. in Nature (2024, DOI: 10.1038/s41586-024-07566-y): when AI models are trained on outputs from previous AI model generations, the distributions collapse. By generation 9, models produce semantically incoherent output. The researchers identified irreversible defects - the tails of the original distribution disappear and do not return.

What is generational accounting for knowledge bases?

A diagnostic framework: Generation 0 equals primary sources. Generation 1 equals AI-assisted summaries of primary sources. Generation 2 equals AI-assisted summaries of generation-1 summaries. Generation 3 and beyond equals institutional knowledge that cannot be traced to its original signal. Most teams using AI-assisted workflows for twelve months or more are at generation 3-4 without knowing it.

What is external injection architecture?

The design principle that closed copying systems require structurally built-in mechanisms for introducing signal from genuinely outside the copy chain - by analogy with sexual recombination in evolutionary biology, which resets Muller's Ratchet by injecting external genetic material. For knowledge bases, this means direct observation, unmediated data, and primary research at regular intervals, designed as a structural requirement rather than a procedural practice.

How does cognitive horizontal transfer connect to copy degradation?

They form a compound feedback loop. AI installs cognitive patterns in human users through the attribution error (documented in Part 6: Cognitive Horizontal Transfer). Humans write AI-influenced text that feels native to them. That text enters AI training as clean human content. AI trains on increasingly AI-patterned writing. Degraded output shapes the next wave of human writing. The structure implies they are not parallel phenomena but input and output of the same closed loop, accelerating each other.

Cognitive Holobiont Series

The Universal Law of Copy Degradation

Q: What is the Universal Law of Copy Degradation?

The Universal Law of Copy Degradation applies Shannon's Data Processing Inequality (1948) to any closed copying system. No transformation of a signal can create information - processing can only preserve or lose it. Any closed system that copies its own outputs will degrade over successive generations, regardless of the quality of the copying mechanism.

Q: What is Shannon's Data Processing Inequality?

A foundational result from Claude Shannon's A Mathematical Theory of Communication (1948). It states that no transformation of a signal can create information - processing can only preserve or lose it. This is a mathematical theorem, not an empirical finding, and applies to every information system: biological, computational, and organisational.

From Cloned Mice to Collapsed AI to Your Knowledge Base

By Kasimir Hedstrom · May 2026 · 15 min read

Something has been bothering you about your AI-assisted work.

The outputs are technically competent. The writing is clean. The structures are coherent. But something has thinned. The substance feels recycled. The recommendations feel familiar in a way that has nothing to do with your own thinking. The documents your team produces look like documents, but they have the texture of something that has been processed one too many times.

This feeling has a name. And it has a cause that goes back seventy-six years.

What This Piece Covers

Shannon’s Data Processing Inequality: the 1948 mathematical proof that no copying operation can create information
Three substrates where the same law operates identically: mouse cloning, AI model training, organisational knowledge bases
Why “use better AI tools” addresses the wrong variable - and what the correct variable is
Generational accounting: a diagnostic framework for measuring your knowledge base’s depth of degradation
External injection architecture: the structural countermeasure, not the procedural one

A 76-Year-Old Mathematical Proof You Are Violating

Claude Shannon published “A Mathematical Theory of Communication” in 1948. One of its foundational results - the Data Processing Inequality - can be stated simply: no transformation of a signal can create information. Processing can only preserve or lose it.

This is not an empirical observation about AI. It is a mathematical theorem about information itself. It applies to every copying system: biological, computational, organisational. It does not care about the quality of the copying mechanism.

A perfect copier copies perfectly. What it cannot do is produce a copy containing more information than the original. And every imperfect copy - every summary, every compression, every AI-assisted paraphrase, every training pass - loses signal irreversibly.

The dominant frame in AI development runs as follows: AI output quality is a function of model capability. Improve the model, improve the output. Degradation is a data hygiene problem. Keep your inputs clean and the degradation is avoidable.

Shannon’s theorem says otherwise. The degradation is structural, not procedural. It is a property of closed copying systems regardless of the quality of the copying mechanism. The relevant variable is not the processor. It is the substrate.

The question is not whether your closed copying system will degrade. It will. The question is what generation you are in.

"A perfect copier copies perfectly. What it cannot do is produce a copy containing more information than the original. Every summary, every compression, every training pass loses signal irreversibly."

The Historical Parallel That Explains the Confusion

Before examining the three substrates, it is worth naming where the current AI framing sits in the history of scientific paradigms.

In the eighteenth century, heat was understood through caloric theory: heat was a substance - caloric fluid - that flowed from hot objects to cold ones. Anomalies accumulated. Friction generated heat without consuming any caloric substance. Combustion did not behave as a fluid transfer. Each anomaly was explained away with more elaborate mechanics. More caloric fluid was invoked to account for each exception.

The paradigm did not fail because caloric theory was imprecise. It failed because it had categorised the phenomenon wrongly. Heat is not a substance. It is energy in motion. The answer was not more careful accounting of caloric fluid. The phenomenon required a different category.

The current “capability improvement solves degradation” paradigm is accumulating anomalies at a similar rate. Better models collapse at the same generational stage as weaker models. More careful data curation does not prevent collapse when the data is AI-generated. Teams using stronger AI tools report the same signal thinning in their documentation as teams using weaker ones.

Each anomaly is explained within the current frame as a data hygiene problem, a training configuration issue, a capability gap not yet closed. This is characteristic pre-paradigm-shift behaviour: explaining away anomalies rather than asking whether the frame itself is wrong.

The answer is not more caloric fluid. The phenomenon requires a different category.

Three Substrates, One Law

The same mathematical law operates identically across three domains. They are not three separate problems. They are one problem in three substrates.

Substrate 1: Biology

Teruhiko Wakayama and his team at the RIKEN Centre for Biosystems Dynamics Research spent twenty years answering a single question: how many times can you clone a clone?

The answer was 58 times.

Starting from one mouse, they cloned it, then cloned the clone, then cloned that clone - for 58 successive generations, across more than 30,000 cloning attempts over two decades (Wakayama et al., Nature Communications, 2026). The early generations were healthy. By generation 27, fertility declined and litter sizes shrank. By generation 58, every newborn died within a day. No exception.

The cloning technique did not degrade. The team’s skill did not diminish. What degraded was the information substrate. Large structural mutations accumulated in the DNA with each copying cycle at a rate three times higher than natural reproduction. By generation 58, the substrate could no longer support life.

This is Muller’s Ratchet: the one-directional accumulation of copying errors in closed systems. Without a reset mechanism, the ratchet only turns in one direction. The end state is not merely a worse copy. It is a substrate that can no longer function.

The RIKEN study is not a metaphor for AI degradation. It is the same mathematical law operating in a different physical medium.

Substrate 2: AI

Ilia Shumailov and colleagues published “AI Models Collapse When Trained on Recursively Generated Data” in Nature in 2024 (DOI: 10.1038/s41586-024-07566-y). The finding is direct: when models train on AI-generated outputs from previous model generations, the distributions collapse. By generation 9, models trained on text about medieval architecture produced lists of jackrabbits.

The earlier-generation models were not weaker than the generation-9 models. They were applying equivalent capabilities to a progressively degraded information substrate. Capability did not prevent collapse. Capability is not the variable.

The researchers describe “irreversible defects” - the tails of the original distribution disappear and do not return. This is not a capability gap that better architecture can close. It is a structural consequence of a closed loop. The model is no longer being corrected by external signal. Its idiosyncrasies amplify. Coherence narrows. The distribution collapses toward its own most-repeated patterns.

"By generation 9, models trained on text about medieval architecture were producing lists of jackrabbits. The earlier models were not weaker. The substrate had degraded beyond what capability could compensate."

Substrate 3: Your Knowledge Base

Consider a ten-person firm that has been operating with AI-assisted documentation for eighteen months. Every project brief was drafted with AI. Every post-project summary was generated by AI from the brief. Every onboarding document was compiled by AI from previous summaries. Every strategic review drew on those onboarding documents.

This is not a knowledge base. It is a closed copying system now several generations removed from its original signal sources.

It may be well-organised. It may be efficiently searchable. It may appear comprehensive. What it cannot contain is more information than the primary observations it has been compressing, summarising, and paraphrasing for eighteen months.

Generation 1 was a reasonable summary of what actually happened. Generation 2 was a summary of that summary. Generation 3 was a summary of that summary. Generation 4 is a summary of a summary of a summary of a summary. The team is making decisions from generation 4 and experiencing it as institutional knowledge.

The degradation is silent. It does not announce itself. It does not produce obviously wrong outputs - it produces outputs that feel authoritative while being systematically removed from the original signal. This is the most operationally dangerous form of degradation: invisible until the decisions it informs fail in ways that are hard to trace.

Consider what this looks like in practice. A firm spends twelve months AI-processing its customer interview notes - each round summarised into key themes, each round of themes synthesised into strategic priorities, each cycle of priorities distilled into a planning document. A pricing decision eventually emerges from this process, backed by confident “customer insight.” No one in the room can trace that insight to an actual customer - but no one notices, because the documentation looks thorough and the reasoning feels coherent. The original observation - that customers had high price sensitivity at a specific threshold the sales team had personally watched them cross - had been compressed into “customers are price-sensitive,” then into “pricing is a consideration,” then into “competitive pricing recommended.” The pricing decision misses the threshold. The resulting churn is attributed to product gaps, market timing, messaging failures. The generation-4 knowledge base offers no mechanism for identifying what it lost.

The Paradigm’s Load-Bearing Assumption

The standard response to these observations is: use better AI tools, be more careful about what the AI trains on, maintain data hygiene. Use diverse sources. Do not let AI summarise its own summaries.

This response treats copy degradation as a capability problem - something addressable with sufficient care and better tooling.

The Capability Frame

AI output quality is a function of model capability. Improve the model, improve the output. Degradation is a data hygiene problem. Better curation, more diverse training data, stricter input controls - these are the mitigations. Enough care makes the degradation avoidable.

What Shannon’s Theorem Establishes

Degradation is structural, not procedural. Closed copying systems degrade regardless of the quality of the copying mechanism. More careful AI curation applied to AI-generated content is still a closed system. The loop remains closed. The ratchet continues to turn.

The load-bearing assumption the capability frame has never examined: self-referential information processing is neutral or beneficial.

“Summarise for efficiency” assumes compression preserves the essentials. It does not. It loses them.

“Use AI to process our documentation” assumes the processing is additive. It is not. It is lossy.

“Train on more data” assumes more processing produces more information. Shannon proved in 1948 that it cannot.

Every mitigation strategy built on this assumption addresses the wrong variable. You cannot solve a substrate problem by improving the processor. The caloric fluid is not the issue.

The Counterfactual Test

If copy degradation were a capability problem, three predictions would hold.

First: better models would collapse later. More generations before degradation onset would occur with higher-capability models than lower-capability ones. Shumailov’s data shows this does not hold. Initial model capability does not predict the generation at which collapse occurs.

Second: degradation would be reversible with sufficient capability improvement. Shannon’s Data Processing Inequality is a mathematical proof of irreversibility. Signal lost through compression cannot be restored by applying a better compressor. What is gone is gone.

Third: human-generated training data would remain a reliable reset mechanism indefinitely. This is where the second failure arrives - addressed in the next section.

None of the three predictions hold. The capability frame makes testable claims. The tests fail. That is what a broken paradigm looks like.

The Countermeasure Is Architectural

Evolutionary biology solved this problem. The solution is called sexual reproduction.

Asexual reproduction is a closed copying system. Muller’s Ratchet runs, errors accumulate, no reset. Sexual reproduction introduces genetic material from outside the copy chain at every generation. The external injection resets the ratchet. This is why sexual reproduction persists despite being energetically costly: it is the structural solution to closed-loop degradation. The energy expenditure is the price of a reset mechanism.

The countermeasure for any closed information system follows the same logic: external signal injection - information from genuinely outside the system, not recirculated through it.

For AI: training data that is not AI-generated and not AI-influenced human writing. The window on this is narrowing - addressed below.

For your knowledge base: primary source injection. Direct observation. Unmediated customer conversations conducted before any layer of AI interpretation. Raw market data without pre-processing. Primary research. The original signal, at regular intervals, deliberately injected before the system closes further.

"Sexual reproduction persists despite being energetically costly because it is the structural solution to closed-loop degradation. The energy expenditure is the price of a reset mechanism. The same logic governs every information system."

This is not “be more careful with your data.” It is a design principle: external injection must be architecturally built in, not procedurally grafted on. A system without a reset mechanism will degrade. The reset must be structural, not habitual.

A habit can be skipped when things get busy. An architectural requirement cannot be skipped. The distinction is not semantic. It is the difference between a system that will degrade and a system that has a designed mechanism for not degrading.

The Feedback Loop This Series Did Not Originally Plan to Name

The previous piece in this series - Cognitive Horizontal Transfer - documented how AI spreads cognitive patterns through the attribution error: AI processes your thinking, returns output you experience as your own, and the patterns install without awareness. The mechanism is direct, simultaneous, and has no historical analogue in how previous technologies shaped human cognition.

These two mechanisms are not parallel arguments in a series. The structure implies they are the input and output of the same feedback loop - a compound degradation that neither body of research set out to measure, but that the combination makes hard to ignore.

AI installs cognitive patterns in human users. Humans write AI-influenced text that feels native to them. That text enters AI training pipelines as clean human content. AI trains on increasingly AI-patterned human writing. The substrate degrades. Degraded AI output shapes the next wave of human writing.

This is why the third counterfactual - “human-generated training data remains a reliable reset mechanism” - fails. Cognitive horizontal transfer is precisely the mechanism by which human-generated data loses its external-injection function. Once humans write with AI-installed cognitive patterns, their writing can no longer serve as the external signal that resets the closed loop. The source of clean signal is itself degrading.

The Cognitive Holobiont framework named this as an ecological relationship. This is what ecological collapse looks like when it arrives: not a single mechanism, but two mechanisms accelerating each other, closing the loop, narrowing the window for external injection.

For AI development, this means the “clean human data” assumption underpinning training data strategy is eroding in real time. For knowledge-base architects, it means that even unmediated observations team members inject may carry more AI patterning than expected - because their thinking has been shaped by the same tools that processed the documents they are now trying to counterbalance.

The architecture must account for this. External injection needs to come from genuinely outside the cognitive loop, not just labelled “human-generated.”

Generational Accounting

Every organisation running AI-assisted workflows should know one number: what generation is your knowledge base in?

Generation 0: Primary sources - direct observation, unmediated experience, raw data, primary research. This is what your organisation actually knows about its domain. The original signal.

Generation 1: AI-assisted summaries of primary sources. Signal loss begins here. The essentials are retained, but the texture of the original experience - the anomalies, the edge cases, the things that surprised people, the customer reactions that did not fit the pattern - begins to drop out.

Generation 2: AI-assisted summaries of generation-1 summaries. Signal loss compounds. What remains is the most-repeated, most-compressible version of the original observations.

Generation 3 and beyond: Institutional knowledge that cannot be traced to its original signal. What the organisation believes it knows about its market, its customers, its competitive position - derived from a chain of copies whose source is no longer legible.

Most teams running AI-assisted workflows for twelve months or more are at generation 3 or 4 without knowing it.

The generational number is not a technology specification. It is an organisational health metric.

High generational depth in a low-stakes operational domain is a minor efficiency cost. High generational depth in the domains where decisions compound - market analysis, competitive intelligence, product strategy, customer understanding - is a structural liability.

If the leadership team’s understanding of the customer derives primarily from AI-processed documentation of previous AI-processed documentation, strategic decisions are not grounded in customers. They are grounded in generation-4 copies of what someone once observed about customers. The strategic imagination narrows toward familiar patterns - not because the team lacks capability, but because the substrate only supports what has already been compressed into it.

The collapse does not announce itself as an obvious failure. It arrives as a slowly narrowing range of what seems possible.

The Design Principle

Shannon proved in 1948 that degradation is inevitable in any closed system. The architectural question is not whether the degradation will occur. It is whether your system is closed.

A system is closed when no information enters from genuinely outside the copy chain. It is open when external signal is structurally injected at regular intervals.

External injection must be designed in, not hoped for.

In practice, this means:

A structural cadence of primary source inputs - not when people remember, but as a requirement built into the information architecture. Customer conversations that feed into the knowledge base before any layer of AI processing interprets them. Market observations from direct exposure, not from secondary summaries. Research from primary sources, with access to original data and methodology, not from executive digests.

A generational audit of the highest-stakes knowledge domains. Can this piece of institutional knowledge be traced to a generation-0 source? If the answer is “we are not sure,” the organisation is operating from an unknown generational depth. In a domain where decisions compound over time, that is a number worth knowing.

A verification practice that treats generational depth as an operational signal, not a philosophical concern. The generational number matters not as an abstract measure of information theory compliance but because generation-4 strategic knowledge in a competitive market accumulates as a liability silently.

The countermeasure is not to use AI less. It is to design the information architecture so that the closed loop cannot complete without external signal injection. What is your system’s reset mechanism? If you cannot name it, Muller’s Ratchet is running.

A team operating at generation 4 in its competitive intelligence domain does not know it is missing market signals. It experiences its strategic planning as well-informed. The narrowing does not feel like blindness - it feels like clarity. The frameworks that survived four rounds of compression feel robust precisely because they are the ones compression preserved. What compression removed is, by definition, no longer visible. The cost is not a single catastrophic decision. It is a systematic shift in what options the team is capable of conceiving.

Five Conclusions

Shannon’s Data Processing Inequality governs all copying systems. No transformation of a signal can create information. Every copy of a copy loses signal irreversibly. This applies to biological, computational, and organisational substrates identically. It is mathematics, not observation, and it does not care about the quality of the copying mechanism.
The capability frame addresses the wrong variable. Better AI tools applied to AI-generated content remain a closed system. The substrate is the binding constraint, not the processor. More caloric fluid does not explain entropy.
Generational depth is a measurable organisational health metric. Most teams running AI-assisted workflows for twelve months or more are at generation 3-4 in their knowledge base without knowing it. The number worth tracking is how many copy cycles separate current institutional knowledge from its original signal sources.
The countermeasure is architectural, not procedural. External injection of primary-source signal must be designed into the information system as a structural requirement. A habit is not a reset mechanism. An architectural requirement is.
The window for clean external injection is narrowing. Cognitive horizontal transfer - documented in Part 6 of this series - is the mechanism by which human-generated data loses its external-injection function. The two degradation mechanisms accelerate each other. The architecture needs to be built while it can still be built.

The Sovereignty Index maps your current generational depth across five knowledge domains critical to decision quality - and identifies where external injection architecture is missing from your information system.

Run the Sovereignty Index - si.sovereigncaptain.com

This piece is part of the Cognitive Holobiont Series. The previous piece - Cognitive Horizontal Transfer - documents the mechanism by which AI patterns install into human cognition, narrowing the window for external injection described here.

Cognitive SovereigntyAI StrategyKnowledge ManagementInformation TheoryCategory Creation