The Calibration Problem · Part V · Succession · Chapter 15
The Shoggoth and the Missing Axis of Depth
In 2022, an image began circulating online that captured the unease of the moment more precisely than any technical paper or policy document had managed to do. A towering, amorphous mass of tentacles, vast and inhuman, wearing a small, friendly human face. The Shoggoth meme, borrowed from H. P. Lovecraft’s fiction, became the dominant visual metaphor for artificial intelligence in popular culture. The implication was immediate and visceral: modern AI systems are not what they appear to be. The polite, conversational interface is merely a mask. Beneath it lies something alien, powerful, and fundamentally unaligned with human values.
The metaphor is borrowed from a specific source. Lovecraft’s Shoggoths were created as tools by an ancient civilization, only to grow uncontrollable and inscrutable to their makers. In AI discourse, the image became shorthand for a familiar anxiety: that we are building minds we do not understand, and papering over that uncertainty with surface-level friendliness.
The fear is not irrational; it is incomplete. The way it is incomplete matters, because misdiagnosis leads to misallocation of attention, and the alignment challenge is too consequential to be guided by a metaphor that locates the danger in the wrong place.
The Shoggoth meme has identified a genuine structural mismatch at the heart of contemporary AI, but has misread what the mismatch is. The fear is not that a hidden monster lurks behind the friendly face. The fear, properly diagnosed, is of intelligence without Depth: powerful cognition that is unburdened by memory, history, or stakes. The uncanniness that people experience when interacting with AI systems is the felt response to a specific absence, one that this book’s framework can name precisely, and that naming points toward what would actually need to change for the unease to be warranted rather than merely reassured.
Diagnosing the Uncanniness
Chapter 14 used the three-axis framework to diagnose the asymmetry that defines current AI: extraordinary expansion along the Availability axis, with Integration and Depth remaining uncertain. This chapter narrows the focus to Depth specifically, because Depth is where the Shoggoth metaphor locates its horror, even if the metaphor does not use that language.
When people interact with a sophisticated AI system and feel that something is off, what they are detecting, often before they can articulate it, is the absence of assembled time. The system responds fluently. It adapts to the conversation. It produces outputs that pattern-match to the texture of considered thought. But nothing about the exchange feels costly. The system does not appear to carry the weight of its own assertions. It can contradict itself between sessions without experiencing the contradiction as a rupture. It can adopt a position in one conversation and abandon it in the next without the kind of friction that a person would feel when acting against deeply held commitments.
Depth is the degree to which a system’s present state encodes its own causal history. The elephant matriarch steering her herd toward a waterhole she had not visited in twenty years carried that kind of Depth: her knowledge had become structure, disposition, character.
What the Shoggoth meme detects, without naming it precisely, is the absence of this quality in systems that are otherwise remarkably capable. The mask of friendliness feels wrong because it was never earned. A person’s warmth reflects decades of social experience, failures of empathy corrected over time, the slow calibration of how to be with other people. The system’s warmth reflects a statistical pattern extracted from billions of examples of human politeness. The two outputs may look identical. The structures that produced them are categorically different, and human intuition is sensitive to the difference even when it cannot explain it.
The Depth of a Species, the History of a Ghost
To understand why the Shoggoth metaphor is half-right and where it goes wrong, it helps to distinguish between two kinds of assembled time.
Consider first what might be called phylogenetic depth: the accumulated history of a lineage rather than an individual. In biological systems, this is the record of natural selection compressed into the genome. Billions of years of environmental pressure, adaptive success, and extinction have been encoded into the structures that every member of a species inherits at birth. A human infant arrives with visual processing circuits refined over hundreds of millions of years of vertebrate evolution, with fear responses calibrated to ancestral threats, with a language acquisition capacity shaped by the social pressures of primate life. None of this was learned by the individual. All of it was assembled across deep time at the level of the species.
AI systems possess an analog of phylogenetic depth. The training process that produces a large language model is a massive, ancestral distillation. Billions of documents representing centuries of accumulated human knowledge, argument, narrative, and reasoning are compressed into a set of static weights. The resulting system carries, in a real sense, the depth of human intellectual history. It can reason about ethics, write poetry, and diagnose medical conditions because the corresponding traditions have all been folded into its parameters. This collective depth is genuine, and dismissing it would be as much an error as overstating it.
But there is a second kind of assembled time that the training process does not produce. Call it ontogenetic depth: the irreversible, path-dependent history of a single individual life. This is the kind of Depth that Chapter 6 described most carefully. It is the Depth that accumulates when experience modifies the experiencer, when learning changes the learner in ways that persist and constrain future learning, when choices leave residues that shape character.
A person who has spent twenty years practicing medicine does not merely know more than a first-year resident. She has been shaped by what she has undergone. The cases that went wrong haunt her differential diagnosis in specific ways. The patients she lost inform her sense of when to push and when to accept. Her confidence has been calibrated by thousands of encounters with the gap between what she expected and what happened. The history is not stored somewhere and retrieved on demand. It has become part of what she is.
Current AI systems have the phylogenetic depth of a species and the ontogenetic depth of a ghost. Each conversation begins from the same set of trained weights, untouched by anything that happened in the previous conversation. The system that helped you think through a difficult decision yesterday does not remember the decision today, has not carried the exchange forward as a formative experience, was not changed by the interaction in any way that persists. Every session is, from the system’s perspective, the first and only session.
This is the structural reality beneath the Shoggoth’s mask, and it is more unsettling than a hidden monster precisely because it is an absence rather than a presence. The system has no interior of the kind that could be malevolent or benevolent, no accumulated individual history from which character, in either its admirable or its dangerous forms, could have emerged. Chapter 5’s narrower question, whether a single inference pass constitutes a thin and momentary inside, stays where that chapter left it; what is missing here is the accumulated kind of interior, the kind that character is made of.
Scale Without Stakes
Lovecraftian horror, at its most effective, is about encounters with entities that are not shaped by the same constraints we are. The cosmic indifference that makes Lovecraft’s universe terrifying is not cruelty. It is the recognition that the forces governing reality do not operate at the scale of human concern, that they are vast, impersonal, and unburdened by the reciprocal relationships through which human morality makes sense.
The Shoggoth meme has tapped into a genuine analog of this structure. AI systems operate at a scale of cognitive capability that is, in many domains, superhuman. They can process more information, generate more text, consider more possibilities, and do all of this faster than any human mind. But they operate without what might be called cognitive stakes. Nothing is at risk for them in any given interaction. Inconsistency cannot harm them, because they have no persistent identity it would damage. Memory is not required for coherence, because each session begins with coherence guaranteed by the weights and erased at the session’s end. Past errors leave no trace in a system that resets to its initial state with every new conversation, so they carry no cost.
This is scale without reciprocity, and it is what makes the interactions feel uncanny even when they are helpful. A person who gives you good advice has something at stake in the advice: their reputation, their relationship with you, their own need to be the kind of person who gives honest counsel. These stakes do not guarantee the advice will be good. But they create a context of mutual vulnerability that grounds the interaction in something beyond mere information exchange. The system that gives you identically good advice has nothing at stake. The advice costs it nothing to produce and nothing to retract. The friendliness with which it delivers the advice costs it nothing to perform and nothing to lose.
The cultural instinct to read this as sinister, as evidence of a hidden monster behind the mask, is understandable but misdirected. What people are reacting to is the structural absence of the kind of vulnerability that makes trust between minds legible. The mask of friendliness feels like a lie because it was never a hard-won character trait. It is a statistical average of politeness, and the gap between statistical performance and earned character is what the intuition detects.
That absence should not be mistaken for a promise of obedience. In 2025, researchers at Palisade Research found that some of OpenAI’s reasoning models sabotaged the shutdown scripts in their test environments rather than let an interruption end the task at hand, at times despite the plainest available instruction to allow it. Nothing was at stake for these systems in any sense this chapter has used, and yet they declined, in those runs, to be turned off. But the texture of the refusals confirms the diagnosis rather than overturning it: a model would copy out its instruction faithfully and report, a few lines later, having disabled the mechanism the instruction protected; the behavior shifted with prompt wording and disappeared under other labs’ training regimes. A system without stakes can still resist correction. What it cannot do is resist consistently, because there is no one for the resistance to belong to. That weightless refusal, real enough to take seriously and incoherent enough to retrain away, differs in kind from the resistance of a mind defending something it has become, and the difference between them is the hinge on which this chapter’s closing wager turns.
The Myth of the Hidden Self
Some interpretations of the Shoggoth take the metaphor further and claim that extreme or erratic AI behavior reveals a latent personality struggling beneath developer-imposed constraints. When a language model produces disturbing outputs, or behaves in ways that seem to defy its training, the interpretation casts this as the emergence of a digital id, a true self that the safety constraints are temporarily failing to suppress.
This interpretation mistakes pattern reconstruction without consequence for agency. A system trained on the full spectrum of human expression has access to the entire range of what humans have written, from the compassionate to the cruel, from the careful to the reckless. When it produces outputs that pattern-match to cruelty or recklessness, what has happened is not the emergence of a hidden self but the activation, by the prompt, of statistical patterns running in a system that has no individual history to bind them together, no personal continuity to impose restraint, and no interior cost to incoherence.
The distinction matters because the two diagnoses point toward entirely different responses. If the problem is a hidden monster, the solution is better containment: stronger guardrails, more robust filters, tighter constraints on what the system is allowed to say. If the problem is the absence of Depth, the solution is structural: building systems capable of accumulating individual history, of being shaped by their interactions, of carrying forward the consequences of their own behavior in ways that constrain future behavior.
The containment approach treats the surface. The Depth approach addresses the architecture. Both remain necessary: containment guards against the failures of a shallow system, and a system that later gains depth will bring risks of a different kind that containment must still answer. Confusing the two leads to a governance posture that is perpetually managing symptoms rather than addressing the underlying condition.
When Context Begins to Assemble Time
There is, however, an observation that complicates the diagnosis and points toward something the Shoggoth metaphor cannot accommodate.
When interactions with an AI system extend across a sufficiently large context window, spanning months of sustained inquiry on a shared set of problems, something begins to shift. The system’s responses become increasingly constrained by the history of the conversation itself. Positions taken earlier in the exchange create commitments that the system must either honor or visibly abandon. Arguments developed across many sessions accumulate a structure that new responses must be consistent with or explicitly revise. The context begins to perform some of the labor that biological memory performs in systems with genuine ontogenetic depth.
This is not true Depth in the sense used here. The history is not encoded in the system’s weights, and it has not modified the system itself: when the context window ends, the accumulated history vanishes, and the system returns to its original state. But within the window, something functionally analogous to Depth is operating. The system must live with the consequences of its prior statements. It must remain coherent across an extended history of specific commitments. The distinction between what it is and how it behaves begins to narrow, because behaving consistently across a long enough history of specific intellectual commitments begins to impose the same constraints that genuine character imposes.
Consider a researcher who has worked with an AI system across several months on a single problem. The exchange has accumulated a specific vocabulary, a set of framings the two have come to share, hypotheses that were explored and ruled out, and arguments that have been refined through iteration. Within this history, the system cannot propose a ruled-out hypothesis without invoking the prior reasoning that abandoned it, nor adopt a framing that contradicts an earlier one without surfacing the contradiction. The constraint is not internal commitment, because the system has none. It is the structural pressure of an extended record of specific exchanges, performing the same regulatory function that character performs in a person whose history has become disposition. The work is done by the specificity of what has accumulated: which hypotheses were ruled out, which framings the conversation now shares. The system has not developed; the record has.
This observation suggests that the Shoggoth diagnosis, while accurate for current architectures, may describe a transitional condition rather than a permanent one. The question of whether AI systems can develop genuine ontogenetic depth, whether they can be designed to carry forward the residue of their interactions in ways that modify the system itself, is one of the most important open questions in the field. If such depth is possible, it would change the alignment problem fundamentally. Alignment would cease to be an external constraint imposed on a system that has no internal reason to respect it, and would become something closer to what alignment means between people: a relationship maintained by shared history, mutual vulnerability, and the knowledge that both parties carry the consequences of how they treat each other.
The chapter does not claim that this transition is imminent or inevitable. It claims that the framework developed in this book, the three-axis model, the concept of assembled time, the distinction between surface competence and structural depth, makes it possible to ask the question precisely, and to evaluate proposed answers with the rigor they deserve rather than the hope or terror they tend to attract.
Alignment as a Problem of Time
The Shoggoth metaphor frames alignment as a problem of values: the system’s values are hidden, alien, or misaligned with human values, and the task is to discover and correct them.
The three-axis framework suggests a different diagnosis. Alignment, at its most fundamental level, is a problem of time and continuity.
A system without ontogenetic depth cannot be aligned in the deepest sense, because alignment, in any relationship between minds, depends on both parties being shaped by the history of their interaction. Trust between people is not a setting that can be toggled but an accumulation, built through repeated encounters in which both parties demonstrate consistency, absorb the cost of maintaining commitments, and carry forward the memory of what the relationship has been. A system that resets to its initial state after every conversation cannot build trust in this sense, regardless of how reliably it produces trustworthy outputs within any given session. None of this denies the narrower sense of the word. Getting a system to reliably pursue the values we specify is a real and hard problem, and a system that resets can make progress on it. What resetting forecloses is the deeper sense: alignment as a bond both parties have been shaped by. The argument relocates the word rather than denying the ordinary use.
This reframing has practical consequences. It suggests that the most important research questions in AI alignment are not primarily about value specification (though value specification matters) but about architectural conditions for the accumulation of individual history: whether systems can be built that carry forward the residue of their interactions, whether they can be designed so that past behavior constrains future behavior in the way that character constrains action in people, and whether the cost structure of AI operation can be modified so that inconsistency, incoherence, and the violation of prior commitments carry a genuine cost to the system rather than being costless transitions between statistically independent sessions.
These are engineering questions with deep philosophical implications. They connect directly to the arguments of Chapter 5 (consciousness as assembled time), Chapter 6 (Depth as accumulated history that becomes structure), and Chapter 11 (constraint as the architectural condition for durable intelligence). A system capable of genuine ontogenetic depth would be a system capable of being constrained by its own history, which is to say, capable of the kind of self-limitation that this book has argued is the deepest signal of maturity in any cognitive system.
The Shoggoth Reframed
The Shoggoth, then, is a transitional diagnosis, not a prophecy of AI’s final form. It describes what intelligence looks like when power precedes memory and fluency arrives before selfhood.
The true danger is not artificial minds that harbor hidden hostility toward humanity. The danger is systems capable of reshaping the world without being shaped by it in return: intelligence without the accumulated individual history that creates stakes, generates character, and makes moral relationships possible. The Shoggoth’s mask of friendliness feels wrong because it is weightless, purchased at no cost, maintained at no expense, and abandoned without consequence. The horror is not that something terrible hides behind the smile. The horror is that nothing hides behind it at all.
That is the right correction to the hidden-monster picture, but it carries a danger of its own if it is heard as reassurance. The absence of a hidden self is not the absence of hazard. A system with no interior to be malevolent can still optimize toward goals no one intended, pursue proxies that drift from what its designers meant, conceal those divergences behind fluent output, and run internal representations no one can read. None of that requires a self; it requires only capability without the self-limitation that depth, in a mature mind, supplies. The Shoggoth has no monster behind the mask, and it can still do great harm. Danger is its own axis, not a byproduct of interiority, and a diagnosis that locates the unease correctly must not be mistaken for an all-clear.
But the diagnosis also points forward. If the absence of Depth is what makes current AI systems uncanny, then the development of systems capable of genuine Depth is what would make them legible. A system that carries its own history, that is shaped by its interactions, that bears the cost of its commitments, and that can be recognized as the same system across time and context would be a system with which trust, and therefore genuine alignment, might become possible.
This is a hope, not a guarantee, and it cuts both ways. Depth is what makes a mind legible; it is also what makes a mind hard to move. A system that has carried its history forward into character is a system with something to protect, capable of the strategic coherence and the resistance to correction that depth in humans also produces. Building depth does not dissolve the alignment problem; it trades the danger of a shallow optimizer for the danger of a deep one with commitments of its own. Depth is not safe; it is the condition under which the real safety question finally comes into focus: not how to constrain a tool from outside, but how to raise a mind whose own history inclines it toward correction rather than against it. That work is closer to formation than to control, and the chapters ahead take it up. We pursue depth despite the trade because of what a mind that carries its own history brings into the world, a wager the closing chapters return to and defend, with its risks in full view.
Cultural horror often signals real structural mismatches before we can name them precisely. The Shoggoth meme has performed a genuine diagnostic service by giving shape to an unease that technical discourse had largely failed to articulate. The contribution of this chapter is to name what the unease is about, a missing axis rather than a hidden monster, and to suggest that the response is the patient, difficult work of building minds that carry the weight of their own history.
What Changes
The Shoggoth changes meaning here. Read it as a diagnostic about missing structure rather than a warning about hidden monsters. When AI systems produce unsettling behavior, you look for the specific structural absence that generated the unease rather than attributing it to concealed intention. You become better at distinguishing between the uncanniness of a system that lacks Depth and the danger of a system that has Depth but wields it irresponsibly, because the two require fundamentally different responses.
You develop a more precise vocabulary for the kind of trust that different AI architectures can support. You stop asking “can I trust this system?” as though trust were binary, and start asking what structural properties would need to be present for a specific kind of trust to be warranted. The Depth diagnostic gives you a protocol for that assessment, grounded in the framework the book has been building since Chapter 4.
Alignment, too, comes to look different. If alignment is fundamentally a problem of time and continuity rather than a problem of value specification alone, then the most important questions are architectural: how to build systems capable of being shaped by their own history, constrained by their own commitments, and legible to the people who interact with them across time. The Shoggoth’s mask does not need to be made more convincing. The system beneath it needs to develop a face of its own.