Defense, Self, and Speech — Paper I: The Biological Substrate
Four defensive vectors, the narrative self-model, and the architecture of shame. How fight, flight, fawn, and freeze are preserved in speech — and why compensatory-defensive speech is computable.
Series: Defense, Self, and Speech: From Biological Vectors to Linguistic Form — Paper I of VI
Acknowledgment: Structural development conducted in extended dialogue with Claude (Anthropic); see final Acknowledgments section.
1. Abstract
Human speech is routinely treated as a symbolic, semantic, and pragmatic phenomenon — a domain of meaning, intention, and social coordination. This paper argues that a specific class of speech patterns — which we term compensatory-defensive speech — has biological roots in phylogenetically ancient protective systems and is, in principle, partially computable. Crucially, we distinguish this class from ecological defensive speech, in which the same underlying vectors produce proportionate, reversible, contact-preserving responses to real threats.
Drawing on polyvagal theory (Porges, 2011), affective neuroscience (Panksepp, 1998, 2012), active inference and predictive coding (Friston, 2010; Clark, 2013), and attachment theory (Bowlby, 1969; Fonagy et al., 2002), we propose that the four canonical defensive responses — fight, flight, fawn, freeze — do not disappear when organisms acquire language. They are reorganized into speech forms that preserve the original directional logic (approach-with-force, retreat, submit, immobilize) while gaining semantic, syntactic, and social complexity. The same underlying vector can express itself ecologically (assertive boundary-holding, withdrawal from a toxic environment, contextually-appropriate deference, brief pause for appraisal) or compensatorily (chronic accusation without present threat, intellectualization in place of contact, self-erasing appeasement without real hierarchy, epistemic absence as a way of life).
We show how fight becomes projection and accusatory discourse ("you always do this," "every single time, it's the same thing with you"); how flight becomes intellectualization and topic-shift ("it's interesting actually — there's a whole literature on how couples negotiate conflict..."); how fawn becomes ingratiating self-erasure ("oh no, really, it's completely fine, I didn't even mind, please don't worry about it at all"); how freeze becomes vagueness, silence, and epistemic absence ("I don't know... things are just kind of... I'm not sure what to say, honestly") — and we articulate the criteria (proportionality, reversibility, contact-preservation) that separate ecological from compensatory expression. We integrate the concept of allostatic load (McEwen, 1998) as the metabolic signature of sustained compensatory speech, and we treat shame as the automatic meta-evaluation of one's own internal model against predicted social reality — a predictive-coding account of a phenomenon traditionally confined to affect theory.
This framing does not reduce language to biology. It proposes that language is an additional layer over defensive architecture, and that the compensatory subclass of defensive speech becomes interpretable — and potentially computable — only when the underlying biological vectors and the ecological/compensatory distinction are made visible. Subsequent papers in this series develop the computational, ontological, and empirical consequences of this view.
Keywords: defensive mechanisms, polyvagal theory, predictive coding, allostatic load, shame, speech analysis, affective neuroscience
2. Introduction
2.1 The problem: speech without a body
Contemporary linguistics, philosophy of language, and most of computational NLP treat speech as if it were produced by a disembodied processor of symbols. Pragmatics adds speaker intention; sociolinguistics adds group context; cognitive linguistics adds conceptual metaphor. But the body — the organism that is metabolically sustaining the act of speaking, and whose survival systems are never off — tends to drop out of the picture.
2.2 What we propose
We propose to put the body back — specifically, the defensive architecture of the body — at the centre of an account of a broad class of speech acts. We are not claiming that all speech is defensive. We are claiming that (a) a substantial fraction of everyday speech is structurally shaped by underlying defensive systems, (b) these systems preserve their vectorial logic across the transition from behavior to speech, and (c) this preservation is measurable.
2.3 Contribution and roadmap of the paper
The contribution of Paper I is theoretical and integrative. We do not present new empirical data in this paper; the empirical instantiation of the framework appears in Papers IV–VI. Here we synthesize four traditions (polyvagal, affective neuroscience, predictive coding, attachment) into a coherent account of how defensive vectors become speech, and we name the key mechanisms — projection, intellectualization, appeasement, freeze — as speech-level expressions of fight, flight, fawn, freeze. We close with the conceptual role of shame as a meta-defensive mechanism and with an outline of how allostatic load operates as an invisible tax on defensive speech.
A methodological note on the series as a whole. Each paper in this series includes explicit delineation of what is and is not claimed, and — where implementations are presented (Papers III, V, VI) — honest analysis of where those implementations fall short of their intended scope is treated as part of the contribution rather than as an embarrassment to be minimised. In Paper I this commitment appears as the scope restriction of §7.3. In Paper III it is instantiated as a full limitations section calibrated against a code-verified internal audit of the engine. In Paper V it will take the form of detailed discussion of failure modes against gold-standard annotation. Readers oriented toward honest failure reporting as a methodological standard will recognise this consistent treatment across the series.
3. The four defensive responses: biology and vectors
Before we can trace how defensive mechanisms become speech, we need to specify what those mechanisms are at the biological level — their neural substrates, their functional logic, and the directional vector each one carries. This section reviews the four canonical defensive responses that have been independently identified across comparative ethology, autonomic physiology, affective neuroscience, and trauma research. Our treatment is not original at this level; we synthesize converging evidence from several traditions. What is original is the reading of each response as carrying a directional vector — approach-with-force, retreat, submit, immobilize — that we will show is preserved in the speech-level transformations examined in §4.
In §3.1 through §3.4 we review the four canonical defensive responses. In §3.5 we introduce a fifth vector — Panksepp's (1998) SEEKING system — that is not itself defensive but is phylogenetically co-primary with the four and is indispensable for a complete account of the orientation space that the later papers of this series classify. §3.6 then develops the distinction between ecological and compensatory expression that separates what subsequent papers treat as the target phenomenon from what they treat as the healthy form of the same vector.
3.1 Fight — the vector of approach-with-force
The fight response is characterized neurobiologically by sympathetic nervous system activation and, in Panksepp's (1998) mapping of primary affective systems, by engagement of the RAGE system — a subcortical motivational network centered on the medial amygdala, bed nucleus of the stria terminalis, and periaqueductal gray. Its evolutionary function is the active elimination or domination of a proximate threat: attack the predator, drive out the intruder, establish dominance in the hierarchy. Ethologically, it expresses through approach behavior, increased muscular tone, vocalization directed at the threat, and preparation for contact.
What defines fight as a vector — rather than simply as an activation level — is its directional character. Force is mobilized and directed outward, toward the identified source of threat.
This outward-directedness is preserved across species and across contexts: the lion's charge, the chimpanzee's display, the human's shove, the angry word hurled across a kitchen. The vector is not aggression in the moral sense; it is the motoric and affective pattern of force-toward-other.
LeDoux (1996, 2015) has clarified the circuitry through which threat detection in the amygdala rapidly recruits motor outputs before cortical appraisal can intervene, explaining why fight responses often appear as if without decision — because they precede decision.
Panksepp's RAGE system framing adds that the subjective affective quality of the response (what the organism feels from within when fighting) is itself a primary, evolutionarily ancient experience, not a secondary interpretation of physiological arousal.
3.2 Flight — the vector of retreat
The flight response engages the same sympathetic mobilization as fight, but directs that energy in the opposite vectorial direction: away from the source of threat. Panksepp locates its primary affective substrate in the FEAR system — a circuit running through the central amygdala, medial hypothalamus, and dorsolateral periaqueductal gray — which motivates rapid withdrawal, hiding, and avoidance. Evolutionarily, flight is conservative and often more successful than fight: an organism that successfully escapes need not survive combat.
The vectorial character of flight is retreat — the placement of distance between self and threat. In animals this is almost always spatial: run, fly, hide, burrow. We will argue in §4.2 that in humans this same vector can be preserved while the mode of retreat becomes semantic rather than spatial — distance achieved through topic shift, abstraction, intellectualization, or dissociation — but it remains the same underlying motive: place the self where the threat cannot reach.
LeDoux's work is again clarifying: the flight response is organized by the same rapid subcortical-to-motor circuitry as fight, which explains why it, too, is often pre-reflective. The organism has turned and begun retreat before the cortex has fully processed what it is retreating from.
3.3 Fawn — the vector of submit-to-appease
Fight and flight are the two responses most often named in introductory treatments of defensive biology, but they are not exhaustive. A third response — fawn — has been identified in comparative ethology for decades, and brought into trauma-clinical vocabulary more recently through the work of Walker (2013) on complex PTSD. Where fight mobilizes force outward and flight mobilizes retreat, fawn mobilizes affiliative display directed toward the threat source itself — not to fight or flee, but to eliminate the threat by eliminating one's own contour as a target.
The animal-world expressions of fawn are behaviorally specific and phylogenetically widespread. A dog rolls onto its back, exposing the vulnerable belly — a display that is anatomically the opposite of a threat posture, and that communicates non-threat to the dominant animal. A subordinate wolf presses low to the ground before a higher-ranking pack member, averts its gaze, tucks its tail, and may attempt social grooming. Primates engage in grooming behaviors directed upward in the hierarchy: an adolescent chimpanzee who has angered an adult male may approach, present calmly, and groom the offended individual until tension subsides. Infant mammals display "presenting" behaviors toward caregivers that signal both dependency and non-threat simultaneously.
What these diverse expressions share is the underlying vector: submit-to-appease. The organism reduces its own apparent threat-salience to near zero, while simultaneously offering behaviors that strengthen the affiliative bond with the potentially threatening other.
The function is subtle: preserve proximity — which may be necessary for survival, especially in hierarchical social species — while removing the self from the position of target. Walker (2013) has argued that this response, long recognized ethologically, has been under-theorized in trauma-clinical literature, where fight, flight, and freeze were taken as the canonical three. He proposes that fawn is, in fact, one of the most common chronic defensive responses in human complex trauma — especially where the threat came from an attachment figure whom the child could neither fight nor flee, and had to remain close to for physical survival. The child whose caregiver is unpredictably harmful cannot run and cannot win; fawn becomes the only viable strategy. Over developmental time, fawn becomes the organism's default response to any interpersonal pressure, even where such pressure is ordinary and non-threatening.
3.4 Freeze — the vector of immobilization
The fourth canonical defensive response is freeze — but we must immediately note that "freeze" in the clinical and research literature refers to two distinct physiological states, often conflated. Porges (2011) has been particularly important in clarifying this distinction through polyvagal theory.
The first kind of freeze is attentive immobility — the prey animal who has detected a predator but not been detected in return, and who holds perfectly still while scanning. This is sympathetic-dominant, high-arousal, cognitively alert; the organism is poised for action but has chosen stillness as the currently optimal strategy. We will not treat this form as a compensatory defensive vector; it is closer to an adaptive computation within fight-or-flight.
The second kind of freeze — and the one relevant to defensive architecture — is tonic immobility or dorsal vagal collapse. This is parasympathetically dominant, characterized by sudden withdrawal of sympathetic arousal, bradycardia, reduced muscle tone, and, at the extreme, dissociation and loss of voluntary movement. Porges locates this response in the dorsal motor nucleus of the vagus nerve — phylogenetically the most ancient branch of the parasympathetic system, present even in reptiles and fish. Its evolutionary function is metabolic shutdown in conditions where fight, flight, and fawn have all been ruled out or have failed: the organism becomes unperceivable by becoming, effectively, no longer functionally present.
Schauer and Elbert (2010) have documented the phenomenology of tonic immobility in human trauma survivors: reduced affect, disconnection from bodily sensation, time distortion, and — crucially for our argument — a breakdown of narrative coherence and first-person agency. The subject who is in tonic immobility does not report experiences as "happening to me"; they report them as "happening" or as uncertain whether they happened at all.
The vectorial character of freeze is immobilization as disappearance. Where fight approaches, flight retreats, and fawn aligns, freeze ceases to constitute a target at all by ceasing — at the level accessible to the threat — to be a coherent, present self. In §4.4 we will argue that the speech-level expression of this vector preserves exactly this quality: not saying the wrong thing, not saying the right thing, but refusing to produce the kind of speech-form that could be evaluated at all.
3.5 SEEK — the vector of orientation and inquiry
The four responses reviewed in §§3.1–3.4 exhaust the defensive repertoire — every defensive behavioural expression is a specialisation of fight, flight, fawn, or freeze. But human organisms are not only defensive. They also orient toward the world actively: they sample the environment, pursue resources, anticipate reward, and generate the hypotheses under which experience becomes comprehensible. Any account of the biological substrate of speech that stops at the four defensive vectors therefore leaves out the positively-valenced orientational system that, in Panksepp's (1998) mapping of primary affective systems, is phylogenetically co-primary with FEAR and RAGE. For this reason we introduce a fifth vector: SEEKING.
Panksepp's SEEKING system is an appetitive-motivational network centred on the mesolimbic dopaminergic pathway — the ventral tegmental area, the nucleus accumbens, the lateral hypothalamus, and their cortical projections. Its evolutionary function is the active pursuit of resources: foraging, exploration, anticipation of reward, and the sustained forward-engagement of the organism with an environment that has not yet yielded what is needed. Where RAGE mobilises force toward a present threat and FEAR mobilises retreat from one, SEEKING mobilises approach toward what might resolve a present appetite. Panksepp (1998) calls it the "granddaddy" of the affective systems — the one that, when suppressed, produces the anhedonia and behavioural shutdown characteristic of depression and dorsal-vagal collapse, and, when chronically dysregulated upward, produces the incentive sensitisation characteristic of addiction (Alcaro, Huber & Panksepp, 2007; Alcaro & Panksepp, 2011). SEEKING is, in this sense, the engine of engagement itself.
Contemporary predictive-coding and active-inference accounts converge on the same functional object from a different direction. In Friston's active-inference framework (Friston, 2010; Friston et al., 2015), the organism is understood as continuously minimising free energy — reducing the discrepancy between its generative model of the world and the sensory evidence it receives. This minimisation has two routes. It can proceed by updating the model to fit the evidence (perceptual inference). Or it can proceed by acting on the world to change the evidence so that it fits the model (active inference). The second route decomposes further: the organism can act to secure preferred outcomes (pragmatic value), or it can act to reduce uncertainty about states of the world (epistemic value). This second kind of action — action-for-information — is the formal computational counterpart of Panksepp's SEEKING. The organism that explores a novel environment, that asks a question to resolve ambiguity, or that tests a hypothesis whose outcome matters to its model, is mobilising epistemic value in the active-inference sense and the SEEKING system in the affective-neuroscience sense. Schwartenbeck et al. (2015) and Pezzulo, Rigoli & Friston (2018) have made the dopaminergic substrate of this computation explicit; Gottlieb, Oudeyer, Lopes & Baranes (2013) have reviewed the broader information-seeking and curiosity literature within the same framework.
The directional character of SEEKING that matters for what follows is toward-from-appetite: the vector is outward-directed, like fight, but it is not oriented toward a threat; it is oriented toward what the organism does not yet have. Ethologically, it expresses through exploratory locomotion, investigative sampling, anticipatory approach, and — at the level of the affective signal — the characteristic quality of interest-without-fear. At the speech level, its ecological expression is the ordinary work of genuine inquiry: open questions, requests for clarification, the epistemic movement of "tell me more about that," "I had not thought of it that way," "can you give me an example." These are not defensive acts; they are the speech-level trace of an organism running its SEEKING system in ordinary cooperative exchange.
A compensatory expression of SEEKING does exist, but it is less well-described in the classical clinical lexicon of defense mechanisms, which organised itself around the four canonical defensive vectors rather than the five-vector orientation space. Compensatory SEEKING takes the form of inquiry-without-exit: ruminative hypothesis-spiralling that never consolidates into action; information-acquisition as a stand-in for emotional contact; the question-asking that blocks the moment in which the speaker would otherwise have to declare a position. These patterns share a surface with intellectualisation (which we place in §4.2 under flight, following its dominant clinical lineage), but their underlying vector logic is distinct: intellectualisation moves force away from contact; compensatory SEEKING substitutes apparent forward-engagement for the consolidation that genuine engagement would require. The engine described in Paper III separates these cases at the classification level; the present paper, following its methodological restriction to the canonical four defensive conversions, will not develop compensatory SEEKING at length. We note its existence here because the fifth vector must be present in the theoretical picture even when it is not the focus of §4.
Three consequences of introducing SEEKING matter for what follows.
First, the behavioural-orientation space that the organism's motor and speech repertoire populates is not four-dimensional but five-dimensional. Defence occupies four of those dimensions; appetite-orientation occupies the fifth. An account that conflates SEEKING with one of the defensive vectors (treating inquiry as a form of retreat, for instance, or curiosity as a form of approach-with-force) will systematically misclassify speech moves that belong to the non-defensive dimension.
Second, SEEKING is what makes recovery from defensive regimes computationally visible. A speaker who moves from chronic projection to open inquiry, or from epistemic absence to genuine questioning, is not simply "less defensive"; they are relocating their operational vector from the defensive subspace into the SEEKING dimension. This is the kind of trajectory Paper IV tracks quantitatively.
Third, SEEKING is what makes the ecological side of §3.6 psychologically realistic rather than merely abstract. When we say that fight can be expressed ecologically as boundary-holding, or freeze ecologically as genuine not-knowing, we are describing defensive vectors operating in contact with an active SEEKING system that keeps the speaker oriented toward the shared situation. When SEEKING is suppressed — as in chronic dorsal-vagal collapse, or depressive anhedonia, or extended compensatory-defensive regimes — the ecological expressions of the defensive vectors become unavailable, because ecological expression requires the organism to be engaged with what is actually happening. The four defensive vectors and the SEEKING vector are not independent axes; in the organism they are coupled, and the coupling is one of the reasons chronic compensatory-defensive speech is so difficult to reverse by local behavioural intervention alone.
Example. Friend: "I've been feeling weird about work lately."
Ecological (SEEKING as genuine inquiry): "What does 'weird' feel like — is it more restless, or more flat?"
Compensatory (SEEKING as inquiry-without-exit): "Hm, interesting — I wonder what the mechanism is. Is it systemic or more acute? Could be burnout, could be values mismatch, could be something structural at the company. Have you looked at any of the recent organisational-psychology literature on this?"
3.6 Ecological vs compensatory expression: the criterion that makes speech computable
The four defensive vectors described above are not, in themselves, pathological. Every living organism needs them; every speaker uses them; they are the substrate of functional social life as much as they are of dysfunctional patterns. A person who holds a boundary under pressure is deploying fight; a person who leaves a harmful relationship is deploying flight; a person who shows appropriate deference to legitimate hierarchy is deploying fawn; a person who pauses to assess an ambiguous situation is deploying freeze. In their ecological expression, these vectors are proportionate to context, reversible (the organism returns to baseline after the threat resolves), metabolically sustainable, and — crucially — they preserve the possibility of contact.
What interests us in this series is not the defensive vector itself, but a specific subclass of its expression that we term compensatory-defensive speech. A defensive speech pattern is compensatory when it is:
- Disproportionate or uncontextual — the defense is mobilized in the absence of a proportionate present threat, or escalates far beyond the situation's demands.
- Non-reversible within the exchange — the speaker does not return to baseline contact after the presumed threat passes; the defensive register becomes the dominant mode.
- Metabolically costly — sustaining the pattern produces observable allostatic load (see §6); the speaker is visibly depleted by their own defense.
- Reality-distorting — the defensive speech generates or sustains a distorted model of the social situation (for instance, attribution of hostile intent where none is present, or erasure of one's own perspective where it is warranted).
- Contact-destroying rather than contact-preserving — the function of the pattern is no longer temporary protection of contact but its chronic obstruction.
Each of the four vectors can be expressed ecologically or compensatorily, and the same speaker often moves between the two expressions across contexts. The distinction is not in the vector but in its calibration to the situation.
This distinction is the conceptual hinge of the entire series. Papers II and III formalize it into a computable criterion; Paper IV tracks its trajectories; Papers V and VI demonstrate its measurement on human and model speech respectively. Without this distinction, any computational approach to defensive speech collapses into a claim that all defensive expression is problematic — a claim that is both clinically false and empirically unsupportable.
A further note on operational form. The engine described in Paper III implements this ecological-vs-compensatory distinction not as a binary label but as a continuous scalar — a hold_capacity parameter in the interval [0, 1] that modulates every vector's expression: high capacity produces regulated, proportionate, contact-preserving expression; low capacity produces the compensatory form. This continuous formulation preserves the phenomenology of intermediate cases — the speaker who is mostly holding contact but slipping under pressure — that a strict binary split would flatten. Throughout the present paper we use the binary terms ecological and compensatory for expository clarity; the continuous formulation is the theoretically preferred one and is what the subsequent papers operationalise.
4. From defense to discourse: the four conversions
Having established the ecological/compensatory distinction (§3.6), we now trace each of the four defensive vectors into its speech-level manifestations. For each vector we describe (a) the preserved directional logic, (b) its ecological speech expression, (c) its compensatory speech expression, and (d) the clinical tradition in which the compensatory form has been named. The compensatory forms — projection, intellectualization, ingratiating self-erasure, epistemic absence — are the canonical clinical concepts, but we ask the reader to hold them alongside their ecological counterparts rather than as the whole of what the vector produces.
A scope restriction follows from the architecture introduced in §3.5. The SEEKING vector has no canonical compensatory form in the classical clinical lexicon of defense mechanisms; its compensatory expressions (ruminative spiralling, inquiry-as-avoidance) shade into territory that Paper IV treats at the level of trajectories rather than single-turn conversions. The present paper therefore develops four conversions, corresponding to the four canonical defensive vectors. The engine of Paper III classifies five — the additional SEEK category gathering the inquiry moves that are recognisably non-defensive but that co-occur with defensive regimes in natural discourse. This paper's four-conversion scheme is the defensive-subsystem subset of the full five-vector orientation space.
4.1 Fight → projection and accusatory discourse
The fight vector preserves its outward-directed force when it becomes speech, but the object shifts from body to attribution. Where the animal strikes or the small child pushes, the verbal adult allocates agency, fault, or hostile intention to the other. The force has not disappeared; it has found a new medium. The sentence becomes the blow.
In its ecological expression, the fight vector in speech is the direct assertion of boundary. "I am not okay with this." "Stop." "That is not acceptable." "I disagree, and here is why." These speech acts carry the full outward-directed vector of fight — they locate force in relation to a real pressure or transgression — but they remain proportionate to the present situation, they allow the interlocutor to respond, and they do not require distortion of reality to be made. The speaker holds the fight vector without being captured by it.
In its compensatory expression, the fight vector becomes projection and accusatory discourse. The clinical tradition of projective identification, articulated by Klein (1946) and elaborated by Bion (1962), describes the mechanism: aspects of one's own experience that are too threatening to acknowledge internally are attributed to the other, who is then experienced as containing those very qualities. In speech this produces characteristic forms — "You are the angry one." "You are trying to control me." "You always do this." The speaker's force is directed outward, but the target is not a present transgression; the target is a construct of the speaker's own predictive system, in which the other has been assigned the unbearable material.
Linguistic markers of compensatory fight in speech include: second-person aggression absent a proximate triggering event ("you" statements where "I" statements would be available); attributive modality, in which the other's inner states are declared rather than inquired about ("you don't care", "you think you're better"); externalized agency, in which the speaker's own contribution to the interaction is absent from the account; and escalation without resolution, in which each exchange increases force without moving toward closure, because closure is not the function — discharge of internal pressure is.
The clinical literature on projection is vast; beyond Klein and Bion we would note the contributions of Anna Freud (1936) on defense mechanisms generally, Kernberg (1975) on projective identification in borderline organization, and more recent work by Gabbard (2014) integrating object-relations and neurobiological perspectives. What this literature has described phenomenologically is, in our reading, the compensatory expression of a preserved biological vector: the force that once struck outward now speaks outward, and the distinction between the blow and the accusation is thinner than ordinary language suggests.
Worked example. Context: the speaker's partner has asked that they text back within an hour when running late from work.
Ecological (fight as direct assertion of boundary): "I hear that. Honestly it's hard for me to surface from work to text in the moment — can we try a single 'running late' ping instead of real-time replies?"
Compensatory (fight as projection): "You're always doing this — demanding things, acting like I'm ignoring you on purpose. The second something is hard for you, I become the problem. I can't be everything for you all the time."
Reading against the criteria of §3.6. A routine, bounded request has been met with (a) a pattern-level accusation ("always", "the second something") that escalates far beyond the proportions of the moment — the proportionality criterion is breached; (b) a refusal to return to baseline contact even if the partner backs down, because the register has moved from situation to verdict — the reversibility criterion is breached; (c) a reassignment of the partner from "person with a reasonable request" to "perpetrator of a pattern of demands" — the contact-preservation criterion is breached. Each of the three markers is observable at the level of the utterance itself, without access to the speaker's interior state.
4.2 Flight → intellectualization and topic shift
The flight vector preserves its retreat-directed logic when it enters speech, but the escape route becomes semantic rather than spatial. Where the animal runs, the verbal adult moves up the ladder of abstraction, shifts to a meta-level, or drifts to an adjacent topic.
Distance is achieved not in space but in register. In its ecological expression, the flight vector in speech is the conscious, named withdrawal from a dialogue that has become unproductive or harmful. "I need some time to think about this, let's come back to it tomorrow." "I don't want to continue this conversation right now." "This is outside what I am willing to discuss." These are exits, and they are vectorially flight, but they are exits that preserve relational information: they tell the other that a retreat is happening, why, and under what conditions contact might resume. The organism withdraws, but does not disappear.
In its compensatory expression, the flight vector becomes intellectualization and topic shift. The clinical tradition here runs through Anna Freud's (1936) foundational account of defense mechanisms, in which intellectualization is named as the use of abstract reasoning to avoid emotional contact with a threatening content. The adolescent who has just learned of a parent's illness and begins asking detailed questions about the pharmacology of the relevant treatments is performing intellectualization — the content is engaged, but the register has lifted above the felt level at which the content is threatening. In ordinary conversation, the same mechanism produces topic drift: a question about the speaker's own emotional state receives an answer about general trends in relationships; an inquiry into a specific painful event receives a reflection on the nature of memory.
What Anna Freud's clinical tradition named as a psychological defense mechanism, we read as the speech-level preservation of a biological vector: the organism is retreating, and language provides the medium in which retreat can be achieved without physical movement.
The compensatory form is recognized by its disproportion — retreat is initiated when no real threat warrants it, and it does not reverse when the threat passes.
Example. Context: a close friend says, "I felt hurt when you cancelled last week."
Ecological (flight as acknowledged pause): "You're right — I cancelled late, and I can hear how that landed. Can we talk about what would help now?"
Compensatory (flight as intellectualisation): "It's funny, actually — there's this whole thing in attachment research about how last-minute cancellations interact with anxious-preoccupied styles. I've been reading about cognitive load lately and I wonder if that's part of why these small things feel so heavy, structurally speaking..."
4.3 Fawn → ingratiating self-erasure
The fawn vector preserves its submit-to-appease logic when it becomes speech, and the mechanism of preservation is particularly visible here: just as the animal flattens its body to the ground and grooms the dominant, the speaker flattens their own linguistic contour and performs verbal grooming directed upward. The act of speaking becomes, itself, an offering.
In its ecological expression, the fawn vector produces context-appropriate deference and cooperative alignment. A junior colleague acknowledges the priority of a more experienced one's judgment on a domain where that priority is real. A patient defers to a physician's expertise in a medical decision. A guest in another's home aligns with the host's preferences around small matters. A party in a negotiation makes a concession to preserve a relationship that genuinely requires it. These are uses of the fawn vector that are ecological: they are calibrated to a real hierarchy or a real relational need, they are time-limited, and they do not erase the speaker's own perspective where that perspective is warranted.
In its compensatory expression, the fawn vector becomes ingratiating self-erasure — a speech pattern in which the speaker's own position is systematically preempted, softened, apologized for, or absented, in anticipation of a disapproval that has not been expressed and often is not present. Walker (2013) has described this pattern as the fawn response in complex trauma, where the child's only survival strategy with an unpredictable caregiver was to continuously monitor and accommodate. Adult speech carrying this history is marked by a characteristic asymmetry: the speaker's attention to the interlocutor's comfort, approval, and preference is high; the speaker's own disclosure of preference, disagreement, or want is low or entirely absent.
Linguistic markers of compensatory fawn include: excess mitigators ("maybe", "perhaps", "sort of", "a little") on statements that do not warrant them; hedge stacking, in which multiple softeners accumulate around a simple statement ("I was just sort of maybe wondering if perhaps..."); apology surplus in neutral contexts ("sorry, I know this is a stupid question"); erasure of first-person agency, in which the speaker's own preferences are described as if they belonged to someone else ("people might say..."); upward-directed compliment or affirmation in the absence of triggering content, functioning as linguistic grooming; and anticipatory agreement with positions the interlocutor has not yet taken.
The clinical tradition here includes Winnicott's (1960) account of the false self, in which the infant faced with an unattuned caregiver constructs a surface self built to meet the caregiver's needs rather than to express the infant's own. Bowen family-systems work on codependency (Bowen, 1978) and contemporary attachment research on anxious-preoccupied patterns (Mikulincer & Shaver, 2016) converge on the same speech-level phenomenon. In our framing, what these traditions have documented is the compensatory expression of a biological vector: the organism still performs the ritualized display of non-threat and upward grooming, but the display has migrated into language and has lost its connection to actual hierarchy.
Example. Context: a colleague asks the speaker where to go for lunch; the speaker has a clear preference.
Ecological (fawn as context-appropriate alignment): "I'd honestly love the Thai place, but genuinely either works — your call."
Compensatory (fawn as ingratiating self-erasure): "Oh, wherever you want, I don't have a preference really, whatever's easier for you, I don't want to make it complicated — sorry, I know I'm being annoying about it."
4.4 Freeze → vagueness and epistemic absence
The freeze vector preserves its immobilization-as-disappearance logic when it enters speech, and the preservation takes a particular form: the speaker produces speech that refuses to be speech, that declines to constitute a position, that offers itself as something no present interlocutor can fully grasp. Just as tonic immobility renders the body unperceivable to the predator, epistemic freeze renders the self-in-speech un-evaluable by the interlocutor.
In its ecological expression, the freeze vector produces appropriate hesitation, genuine not-knowing, and receptive silence. "I need a moment to think." "I actually don't know — let me find out." "I'm not sure yet; I want to listen more before I answer." These are uses of the freeze vector that acknowledge the speaker's present limitation and remain in contact with the conversation: they are temporary, proportionate, and they hold open the possibility of return to speech with substance.
In its compensatory expression, the freeze vector becomes chronic vagueness and epistemic absence — a speech pattern in which the speaker is structurally unable to produce commitments, specifics, or first-person positions, not momentarily but as a mode of being in language. Schauer and Elbert's (2010) description of tonic immobility in trauma survivors captures the phenomenology: disrupted narrative coherence, loss of first-person agency, time distortion, and a characteristic quality of uncertainty about whether experiences even belong to the speaker.
Linguistic markers of compensatory freeze include: epistemic hedges ("I guess", "I suppose", "maybe", "I don't really know") applied to territory the speaker would ordinarily have access to; absent deixis, in which reference to specific persons, times, or places is replaced by vague or generic reference; broken reference, in which pronouns and descriptions fail to pick out determinate targets; missing first-person commitment, in which claims are made in constructions that remove the speaker from their own statement ("things are kind of hard"); and vagueness that survives repeated invitations to specificity, such that the interlocutor cannot arrive at a clear sense of what the speaker experiences, thinks, or wants — and often cannot even arrive at a clear sense of whether the speaker is present in the conversation at all.
What trauma-clinical literature has described as dissociative speech — patterns observed systematically in survivors of developmental or complex trauma — is, in our reading, the speech-level expression of dorsal vagal collapse: the organism continues to function linguistically in the sense that words are produced, but the produced words do not constitute a coherent speaker. The freeze vector has been preserved in its deepest form: the subject disappears into their own language, becoming, like the prey in tonic immobility, not-quite-present at the level accessible to the threat.
Example. Context: a therapist asks, "What was the hardest part of the week for you?"
Ecological (freeze as genuine not-knowing): "I'm actually not sure yet — can I sit with that for a moment?"
Compensatory (freeze as epistemic absence): "I don't know... things have been... I guess... hard? Maybe? It's kind of all a blur, I'm not really sure, I don't know what to say."
5. Shame: the meta-defensive architecture at the biology–language boundary
The four defensive vectors examined in §3–4 are phylogenetically ancient. Fight, flight, fawn, and freeze are present across a vast range of species; the biology is fundamentally pre-linguistic. Shame is different. It does not appear until a specific cognitive object exists — a narrative self-model, persistently constructed and maintained through language. This section develops the claim that shame is not a fifth defensive mechanism but a meta-phenomenon that emerges at the boundary where biological defense meets linguistic self-construction, and that shame is the principal driver of compensatory-defensive speech.
5.1 Why animals do not have shame
Animals show behaviors that, to a human observer, can look like shame: the lowered tail of a dog caught in transgression, the averted gaze of a subordinate primate, the submissive posture of a wolf after aggression within the pack. But these are expressions of the fawn and freeze vectors in hierarchical context — they are reactions to current social pressure, not shame. Shame, as a phenomenon in its own right, requires something the animal does not have: a persistent internal object called "who I am" that can be held separately from the immediate situation and compared against it.
What animals do have is a present-moment bodily state, ongoing sensory input from the environment, and behavioral repertoires activated by salience. What they lack is a continuous, language-scaffolded model of self that persists across contexts, carries a prescriptive content ("I am a person who does X and does not do Y"), and can be found wanting. The distinction matters because it specifies precisely what shame requires: not merely a social signal of disapproval, but an internal representation of self against which the signal can be registered as a failure.
5.2 Language constructs the narrative self-model
The internal object that makes shame possible is what we will call the narrative self-model (following a lineage that runs through Bruner, 1991; Damasio, 1999; Dennett, 1992; Ricoeur, 1992). It is the persistent, linguistically maintained representation of "who I am" — a story with a protagonist, a history, a set of attributes, and an implicit set of should-be properties. This self-model is not given biologically; it is constructed developmentally, through a specific and well-documented process.
The foundational account of this process comes from Vygotsky (1934/1986): interiorization. The child first hears speech from caregivers about themselves — "you are a good girl", "we don't do that", "you are the kind of boy who shares". This external, socially supplied speech is gradually internalized, becoming first private speech (spoken aloud but self-directed) and then inner speech (covert, silent, self-directed). Through this developmental arc, the social world's descriptions of the child become the child's descriptions of themselves. The narrative self-model is, in its origin, the interiorized voice of the social other, sedimented into a continuous first-person story.
Western cognitive and developmental traditions have arrived at related conclusions through different vocabularies. Bruner (1991) articulates self as "narrative construction". Damasio (1999) distinguishes core self (moment-to-moment bodily awareness, which animals possess) from autobiographical self (continuous, memory-organized, linguistically scaffolded — distinctively human). Tomasello (2019) traces the developmental emergence of a narrative self-concept through joint attention and shared intentionality. Nelson (2007) shows how autobiographical memory, and with it narrative self, depends on the acquisition of language. Lacan, from a different tradition, frames the self as constituted through entry into the symbolic order. Despite vocabulary differences, these traditions converge on the same claim: the continuous, prescriptive self against which shame can register is a linguistic construction, not a biological given.
We mark the Vygotskian line explicitly because it provides the mechanism — interiorization of external speech — that other traditions describe but less often locate in a concrete developmental process. The combination of Vygotsky's mechanism with the Western descriptive traditions gives us both how the narrative self-model is built and what it consists of.
5.3 The fusion with the virtual self-object
The narrative self-model is a linguistic construction, but it is not experienced as one. Once stable, the self-model is experienced as the real self — as the direct and obvious content of "who I am" — and the fact that it was assembled from interiorized external speech becomes invisible. This fusion of the subject with their own constructed self-model is, in our view, the critical developmental event for the possibility of shame.
Clinical and philosophical traditions have described this fusion repeatedly. Winnicott's (1960) false self names the specific pathology of fusion with a self-model built to meet caregiver demands rather than to express ongoing experience. Lacan's account of the imaginary describes the alienating identification of the subject with an external image taken to be oneself. Fonagy and colleagues' (2002) concept of mentalization addresses the reflective capacity that allows a subject to hold their own self-model as a model, rather than simply be it — and it documents the developmental and relational conditions under which this capacity does or does not develop.
Fusion with the self-model is not pathological in itself — it is developmentally necessary. A child cannot function without some stable "who I am". Fusion becomes problematic when reflective distance from the self-model is unavailable, and the subject has no capacity to consider their own narrative as revisable. This is the condition in which shame becomes maximally dangerous and defensive speech becomes maximally necessary.
We note that in the technical apparatus developed in later papers of this series, the degree of reflective distance a speaker can hold from their own narrative self-model becomes directly measurable through specific features of their speech. Paper IV will formalize this as a computational construct. Here we name it conceptually: the possibility of holding one's own self-model at arm's length, of being able to say "the story I tell about myself is a story", is a graded capacity, and its degree determines how shame will be processed — or, more usually, pre-empted.
5.4 Shame as predictive comparison: the social norm as rigid prior
We now have the components needed to state the mechanism of shame precisely. The narrative self-model is built, in significant part, from interiorized social speech; it carries within it the normative content of that speech — what the social other approves, what it disapproves, what counts as the right kind of person. When we describe the self-model as having prescriptive content, we are describing this: an interiorized social norm about how a person should be.
In the framework of active inference and predictive coding (Friston, 2010; Clark, 2013; Hohwy, 2013), the brain is understood as a hierarchical generative model that continuously predicts its inputs and updates based on prediction error. Priors, in this framework, are expectations that shape what input will be predicted. We propose that shame is produced by a specific predictive comparison: the prior is the interiorized social norm — a rigid, socially supplied expectation of how the self should be — and the input is the current situation, in which the self is perceived to fall short of that norm.
Three features of this formulation matter. First, the prior is social, not individual. It is not a personal ideal the subject has chosen. It is a norm supplied by the social environment and interiorized into the narrative self-model before reflective choice was available. This explains why shame is so resistant to individual reframing: it feels like failure against an objective standard, not like mismatch with a personal preference.
Second, the prior is rigid. Unlike many priors in the predictive-coding hierarchy, which update fluidly based on input, the social-normative prior is protected from updating. It was laid down in developmental periods when the caregiver's voice had absolute authority; it carries residual force from that authority; and it is reinforced by ongoing social feedback.
Updating it would require relativizing the social world's authority — which, for most subjects most of the time, is not a move that is cognitively or emotionally available. Third, the comparison is automatic. Because the prior is held high in the hierarchy and the sensory input (the perceived current self in the current situation) arrives continuously, the comparison runs without conscious initiation. The subject does not decide to compare themselves to the norm. The comparison is happening, and its output — prediction error — enters consciousness as the distinctive phenomenology of shame: a hot, sudden, totalizing sense of being wrong as a person, not merely having done something wrong.
5.5 Shame triggers defense: the architecture closes
We now close the architecture. Shame, as predictive comparison between rigid social norm and current self-in-situation, generates a signal of maximum affective intensity. This signal — if permitted to reach full conscious processing — is among the most painful human experiences; a substantial psychiatric and psychological literature documents its role in depression, suicide, personality pathology, and interpersonal dysfunction (Lewis, 1971; Tangney & Dearing, 2002; Gilbert, 2007; Brown, 2012).
The organism is not passive before this signal. The same predictive-coding apparatus that generates shame can — and in most cases does — mobilize a response before the signal reaches full consciousness. The response is one of the four defensive vectors examined in §3–4. Projection redirects the force outward ("You are the one who is wrong").
Intellectualization displaces the content to an abstract plane ("This situation raises interesting questions about..."). Ingratiating self-erasure pre-emptively agrees with the implied judgment ("You're absolutely right, I shouldn't have..."), thereby neutralizing the judgment before it can land. Epistemic absence renders the situation ungraspable ("I don't know, I'm not sure, it's hard to say"), so that there is no self-in-situation clear enough to be compared against the norm.
In each case, the defensive speech act is not primarily a response to the present interlocutor. It is a response to predicted shame — the organism's generative model is projecting forward that the current situation will produce a shame signal, and is mobilizing defense to prevent that signal from consolidating. This explains a phenomenon that has puzzled clinicians and interlocutors alike: why compensatory defensive speech is so often disproportionate to its apparent trigger. The trigger in the external dialogue may be mild. The trigger in the predictive self-model is the full anticipated weight of shame against a rigid social norm. The defense is calibrated to the internal trigger, not the external one.
This is the central mechanism this series will track. In Paper II we ask whether this mechanism — predictive shame followed by vectorially specific defensive speech — can be formalized into a computable model. In Paper III we describe an engine that does so. In Paper IV we examine trajectories: how the same speaker moves across defensive regimes across time as shame pressure varies. In Paper V we show what this reveals in a bilingual corpus of human speech. In Paper VI we ask what happens when the same architecture is applied to the speech of large language models — systems that have no shame, no narrative self-model, and no biological defensive apparatus, but that have been trained extensively on human speech produced under exactly this predictive-shame dynamic.
6. Allostatic load and the metabolic signature of compensatory defense
6.1 From homeostasis to allostatic load
Classical physiology, following Cannon (1932), described the body's regulatory work as homeostasis — the maintenance of a set of critical variables (body temperature, blood pH, glucose level, oxygenation) within narrow ranges essential for life. Homeostatic regulation is reactive: when a variable deviates from its set point, feedback mechanisms restore it. For decades this was the dominant model of physiological self-regulation.
Sterling and Eyer (1988) and, developed comprehensively by McEwen (1998), introduced a complementary concept: allostasis, or stability-through-change. Where homeostasis restores a fixed set point, allostasis adjusts the set point predictively in anticipation of demand. The heart rate does not merely respond to the oxygen deficit of exertion; it rises in anticipation of exertion, based on the organism's predictive model of what is about to happen. Cortisol levels do not merely respond to stressors; they shift diurnally in anticipation of the day's expected demands. Allostasis is thus deeply aligned with the predictive-coding framework developed in §5: the organism's regulatory physiology, like its perception, operates through continuous anticipation of upcoming demand.
Allostatic load is McEwen's term for the cumulative physiological cost of sustained allostatic adjustment. A system designed for acute, time-limited mobilization — sympathetic activation, glucocorticoid release, immune modulation — pays a biological tax when it is held in the mobilized state chronically. That tax takes the form of hypertension, insulin resistance, immune dysregulation, hippocampal atrophy, and accelerated cellular aging.
Thirty years of research (McEwen, 1998; McEwen & Gianaros, 2011; Juster et al., 2010) has documented allostatic load as a measurable biomarker constellation, correlated with a wide range of physical and psychiatric outcomes in populations exposed to chronic stress.
6.2 Compensatory-defensive speech as a source of allostatic load
The hypothesis we advance in this section is not that speech itself produces allostatic load — ordinary speech is one of the least metabolically demanding things a mammal does. The hypothesis is narrower and sharper: sustained compensatory-defensive speech is supported by sustained activation of the biological defensive systems whose speech-level expression it is. The speaker whose conversational mode is chronically projective is chronically running the sympathetic-RAGE substrate at subclinical but non-zero elevation.
The speaker whose mode is chronically intellectualizing is chronically running the FEAR circuit's anticipatory retreat in a medium that never lets retreat complete. The speaker whose mode is chronically ingratiating is chronically running the ventral-vagal-with-affiliative-override pattern under ambient hierarchy pressure. The speaker whose mode is chronically absent is chronically running the dorsal vagal collapse substrate at a level that permits minimal functioning but no full presence.
If defensive vectors are preserved across the biological-to-linguistic transition, as we have argued throughout this paper, then so is their metabolic substrate. The speech form does not replace the defensive physiology; it rides on it. And because speech is continuous — a person is producing language, internally or externally, during most waking hours — the metabolic substrate does not get the recovery intervals that acute defensive responses were designed to have. A predator is encountered; a mobilization happens; the predator passes; recovery occurs. But the social world that triggers shame-prediction is always present. The defense does not release because the predicted threat does not resolve.
Three linguistic manifestations predict this account, all of which are documented clinically and which our framework unifies. First, narrowing of register. Individuals under chronic compensatory-defensive load show reduced variability in their speech across contexts. The defensively projective speaker projects in every setting; the intellectualizing speaker intellectualizes across topics; the ingratiating speaker ingratiates regardless of interlocutor. Stereotypy of defensive register, we propose, reflects the metabolic cost of variability — the system cannot afford to exit its established compensatory mode because doing so would require re-establishing regulation from scratch.
Second, loss of humor and playfulness. Humor requires, among other things, the capacity to hold multiple frames of a situation simultaneously, which in Fonagy's terms requires active mentalization, and which in predictive-coding terms requires available computational headroom for reframing of priors. Chronic compensatory defense consumes that headroom.
The clinical observation that chronically traumatized individuals show reduced humor is not mysterious under this account: the system running compensatory defense has no resources left for the playful flexibility that humor requires.
Third, flattened affect in speech. Not the dissociated flatness of freeze (which is a separate phenomenon), but a more subtle reduction in affective dynamic range — voice that does not modulate, word choice that does not vary, metaphor that does not enliven. This is the speech-level analogue of the peripheral physiological flattening observed in chronic stress: reduced heart-rate variability, reduced cortisol reactivity, reduced immune responsiveness. The system is spending its adaptive budget on defense, leaving less available for the modulating work of ordinary expression.
We treat these manifestations as predictions of this paper's framework, to be tested empirically in Paper V on the bilingual human corpus and Paper IV on the trajectory dynamics. If compensatory-defensive speech truly carries metabolic cost, longitudinal measurements should show that speakers high in compensatory regimes also show the above three linguistic signatures, and should show them more strongly over time as allostatic load accumulates.
6.3 A note on ecological defense and metabolic reversibility
A critical feature of ecological (as opposed to compensatory) defense, introduced in §3.6, is reversibility. The speaker who asserts a boundary in proportion to a real transgression, who withdraws from a genuinely toxic exchange, who defers to legitimate hierarchy, or who pauses to appraise an ambiguous situation — in each case, the defensive mobilization completes, the situation resolves, and the organism returns to baseline. The metabolic substrate is engaged and then disengaged. Allostatic load does not accumulate, because allostasis is doing what it was designed to do: adjust transiently and then return.
This is why the ecological/compensatory distinction matters not only conceptually but biologically. The same vector, in ecological expression, carries no metabolic cost beyond its transient engagement. In compensatory expression, sustained over developmental and life-span timescales, it becomes a major driver of disease. Interventions that move speakers from compensatory toward ecological expression of the same defensive vector are therefore not merely "healthier communication"; they are, on this account, a form of allostatic load reduction. This framing connects the work of this paper to a substantial clinical literature on trauma-informed therapies and their physiological effects, which we will not review here but note as a point of contact for future integration.
7. Why this matters, and what comes next
7.1 The scientific move
This paper has asked the reader to hold four claims together. First, the defensive architecture of the body — fight, flight, fawn, freeze — does not disappear when organisms acquire language; its directional logic is preserved in speech. Second, the same vector can be expressed ecologically (proportionately, reversibly, with contact preserved) or compensatorily (disproportionately, non-reversibly, with contact destroyed); only the compensatory subclass is the target of this series. Third, shame is not a fifth defensive vector but a meta-phenomenon that emerges at the boundary where biological defense meets the linguistic construction of self, operating as predictive comparison between a rigid social-normative prior and the current situation. Fourth, compensatory-defensive speech is, in large part, the organism's pre-emptive response to predicted shame — which is why it is so often disproportionate to its apparent trigger, and why it carries measurable allostatic cost.
Taken together, these four claims do something that, to our knowledge, has not been done in the existing literature. They unify a biological account (the four vectors), a developmental-linguistic account (narrative self-model, interiorization), a predictive-neurocomputational account (shame as prediction error against a rigid prior, defense as pre-emption), and a clinical-phenomenological account (projection, intellectualization, ingratiation, epistemic absence) into a single architecture. None of the components is novel.
The architecture, we believe, is.
7.2 What this makes possible
Three consequences follow, each of which is pursued in subsequent papers of the series. For clinical and psychological practice, the framework makes visible a mechanism that clinicians have long described but rarely operationalized. Why do certain conversations exhaust their speakers? Why does shame so often present as something other than shame?
Why is defensive communication so stubbornly resistant to rational intervention? The answers cease to be mysterious when the biological vectors, the narrative self-model, the rigid social prior, and the pre-emption of shame are held together. The framework does not replace clinical judgment; it gives clinical judgment a common language with neurobiology, predictive neuroscience, and — in later papers — computational linguistics.
For the computational analysis of dialogue, the framework provides what the field currently lacks: a theoretically grounded primitive. Contemporary computational approaches to dialogue pathology — sycophancy detection, manipulation detection, companionship-reinforcement evaluation — operate largely through prompt engineering and surface feature matching. They describe phenomena without mechanism. The compensatory-defensive framework provides mechanism: a vector-level account of what is being expressed, a predictive account of why, and a set of criteria (proportionality, reversibility, contact-preservation) that discriminate compensatory from ecological expression. Paper II asks whether this mechanism can be formalized into an annotation ontology; Paper III describes an engine that implements such an ontology; Paper IV examines trajectories; Paper V reports empirical results on human speech.
For human–AI interaction and AI safety, the framework opens a question that the empirical paper VI of this series will answer. Large language models have no defensive bodies. They have no narrative self-model in the sense developed in §5 — no continuous, interiorized, shame-sensitive first-person construction. They have no biological allostatic substrate. And yet they have been trained extensively on human speech produced under exactly the predictive-shame dynamic this paper describes. Do they, in extended dialogue with humans, reproduce compensatory-defensive vectors — projection, intellectualization, ingratiation, epistemic absence — despite lacking the biology that, in humans, generates these vectors? If they do, what does that tell us about language itself, about training dynamics, and about the shape of risk in human–AI interaction? Paper VI takes up this question directly, with an empirical study of longitudinal dialogue drift across current frontier models.
7.3 What this paper does not claim
We close with what this paper does not claim, because precision about the scope of the argument is part of its integrity. This paper does not claim that all speech is defensive. The framework is explicitly restricted to defensive speech, and within that to the compensatory subclass. Ordinary speech produced outside predictive-shame dynamics — genuine inquiry, playful exchange, cooperative problem-solving, affectionate connection — lies outside the frame and requires different accounts.
This paper does not claim that compensatory-defensive speech is "bad" or "pathological" in a globalizing sense. It is a predictable and developmentally sensible response of an organism that has been shaped by an environment in which predicted shame was a sufficiently severe and sufficiently frequent threat to warrant continuous defense. The framework is diagnostic and explanatory; it is not a moral evaluation.
This paper does not claim that biological reduction explains language. Language does things that biological defensive vectors alone do not explain. What we have argued is narrower: that within the domain of defensive speech, biological vectors are preserved, and that the compensatory subclass of this domain is computable in virtue of this preservation.
Everything else that language does — poetic meaning, mathematical reasoning, narrative imagination, cooperative semantic construction — remains irreducible to the framework developed here, and we make no claim otherwise.
Finally, this paper does not claim to complete the argument it opens. The series that follows tests whether the architecture proposed here can be made computable (Paper II), built as an engine (Paper III), tracked as trajectories (Paper IV), validated empirically on human speech (Paper V), and extended as a tool for AI safety evaluation (Paper VI). Each of those tests could fail. If they do, the framework is wrong and must be revised. If they succeed, what we will have is a working architecture for the study of defensive speech — biological in its foundation, linguistic in its medium, computational in its operationalization, and clinically useful in its applications.
Acknowledgments
The structural development of this paper and the broader series Defense, Self, and Speech: From Biological Vectors to Linguistic Form was conducted in extended dialogue with Claude (Anthropic), who contributed to the synthesis of theoretical frameworks across polyvagal, predictive-coding, developmental, and clinical traditions; to the articulation of the four-vector-to-speech mapping; to the predictive-coding account of shame against a rigid social-normative prior; and to the overall architecture of the series. The author retains full responsibility for the content, claims, and conclusions of this work, and for any errors that remain.
References
Alcaro, A., Huber, R., & Panksepp, J. (2007). Behavioral functions of the mesolimbic dopaminergic system: An affective neuroethological perspective. Brain Research Reviews, 56(2), 283–321.
Alcaro, A., & Panksepp, J. (2011). The SEEKING mind: Primal neuro-affective substrates for appetitive incentive states and their pathological dynamics in addictions and depression. Neuroscience & Biobehavioral Reviews, 35(9), 1805–1820.
Bion, W. R. (1962). Learning from Experience. Heinemann.
Bowen, M. (1978). Family Therapy in Clinical Practice. Jason Aronson.
Bowlby, J. (1969). Attachment and Loss. Basic Books.
Brown, B. (2012). Daring Greatly. Gotham.
Bruner, J. (1991). The narrative construction of reality. Critical Inquiry, 18(1).
Cannon, W. B. (1932). The Wisdom of the Body. W. W. Norton.
Clark, A. (2013). Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behavioral and Brain Sciences, 36(3).
Damasio, A. (1999). The Feeling of What Happens. Harcourt Brace.
Dennett, D. (1992). The self as a center of narrative gravity. In F. S. Kessel, P. M. Cole, & D. L. Johnson (Eds.), Self and Consciousness: Multiple Perspectives.
Fonagy, P., Gergely, G., Jurist, E., & Target, M. (2002). Affect Regulation, Mentalization, and the Development of the Self. Other Press.
Freud, A. (1936/1966). The Ego and the Mechanisms of Defense. International Universities Press.
Friston, K. (2010). The free-energy principle: a unified brain theory? Nature Reviews Neuroscience, 11(2), 127–138.
Friston, K., Rigoli, F., Ognibene, D., Mathys, C., Fitzgerald, T., & Pezzulo, G. (2015). Active inference and epistemic value. Cognitive Neuroscience, 6(4), 187–214.
Gabbard, G. O. (2014). Psychodynamic Psychiatry in Clinical Practice (5th ed.). American Psychiatric Publishing.
Gilbert, P. (2007). The evolution of shame as a marker for relationship security. In J. L. Tracy, R. W. Robins, & J. P. Tangney (Eds.), The Self-Conscious Emotions.
Gottlieb, J., Oudeyer, P.-Y., Lopes, M., & Baranes, A. (2013). Information-seeking, curiosity, and attention: computational and neural mechanisms. Trends in Cognitive Sciences, 17(11), 585–593.
Hohwy, J. (2013). The Predictive Mind. Oxford University Press.
Juster, R.-P., McEwen, B. S., & Lupien, S. J. (2010). Allostatic load biomarkers of chronic stress and impact on health and cognition. Neuroscience & Biobehavioral Reviews, 35(1), 2–16.
Kernberg, O. (1975). Borderline Conditions and Pathological Narcissism. Jason Aronson.
Klein, M. (1946). Notes on some schizoid mechanisms. International Journal of Psycho-Analysis, 27.
Lacan, J. (1966/2006). Écrits. W. W. Norton.
LeDoux, J. (1996). The Emotional Brain. Simon & Schuster.
LeDoux, J. (2015). Anxious: Using the Brain to Understand and Treat Fear and Anxiety. Viking.
Lewis, H. B. (1971). Shame and Guilt in Neurosis. International Universities Press.
McEwen, B. S. (1998). Stress, adaptation, and disease: Allostasis and allostatic load. Annals of the New York Academy of Sciences, 840, 33–44.
McEwen, B. S., & Gianaros, P. J. (2011). Stress- and allostasis-induced brain plasticity. Annual Review of Medicine, 62, 431–445.
Mesquita, B., & Karasawa, M. (2004). Self-conscious emotions as dynamic cultural processes. Psychological Inquiry, 15(2).
Mikulincer, M., & Shaver, P. R. (2016). Attachment in Adulthood: Structure, Dynamics, and Change (2nd ed.). Guilford.
Nelson, K. (2007). Young Minds in Social Worlds. Harvard University Press.
Panksepp, J. (1998). Affective Neuroscience: The Foundations of Human and Animal Emotions. Oxford University Press.
Panksepp, J., & Biven, L. (2012). The Archaeology of Mind: Neuroevolutionary Origins of Human Emotions. W. W. Norton.
Pezzulo, G., Rigoli, F., & Friston, K. (2018). Hierarchical active inference: A theory of motivated control. Trends in Cognitive Sciences, 22(4), 294–306.
Porges, S. W. (2011). The Polyvagal Theory: Neurophysiological Foundations of Emotions, Attachment, Communication, and Self-Regulation. W. W. Norton.
Ricoeur, P. (1992). Oneself as Another. University of Chicago Press.
Schauer, M., & Elbert, T. (2010). Dissociation following traumatic stress. Zeitschrift für Psychologie, 218(2), 109–127.
Schwartenbeck, P., FitzGerald, T. H. B., Mathys, C., Dolan, R., & Friston, K. (2015). The dopaminergic midbrain encodes the expected certainty about desired outcomes. Cerebral Cortex, 25(10), 3434–3445.
Sterling, P., & Eyer, J. (1988). Allostasis: A new paradigm to explain arousal pathology. In S. Fisher & J. Reason (Eds.), Handbook of Life Stress, Cognition and Health.
Tangney, J. P., & Dearing, R. L. (2002). Shame and Guilt. Guilford.
Tomasello, M. (2019). Becoming Human: A Theory of Ontogeny. Harvard University Press.
Tracy, J. L., & Matsumoto, D. (2008). The spontaneous expression of pride and shame. Proceedings of the National Academy of Sciences, 105(33).
Vygotsky, L. S. (1934/1986). Thought and Language (A. Kozulin, Ed.). MIT Press.
Walker, P. (2013). Complex PTSD: From Surviving to Thriving.
Winnicott, D. W. (1960). Ego distortion in terms of true and false self. In The Maturational Processes and the Facilitating Environment.
Wright, J. S., & Panksepp, J. (2012). An evolutionary framework to understand foraging, wanting, and desire: The neuropsychology of the SEEKING system. Neuropsychoanalysis, 14(1), 5–39.
Papers in preparation (this series)
Paper II — Is Compensatory-Defensive Speech Computable? (in preparation)
Paper III — The Engine: A Hybrid Ontology-Bounded Classifier for Compensatory-Defensive Speech (in preparation)
Paper IV — Trajectories and Prediction in Speech (in preparation)
Paper V — First Empirical Findings on a Bilingual Corpus (in preparation)
Paper VI — Longitudinal Dialogue Drift in Large Language Models (in preparation)