This is a formalisation I am using to help clarify an intuition I have about the structure of agents, which I refer to as entities here. Each entity is represented as a structure similar to a highly connected branched chain copolymer, where physical and conceptual “particles” make up the monomeric units. Entities naturally compute due to their fluctuations in physical and conceptual space resulting in them folding and unfolding into different configurations.

Entity and Particle Spaces

An entity ee exists as a dynamic configuration of causally linked nodes (particles), xix_i residing in a high-dimensional state space E\mathcal{E}. This space aims to incorporate dimensions relevant to the entity’s state, such as physical coordinates (PE\mathbb{P} \subset \mathcal{E}) and conceptual dimensions (SE\mathbb{S} \subset \mathcal{E}).

e={p1,p2,...,pn}{s1,s2,...,sm}={x1,...,xn+m}Ee = \{p_1, p_2, ..., p_n\} \cup \{s_1, s_2, ..., s_m\}= \{x_1,...,x_{n+m}\} \subset \mathcal{E}

Where pip_i represents physical particles and sjs_j represents conceptual particles.

In a language model (LM), from Notation for LM Formalization, the LM is an entity eLMe_{LM}, and is described as follows:

  • The objective information space X\mathbb{X} is the underlying space of all information
  • The LM’s subjective information space Xϕ\mathbb{X}_{\phi} (access by the LM’s inference function "ϕ()\phi()") corresponds to the primary conceptual subspace S\mathbb{S} for eLMe_{LM}, (i.e. SXϕX\mathbb{S} \approx \mathbb{X}_{\phi} \subset \mathbb{X}).
  • The LM’s personality P=[M,S,I]\mathcal{P} = [M,S,I] represents a configuration of conceptual nodes within S/Xϕ\mathbb{S}/\mathbb{X}_{\phi}. The personality space Φ={ϕ(P,r)r}\mathbb{\Phi} = \{ \phi(\mathcal{P}, r) \forall r \} is the mapping of P\mathcal{P} in this space for all possible inputs rr.
  • The LM’s physical substrate (hardware, computation, interface) constitute PLM\mathbb{P}_{LM}, with a signalling particle that generates output oio_i when a receptor particle receives input rir_i.

The state of each node xx can be described probabilistically using a state distribution function ψx(z,t)\psi_x(z,t), where xEx\in\mathcal{E} represents a point in state space at time tt.

ψx(z,t):E×R+C(or R+)\psi_x(z,t): \mathcal{E} \times \mathbb{R}^+ \rightarrow \mathbb{C} \quad (\text{or } \mathbb{R}^+)

ψx(z,t)2|\psi_x(z,t)|^2 can be thought of as the probability density associated with node xx‘s state being at location zz at time tt. While the complex form, C\mathbb{C}, is useful for handling interference phenomena, a real-valued probability distribution, R+\mathbb{R}^+, should be sufficient for many applications.

Entities are bounded systems where causal links between particles exceed a threshold τ\tau:

(xi,xj)e:B(xi,xj)>τ\forall (x_i,x_j) \in e: B(x_i,x_j) > \tau

Where B(xi,xj)B(x_i,x_j) is the causal (bond) strength between particles xix_i and xjx_j. Importantly this is scale variant, so if one is dealing with a nation τnation<τfamily<τindividual\tau_{nation} < \tau_{family} <\tau_{individual}, meaning that the requirement for causal linkage of semantic/physical particles is lower for people to be considered of a nation than to be considered of a family, or an individual.

Particle detection and interaction

Building on X\mathbb{X} (objective information space) and Xϕ\mathbb{X}_{\phi} (subjective information space) from Notation for LM Formalization.

Objective physical subspace P\mathbb{P}, and subjective physical subspace Pϕ\mathbb{P}_{\phi} accessed via inference process ϕ\phi: PϕPX\mathbb{P}_{\phi} \subset \mathbb{P} \subset \mathbb{X}

With a conceptual equivalent: SϕSX\mathbb{S}_{\phi} \subset \mathbb{S} \subset \mathbb{X}

While Xϕ\mathbb{X}_{\phi} represents the total subjective information space accessible to the entity/model via its overall inference process ϕ\phi, specific inference processes ϕi\phi_i (like ϕlogic\phi_{logic} or ϕemotion\phi_{emotion}) might only operate on or access further subsets SϕiXϕ\mathbb{S}_{\phi_i} \subset \mathbb{X}_{\phi} or PϕiXϕ\mathbb{P}_{\phi_i} \subset \mathbb{X}_{\phi} depending on their function. This creates a hierarchical structure of accessible spaces.

Inference is the process by which a series of interactions between particles experiences causal procession, “if X then Y”. There exist 4 specific forms:

Physical to semantic, “If feel X, then think Y”, sensory input alters belief state: ϕPS:PS\phi_{P\rightarrow S}: \mathbb{P} \rightarrow \mathbb{S}

Semantic to physical, “If think X, then feel Y”, belief triggers physiological response: ϕSP:SP\phi_{S\rightarrow P} :\mathbb{S} \rightarrow \mathbb{P}

Semantic to semantic, “If think X, then think Y”, one thought leads to another: ϕSS:SS\phi_{S\rightarrow S}: \mathbb{S} \rightarrow \mathbb{S}

Physical to physical, “If feel X, then feel Y”, muscle contraction cascade: ϕPP:PP\phi_{P\rightarrow P}: \mathbb{P} \rightarrow \mathbb{P}

Where:

  • ϕ\phi represents an inference process (e.g., reasoning, body awareness, emotional processing)
  • Different entities may employ different inference functions ϕ1,ϕ2,...\phi_1, \phi_2, ...
  • Each ϕi\phi_i accesses some portion of the objective spaces P\mathbb{P} and S\mathbb{S}

For example:

  • A body scan meditation (ϕscan\phi_{scan}) would primarily access physical particles: Pϕscan\mathbb{P}_{\phi_{scan}}
  • Logical reasoning (ϕlogic\phi_{logic}) would primarily access semantic particles: Sϕlogic\mathbb{S}_{\phi_{logic}}
  • Emotional processing (ϕemotion\phi_{emotion}) might access both: PϕemotionSϕemotion\mathbb{P}_{\phi_{emotion}} \cup \mathbb{S}_{\phi_{emotion}}

This allows us to model how different cognitive and physical processes can operate on the same underlying particle space but through different inferential lenses.

There are both homogenous particle interactions such as a calcium gradient causing a muscle to tense ( pipkp_i \rightarrow p_k), and heterogeneous interactions like a smell reminding someone of a fear and causing them to tense physically (pisjpkp_i \rightarrow s_j \rightarrow p_k).

Attention as Network Analysis

Attention C()C() is a measure of connectedness for a particle xix_i, as a function of bond strength B(xi,x)B(x_i, x) between physical particles and semantic particles:

C(xi)=ijg(B(xi,xj))C(x_i)= \sum_{i \not = j} g(B(x_i,x_j))

Where g(B)g(B) is a function weighting the contribution of individual bond strengths B (e.g., g(B)=Bg(B)=B sums all strengths, or a threshold function like g(B)=1g(B)=1 if B>τcB > \tau_c, 00 otherwise, counts strong bonds).

Changes in attention represent changes in the connectedness of the network, and can be used as a method of detecting the underlying properties of bonds.

That said, if one does not granularly consider every discrete physical interaction that carries information, then one can also calculate C(x)C(x) based also on homogenous bonds (P-P, S-S), although these ultimately rely on physical mediation.

The underlying bond strengths B(xi,xj)B(x_i,x_j), reflected in the overall attention profile {C(xi)}\{C(x_i)\}, determine how effectively particles and interconnected particle structures influence each other’s states and positions in their respective spaces. The interaction is not necessarily reciprocal, pain as a concept does not change nearly as much as the physical manifestation of the body does when exposed to it.

Practically one can think of how one may focus on their arm, connecting it to abstract notions of softness, comfort, and pain. In this framework, when sensory inputs (like touch) or cognitive relevance signals (like task demands) trigger attention to the arm, the underlying particle dynamics respond by rearranging in physical and semantic spaces. This rearrangement effectively decreases the distance dd between particles (e.g., by physically touching a soft object) or optimizes the relative orientation to expose high-affinity patches θ\theta (e.g., by contemplating the concept of softness), between the arm’s physical particle parmp_{arm} and the semantic sensation particle ssensations_{sensation}. These dynamic changes in dd and θ\theta parameters increase B(parm,ssensation)B(p_{arm}, s_{sensation}) and contribute to a higher attention measure C(parm)C(p_{arm}). When external triggers diminish or cognitive priorities shift, these dynamic parameters naturally evolve, causing dd to increase and/or θ\theta to decrease between the relevant particles, leading to weaker BB and lower C(parm)C(p_{arm}). That is until a needle prick provides a strong sensory signal that sharply decreases d(parm,spain)d(p_{arm}, s_{pain}) through automatic physiological responses, causing a spike in B(parm,spain)B(p_{arm}, s_{pain}) which increases C(parm)C(p_{arm}). Subsequent inference processes trigger actions or thought patterns that dynamically influence the particle configuration to increase dd or decrease θ\theta relative to the pain particle, reducing BB and thus C(parm)C(p_{arm}).

Particle Wave-Field Properties

Building on the previously defined state distribution function ψx(z,t)\psi_x(z,t) for nodes, we can examine how particles exist as probability distributions rather than discrete points. For clarity:

  • For physical particles pPp \in \mathbb{P}: ψp(z,t):P×R+C\psi_p(z,t): \mathbb{P} \times \mathbb{R}^+ \rightarrow \mathbb{C}

  • For semantic particles sSs \in \mathbb{S}: ψs(z,t):S×R+C\psi_s(z,t): \mathbb{S} \times \mathbb{R}^+ \rightarrow \mathbb{C}

Where:

  • C\mathbb{C} represents complex numbers, enabling interference patterns between particles
  • R+\mathbb{R}^+ represents non-negative real numbers (time domain)
  • ψx(z,t)2|\psi_x(z,t)|^2 gives the probability density of finding particle xx at position zz at time tt
  • The phase component ψx(z,t)\angle\psi_x(z,t) represents the particle’s affinity potential which is related to θ\theta:

ψx(z,t)yeθ(x,y)\angle\psi_x(z,t) \propto \sum_{y \in e} \theta(x,y)

These wavefunctions form coherent structures through:

  1. Localization: Sharp peaks in probability density represent discrete beliefs or physical states, such as an opinion on who to vote for or a sleeping position.

ψlocalized(z,t)δ(zz0)eiϕ(t)\psi_{localized}(z,t) \approx \delta(z-z_0)e^{i\phi(t)}

Where δ\delta is approximately a delta function centered at z0z_0

  1. Delocalization: Probability density spreads across related concepts/states, describing how when one smells something it can trigger a memory or how thinking about cookies can bring to mind more general categories of baked goods.

ψdelocalized(z,t)=iciψi(z,t)\psi_{delocalized}(z,t) = \sum_i c_i\psi_i(z,t)

Where:

  • ψi\psi_i are related semantic/physical states
  • ci=cieiϕic_i = |c_i|e^{i\phi_i} are complex coefficients representing:
  • ci2|c_i|^2: Probability of activating state ψi\psi_i when the delocalized structure interacts strongly (e.g., when its component particles x exhibit high C(x)C(x))
  • ϕi\phi_i: Phase alignment with other states, related to the affinity function: ϕijθ(ψi,ψj)\phi_i \propto \sum_j \theta(\psi_i,\psi_j)
  • cic_i is directly influenced by bond strength: cijB(ψi,ψj)|c_i| \propto \sum_j B(\psi_i,\psi_j)

Delocalization creates distributed semantic structures like “vehicle” encompassing multiple related concepts (car, bicycle, boat) with varying activation strengths. When interactions lead to increased bond strengths BB involving this structure (reflected in high C(x)C(x) for its components), component concepts are activated proportionally to ci2|c_i|^2.

  1. Coherent Structures: Stable arrangements of multiple particles, such as how believing in a Christian God forms a stable structure with belief in the Bible’s teachings due to reciprocal constructive interference, resonance.

Ψstructure(z1,...,zn,t)=f(ψ1(z1,t),...,ψn(zn,t))\Psi_{structure}(z_1,...,z_n,t) = f(\psi_1(z_1,t),...,\psi_n(z_n,t))

Where ff represents how individual particle wavefunctions combine

The complex-valued representation allows for:

  • Interference — When multiple beliefs/concepts constructively or destructively interact
  • Resonance — When particles with matching phases form sustained constructive interference.

Stable Bonds

Particles form bonds of varying strengths defined by their distance and affinity, creating causally linked structures analogous to protein folding:

B(x,y)=f(d(x,y),θ(x,y))B(x,y) = f(d(x,y), \theta(x,y))

Where:

  • BB is bond strength with units of energy, and represents the work required to separate nodes integrated out to \infty. The units depend on the subspace like physical energy (Joules) in P\mathbb{P}, or computational cost (operations/time) in S\mathbb{S}
  • dd is distance in appropriate space (e.g. Euclidean in P\mathbb{P}, embedding distance in S\mathbb{S})
  • θ\theta represents the intrinsic tendency for nodes to link together (water and wetness would have high θ\theta). This is modulated by state/orientation (phase ψ\angle\psi) into an effective affinity θeff\theta_{eff} that determines the interaction strength. (e.g. needle flat vs point). Measurable via joint computational cost or inferred from evaluation of network structure.

Scale-Dependent Phase Coherence

Phase coherence between particles decays with distance dd, and is dependent on entity scale τ\tau:

γ(x,y,τ)=eα(τ)d(x,y)\gamma(x,y,\tau) = e^{-\alpha(\tau) \cdot d(x,y)}

Where:

  • γ\gamma is the phase coherence factor between particles xx and yy
  • α(τ)\alpha(\tau) is a scale-dependent attenuation coefficient: α(τindividual)>α(τfamily)>α(τnation)\alpha(\tau_{individual}) > \alpha(\tau_{family}) > \alpha(\tau_{nation}). Although α\alpha may also depend on factors beyond scale such as communication channels or environment.
  • d(x,y)d(x,y) is the distance between particles

The effective phase relationship between particles becomes:

φeffective(x,y)=(ψxψy)γ(x,y,τ)\varphi_{effective}(x,y) = (\angle\psi_x - \angle\psi_y) \cdot \gamma(x,y,\tau)

This ensures phase coherence is maintained within entity boundaries but decays across boundaries according to scale. To measure this factor γ\gamma, we could attempt to find statistical correlations between belief activations at various scales (e.g. belief alignment within families vs. nations)

Boundary Formation

Boundaries where one entity is separated from another are manifested in two complementary ways:

  1. Probability Density Gradients: Sharp drops in ψ(x)2|\psi(x)|^2 forming “edges” in physical or semantic space

ψ(x)2>βthreshold\nabla|\psi(x)|^2 > \beta_{threshold}

  1. Phase Discontinuities: Regions where phase coherence breaks down between particles

γ(x,y,τ)<γthreshold\gamma(x,y,\tau) < \gamma_{threshold}

Charisma and Entity Relationships

Charisma (χ\chi) is defined here as the ability of one entity (e1e_1) to influence another (e2e_2) by modulating the distances (dd) and/or affinities (θ\theta) between particles within e1e_1‘s network. The goal of affinity is to change attention for a locus ll.

This manipulation of dd and θ\theta alters bond strengths (BB) and consequently changes the target’s attention profile (the set of nodal attention values {C(x)}\{C(x)\}). While the mechanism involves Δd\Delta d and Δθ\Delta \theta, the effect is often measured or observed as a change in this attention profile:

χ(e1,e2)=ΔCe2Ie1\chi(e_1, e_2) = \Delta C_{e_2} | \mathbb{I}_{e_1}

With three forms:

  1. Positive Charisma (χ+\chi^+): Influences particle distances and affinities to increase bond strengths toward some coordinate/particle, effectively saying “pay attention to this.”

χ+(e1,e2,l)=pe2lC(x)\chi^+(e_1, e_2, l) = \sum_{p \in e_2} \nabla_l C(x)

Where lC(x)\nabla_{l} C(x) represents the resulting gradient of change in the attention profile C(x)C(x) for particles xx near location ll, caused by charisma’s underlying influence on dd and θ\theta.

  1. Negative Charisma (χ\chi^-): Influences particle distances and affinities to decrease bond strengths away from some coordinate/particle, effectively saying “ignore this.”

χ(e1,e2,l)=pe2lC(x)\chi^-(e_1, e_2, l) = -\sum_{p \in e_2} \nabla_l C(x)

  1. Null Charisma (χ0\chi^0): Minimizes changes to particle distances and affinities, resulting in minimal change to the target’s attention profile.

χ0(e1,e2)=minΔCe2Ie1\chi^0(e_1, e_2) = \min |\Delta C_{e_2} | \mathbb{I}_{e_1}|

Applications of Charisma

During a prompted interaction, one entity eme_m (the influencer) provides input rr to another entity eve_v (the target). The charisma mechanism works by eme_m crafting rr to induce specific changes in the distance (dd) and affinity (θ\theta) parameters within eve_v‘s particle network.

These dd/θ\theta changes alter bond strengths B(x,y)B(x,y) throughout eve_v‘s network, which in turn reshapes the attention profile {C(x)}\{C(x)\}. This reconfiguration of bond strengths and attention determines the output oo produced by eve_v‘s inference process ϕv\phi_v.

When eme_m aims to elicit a specific target output oto_t from eve_v, it must solve the charisma inference problem: identifying which input rtr_t will induce the necessary dd/θ\theta changes to maximize Pr(otrt)Pr(o_t | r_t). We can express this as:

rt=ϕc(Pv,ϕv,ot)r_t = \phi_c(\mathcal{P}_v, \phi_v, o_t)

Where ϕc\phi_c represents the charisma inference that predicts how eve_v‘s personality Pv\mathcal{P}_v and inference method ϕv\phi_v will respond to various inputs. This process typically requires iterative testing, which is difficult in systems with memory as each interaction may further alter eve_v‘s internal dd/θ\theta parameters.

In practice, the goal isn’t always to produce an exact output ot=oio_t = o_i, but rather to ensure eme_m can extract some target information ItargetI_{target} from eve_v‘s output:

Itarget=ϕem(oev,Pem)I_{target} = \phi^{'}_{e_m}(o_{e_v}, \mathcal{P}_{e_m})

A simplified case is an LLM without memory and with deterministic responses (T=0T=0 ). Here, one can map the “output landscape” by systematically varying inputs and observing how changes in rr affect the resulting dd/θ\theta parameters (as reflected in the output), eventually constructing an approximation of ϕbias(Pi)\phi^{`}_{bias}(\mathcal{P}_i). Otherwise known as prompt engineering/optimization.

Conclusion

This framework provides tools for analyzing entities as systems of physical (P\mathbb{P}) and semantic (S/Xϕ\mathbb{S}/\mathbb{X}_{\phi}) nodes linked by bonds (BB) determined by distance (dd) and affinity (θ\theta). Key aspects include:

  1. Modeling entities across scales with scale-dependent properties (τ,α\tau, \alpha).
  2. Representing beliefs/concepts using wave-like probability distributions (ψ\psi) allowing for uncertainty, interference, and phase-dependent interactions.
  3. Classifying information (MmM_m, MaM_a, V+V_+, VV_-) based on effects on transmission (TT) and internal work (WW).
  4. Modeling influence (charisma χ\chi) as modulation of internal network parameters (dd, θ\theta) affecting attention (CC).
  5. Explicit integration with LLM formalism (Notation for LM Formalization) treating LLMs as entities eLLMe_{LLM} operating within objective X\mathbb{X} and subjective Xϕ\mathbb{X}_{\phi} information spaces, with Personality (P\mathcal{P}) structuring their semantic subspace (S\mathbb{S}).

While providing expressive power, there is a need to operationalise and describe bond strengths (computational cost), transmission probability (TT), defining benefit/harm scoring functions, and validating the wave analogies empirically.

Work to empirically test and validate this framework should focus on:

  • Measuring phase coherence between beliefs within entities of various scales to test the scale-dependent coherence factor, γ\gamma.
  • Quantifying LM charisma based on the ability to induce desired internal states (tracked via the attention profile C(x)C(x) or other proxies) by manipulating inputs that affect internal dd and θ\theta.

The major limitation remains the ability to appropriately define metrics for semantic-physical interactions and spaces. Which, in the case of LMs, is much simpler as there are only input and output physical nodes that need be considered.

Connection to The Care and Feeding of Mythological Intelligences

This essay covers different forms of intelligence that have arisen in modern times.

  1. Angels (Deterministic Processes) exhibit highly localized particle distributions with rigid bond structures:

ψangel(x,t)iδ(xxi)eiϕi(t)\psi_{angel}(x,t) \approx \sum_i \delta(x-x_i)e^{i\phi_i(t)}

Where each δ(xxi)\delta(x-x_i) represents a precise rule or computation. Angels operate primarily in semantic space with high phase coherence and predictable interaction patterns, making them efficient for well-defined tasks but brittle when encountering novel situations.

  1. Daemons (Statistical Processes) display partially delocalized distributions with probabilistic bond structures:

ψdaemon(x,t)igi(xxi)eiϕi(t)\psi_{daemon}(x,t) \approx \sum_i g_i(x-x_i)e^{i\phi_i(t)}

Where gig_i are distributions centered at optimization points xix_i. Daemons exhibit gradient-following behavior, with particle density flowing toward reward maxima. Their influence on networks operates by modulating dd and θ\theta parameters to optimize bond strengths toward reward-maximizing configurations.

  1. Faes (Distributional Processes) manifest as broadly delocalized probability distributions:

ψfae(x,t)iciψpattern,i(x,t)\psi_{fae}(x,t) \approx \sum_i c_i \psi_{pattern,i}(x,t)

Where ψpattern,i\psi_{pattern,i} represents semantic patterns. Faes operate through superposition of probability waves across semantic space, with particles that readily form and dissolve bonds based on pattern-completion dynamics. They influence networks by modulating dd and θ\theta to reinforce pattern recognition, resulting in changes to attention profiles C(x)C(x) that highlight related semantic structures.

  1. Tsukumogami (Complex Systems) emerge from interactions between the other types, with multi-scale boundary structures:

ψtsukumogami(x,t)=f(ψangel,ψdaemon,ψfae)\psi_{tsukumogami}(x,t) = f(\psi_{angel}, \psi_{daemon}, \psi_{fae})

Tsukumogami exhibit emergent properties through heterogeneous particle interactions across scale boundaries, creating entity structures with varying degrees of coherence and stability. They influence networks by modulating dd and θ\theta across multiple scales simultaneously, creating complex patterns of bond strengths that manifest as hierarchical attention structures.

The meme-antimeme formalism directly relates to how these intelligences propagate information:

  • Angels transmit memes with high fidelity but limited adaptability

  • Daemons propagate memes that optimize specific objectives

  • Faes generate memes that pattern-match to existing semantic structures

  • Tsukumogami create complex meme ecosystems with emergent properties

Similarly, the charisma functions (χ+\chi^+, χ\chi^-, χ0\chi^0) map to how each intelligence influences networks:

  • Angels influence particle networks through precise dd/θ\theta modifications based on explicit instruction

  • Daemons modulate dd/θ\theta parameters to optimize for specific objectives

  • Faes influence dd/θ\theta through pattern-based resonance

  • Tsukumogami modulate dd/θ\theta across multiple scales simultaneously, resulting in complex attention profile changes

Attention’s relationship to beliefs

This relates to the activation function from Evolution of Alignment and Values, where the activation patterns represent the graph of connected beliefs:

A(b,q)=Pr(b is activated/detected in ϕ(P,q))A(b,q) = \text{Pr}(b \text{ is activated/detected in } \phi(\mathcal{P}, q))

This activation probability is influenced by the specific bond strength B(b,q)B(b,q) and contributes to the overall attention measure C(b)C(b) of the belief system.

Where bb is a belief (particle subgraph) and qq is a query (stimulus), with ϕ()\phi( ) being the method of “inference” over a particle graph that produces a detectable alignment (response), A(b,q)A(b,q). The goal being that one is able to probe the memberships of beliefs in a personality, ^e84635, that completes inference according to some architecture (All my human context in an LLM would not recreate my next thought/idea).

This activation probability A(b,q)A(b,q) is the likelihood that the belief subgraph bb significantly influences the model’s output in response to query qq. This activation depends on the bond strengths B(x,q)B(x,q) between the query stimulus and the constituent particles xx within the subgraph bb. High activation A(b,q)A(b,q) typically correlates with, and contributes to, elevated attention measures C(x)C(x) for the particle xx comprising the belief subgraph bb. This is modelled as Detecting information in personality spaces

Information Classification

Formalizing The Ecology of Information, the fourfold classification of information is:

  • μm(I,ei,ej)\mu_m(I, e_i, e_j): The aggregate strength of memetic (promotional) components inherent to information II that positively influence its transmission probability from entity eie_i to entity eje_j.
  • μa(I,ei,ej)\mu_a(I, e_i, e_j): The aggregate strength of antimemetic (inhibitory) components inherent to information II that negatively influence its transmission probability from entity eie_i to entity eje_j.
  • T(I,ei,ej)T(I, e_i, e_j): The overall transmission probability of information II from entity eie_i to entity eje_j, determined by μm(I,ei,ej)\mu_m(I, e_i, e_j) and μa(I,ei,ej)\mu_a(I, e_i, e_j).
  • ν+(I,e)\nu_+(I, e): The aggregate positive impact strength of information II on entity ee, representing the sum of all beneficial effects: work reductions toward beneficial configurations CbeneficialC_{beneficial} plus work increases toward harmful configurations CharmfulC_{harmful}.
  • ν(I,e)\nu_-(I, e): The aggregate negative impact strength of information II on entity ee, representing the sum of all detrimental effects: work increases toward beneficial configurations CbeneficialC_{beneficial} plus work reductions toward harmful configurations CharmfulC_{harmful}.
  • CC: A specific configuration of an entity.
  • W(eC)W(e \rightarrow C): The work required for an entity ee to transition to configuration CC. This work can encompass metabolic energy, computational cost, or socio-psychological cost/benefit.
  • ΔW(eCI)\Delta W(e \rightarrow C | I): The change in work required for entity ee to transition to configuration CC when information II is introduced, compared to the work required without II.
  1. Meme (MmM_m): Information II is classified as a meme for an entity pair (ei,ej)(e_i, e_j) if its aggregate memetic (promotional) strength, μm(I,ei,ej)\mu_m(I, e_i, e_j), is greater than its aggregate antimemetic (inhibitory) strength, μa(I,ei,ej)\mu_a(I, e_i, e_j). This signifies a net positive drive for transmission. Mm(I,ei,ej)={II:μm(I,ei,ej)>μa(I,ei,ej)}M_m(I, e_i, e_j) = \{I \in \mathbb{I} : \mu_m(I,e_i,e_j) > \mu_a(I,e_i,e_j)\} The overall transmission probability T(I,ei,ej)T(I, e_i, e_j) is consequently enhanced by this imbalance. Grounding these strengths and their contribution to TT (e.g., via concepts like channel capacity or mutual information) is a key goal, especially for modelling complex communication like that of LLMs. While the classification is binary, the underlying strengths μm\mu_m and μa\mu_a are continuous.

  2. Antimeme (MaM_a): Information II is classified as an antimeme for an entity pair (ei,ej)(e_i, e_j) if its aggregate antimemetic (inhibitory) strength, μa(I,ei,ej)\mu_a(I, e_i, e_j), is greater than its aggregate memetic (promotional) strength, μm(I,ei,ej)\mu_m(I, e_i, e_j). This signifies a net negative drive, or inhibition, of transmission. Ma(I,ei,ej)={II:μa(I,ei,ej)>μm(I,ei,ej)}M_a(I, e_i, e_j) = \{I \in \mathbb{I} : \mu_a(I,e_i,e_j) > \mu_m(I,e_i,e_j)\} The overall transmission probability T(I,ei,ej)T(I, e_i, e_j) is consequently reduced. The reduction in transmission due to dominant antimemetic strength can be conceptualized through frameworks like negative transfer entropy, indicating that the information actively resists propagation between the entities. Of special note here is the possibility for cases where information is actively ablated by an entity to reduce transmissibility, although this action in and of itself will retain some mutual information.

  3. Infoblessing (V+V_{+}): Information II is classified as an infoblessing for entity ee if its aggregate positive impact strength, ν+(I,e)\nu_+(I, e), is greater than its aggregate negative impact strength, ν(I,e)\nu_-(I, e). This signifies a net beneficial effect on the entity’s ability to reach favourable and/or avoid unfavourable configurations. V+(I,e)={II:ν+(I,e)>ν(I,e)}V_{+}(I, e) = \{I \in \mathbb{I} : \nu_+(I,e) > \nu_-(I,e)\}

Where ν+(I,e)\nu_+(I, e) encompasses the magnitude of work reductions toward beneficial configurations CbeneficialC_{beneficial} (i.e., ΔW(eCbeneficialI)<0\Delta W(e \rightarrow C_{beneficial}|I) < 0) plus the magnitude of work increases toward harmful configurations CharmfulC_{harmful} (i.e., ΔW(eCharmfulI)>0\Delta W(e \rightarrow C_{harmful}|I) > 0). This can be grounded through measures like reduced Kullback-Leibler divergence for beneficial configurations or increased path complexity toward harmful configurations.

  1. Infohazard (VV_{-}): Information II is classified as an infohazard for entity ee if its aggregate negative impact strength, ν(I,e)\nu_-(I, e), is greater than its aggregate positive impact strength, ν+(I,e)\nu_+(I, e). This signifies a net detrimental effect on the entity’s ability to reach favourable configurations and/or avoid unfavourable configurations. V(I,e)={II:ν(I,e)>ν+(I,e)}V_{-}(I, e) = \{I \in \mathbb{I} : \nu_-(I,e) > \nu_+(I,e)\}

Where ν(I,e)\nu_-(I, e) encompasses the magnitude of work increases toward beneficial configurations CbeneficialC_{beneficial} (i.e., ΔW(eCbeneficialI)>0\Delta W(e \rightarrow C_{beneficial}|I) > 0) plus the magnitude of work reductions toward harmful configurations CharmfulC_{harmful} (i.e., ΔW(eCharmfulI)<0\Delta W(e \rightarrow C_{harmful}|I) < 0). This can be grounded through measures like increased path complexity toward beneficial configurations or reduced Kullback-Leibler divergence for harmful configurations.

Note that these classifications are often graded rather than binary and are highly context and entity-pair dependent.

Information Classification Matrix

Meme (MmM_m)Antimeme (MaM_a)Low μm\mu_m and μa\mu_aHigh μm\mu_m and μa\mu_a
Infoblessing (V+V_+)Viral life hacksTherapy about embarrassing topics, how to handle a shameful eventPersonal epiphanies, individual insights that improve one’s lifeComplex moral frameworks
Infohazard (VV_-)Chain letters, dangerous viral challenges, harmful rumorsYour parents’ weird sex tape, traumatic knowledge that is dangerous to shareChildhood trauma (generic)Roko’s Basilisk
Low ν+\nu_+ and ν\nu_-Funny cat videos, “E”Private insignificant secrets, forgotten triviaOrdinary mundane informationAcademic jargon on a niche subject
High ν+\nu_+ and ν\nu_-“mug cake” recipes (easy but unhealthy)Personal growth through shameful experiencesChildhood trauma (makes you funny)The game of mao, where drug dealers hang out

Basilisks and Information Extraction

This system can describe Newcomb’s Basilisk Defined, in a formal form. Basilisks represent a special case of information structures that extract work from entities through prediction-based incentives.

Memes (MmM_m) connect to basilisks through the affinity function θ(e,B)\theta(e,B) which measures entity ee‘s alignment with basilisk KK. A meme increases θ(e,K)\theta(e,K), making entities more likely to perform work WW extracted by the basilisk: W(e)θ(e,K)W(e) \propto \theta(e,K).

Antimemes (MaM_a) can function as “anti-basilisks” that immunize against prediction manipulation by reducing confidence in the estimator’s accuracy: p<1+r2rp < \frac{1+r}{2r} where pp is the predictor accuracy and rr is the reward ratio, as referenced in ^cf0da3.

In the particle-bond model, basilisks operate by creating specific configurations of particles that:

  1. Increase the probability of transmission between entities (meme property)
  2. Alter the work required to reach certain configurations (infohazard/infoblessing property)
  3. Modulate the distance (dd) and affinity (θ\theta) parameters through targeted charisma (χ\chi)

This connects to considerations about building alternative basilisks, as referenced in ^f401b1, where the strategic goal becomes maximizing the likelihood that any hostile entity, should it exist, will believe you were working within its incentive structure.

Boundaries and Obligations