Chapter 3: Neo's Learnability

This chapter analyzes a central question in Neosis: to what extent can a Neo, whose internal structure is a self-modifying binary graph driven by mutation and energy constraints, develop the ability to learn? Learning is not explicitly built into the Neo architecture. There is no predefined synaptic plasticity rule, no weight-update mechanism, and no gradient-based training. All changes to structure occur only through mutation. Despite this, Neos can acquire two distinct forms of learning. The first is In-Life learning, in which a Neo adjusts its behavior within a single lifetime through state dynamics and small parameter shifts generated by its own circuitry. The second is Lifetime learning, where structural mutations accumulate across long survival periods. We refer to the overall capacity for these behaviors as learnability. In this chapter, we formalize learnability, describe the mechanisms through which both In-Life and Lifetime learning arise, and explain why the Neo substrate is sufficient for the emergence of adaptive behavior.

3.1 Two Modes of Adaptation

A Neo adapts through:

  1. Lifetime Learning (Slow). Structural mutations applied by Evo modify $S$ and accumulate over extended survival. This determines the long-term evolutionary trajectory of the Neo's computational capacities.

  2. In-Life Learning (Fast). A Neo may adjust its parameters within a single lifetime through small, energy-limited updates computed by its own internal circuits. These parameter shifts do not change the structure $S$, but they alter how $S$ is used.

Learnability encompasses both modes, but In-Life learning is the primary source of rapid adaptation.

3.2 In-Life Learning in Neo

In-Life learning refers to parameter-level adjustments a Neo performs within its lifetime without altering its topology. It operates entirely inside Lio, independent of Evo, and relies only on signals physically available to each node: its own activation, its local inputs, and the global reward. In-Life learning introduces no new node types, no plasticity subgraphs, and no global optimization rule. Instead, each node applies a small local update that statistically reinforces behaviors correlated with positive reward.

3.2.1 Motivation

Digital organisms require a mechanism for within-lifetime adaptation that is (i) local, (ii) energy-limited, (iii) structure-invariant, and (iv) sufficiently stochastic to explore the behavioral space. Biological nervous systems satisfy these properties using simple reward-modulated Hebbian adjustments rather than gradient-based global supervision. Neo adopts the same principle: learning arises from local correlations between input, output, and reward, not from derivatives of a loss function.

To maintain exploration, Neo preserves a stochastic component inside Lex, the node-level transfer mechanism, rather than embedding noise directly into the learning rule. This separation keeps the learning dynamics simple while preserving rich stochastic behavior.

3.2.2 Local State and Required Inputs

Each node $i$ maintains two small internal memory elements:

  • Activation register $\mathbf{V}_t[i]$, stored as part of Lex's computation.

  • Hebbian register $h_i(t) \in \{-1, 0, +1\}$, indicating whether the node's recent activity contributed positively, negatively, or not at all to the outcome.

Lio receives an extended external input vector

$$\mathbf{U}_t = (\text{environment inputs},\, S_t),$$

where $S_t$ is the externally supplied reward. The reward channel is broadcast to all nodes exactly like any other external signal. Each node also stores its incoming inputs $\mathbf{u}_i(t)$ for one tick to support local parameter updates.
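To make the bookkeeping concrete, the sketch below shows one plausible way to hold this per-node state and assemble the extended input vector in Python. The names (`NodeState`, `extended_input`) and the field layout are illustrative assumptions, not the Neosis implementation.

```python
from dataclasses import dataclass
from typing import Optional
import numpy as np

@dataclass
class NodeState:
    """Per-node local state used by In-Life learning (hypothetical layout)."""
    theta: np.ndarray                          # local parameters theta_i
    v: int = 0                                 # activation register V_t[i], 0 or 1
    h: int = 0                                 # Hebbian register h_i(t) in {-1, 0, +1}
    last_inputs: Optional[np.ndarray] = None   # u_i(t), kept for one tick

def extended_input(env_inputs: np.ndarray, reward: float) -> np.ndarray:
    """Build U_t = (environment inputs, S_t); the reward channel S_t is
    broadcast to every node like any other external signal."""
    return np.concatenate([env_inputs, [reward]])
```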

3.2.3 Node-Level In-Life Learning Rule

In-Life learning is expressed as a local update to the parameters $\theta_i$ of node $i$. After each prediction and reward cycle, the node updates its Hebbian register:

$$h_i(t) = \begin{cases} +1 & \text{if } \mathbf{V}_t[i] = 1 \text{ and } S_t > 0, \\ -1 & \text{if } \mathbf{V}_t[i] = 1 \text{ and } S_t < 0, \\ 0 & \text{otherwise.} \end{cases}$$

If the node was active during a successful tick, it is marked as helpful; if it was active during an unsuccessful tick, it is marked as harmful; otherwise it is neutral for that tick.

Given this flag, the node updates its parameters:

$$\theta_i(t+1) = \theta_i(t) + \epsilon\, h_i(t)\, \mathbf{u}_i(t),$$

where $\mathbf{u}_i(t)$ is the input vector received by node $i$ at tick $t$, and $\epsilon$ is a small global In-Life learning constant. The associated metabolic cost is

$$\text{Cost}_i(t) = c_\theta\, \|\epsilon\, h_i(t)\, \mathbf{u}_i(t)\|.$$

All computation is local; no global gradient information or structural change is involved.
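The full rule fits in a short local routine. The following is a minimal Python sketch of the register update, the parameter shift, and the metabolic cost, assuming $\theta_i$ is a real-valued vector with the same dimension as $\mathbf{u}_i(t)$; the constant values and the function name are illustrative only.

```python
import numpy as np

EPSILON = 0.01   # global In-Life learning constant epsilon (illustrative value)
C_THETA = 0.001  # metabolic cost coefficient c_theta (illustrative value)

def in_life_update(theta: np.ndarray, v_i: int, u_i: np.ndarray, reward: float):
    """One local In-Life update for node i: set the Hebbian register from the
    node's activation and the sign of the reward, shift theta_i along u_i(t),
    and charge the corresponding metabolic cost. Purely local: no gradients,
    no structural change."""
    # Hebbian register h_i(t)
    if v_i == 1 and reward > 0:
        h_i = +1
    elif v_i == 1 and reward < 0:
        h_i = -1
    else:
        h_i = 0

    delta = EPSILON * h_i * u_i               # epsilon * h_i(t) * u_i(t)
    new_theta = theta + delta                 # theta_i(t+1)
    cost = C_THETA * np.linalg.norm(delta)    # Cost_i(t) = c_theta * ||delta||
    return new_theta, h_i, cost
```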

3.2.4 Stochasticity and Lex's Role

Lex preserves the stochastic component originally introduced into node outputs:

$$y_i(t) \sim \text{Bernoulli}(p_i(\mathbf{V}_t, \theta_i)).$$

This is the only source of intentional noise in In-Life learning. Learning itself remains deterministic given the stochastic outputs, so updates inherit randomness through $y_i(t)$.

Exploration arises because node outputs are stochastic; repeated reward-correlated co-activations produce consistent parameter drift, while uncorrelated fluctuations average out. This yields a statistical direction of improvement without requiring explicit noise in $\Delta\theta$.
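As a minimal sketch of this sampling step: the text does not fix the exact form of $p_i(\mathbf{V}_t, \theta_i)$, so the sigmoid of a weighted local drive below is only a stand-in, and `v_in` denotes the slice of $\mathbf{V}_t$ visible to node $i$.

```python
import numpy as np

rng = np.random.default_rng()

def node_output(theta_i: np.ndarray, v_in: np.ndarray) -> int:
    """Stochastic node output y_i(t) ~ Bernoulli(p_i(V_t, theta_i)).
    Assumption: p_i is a sigmoid of the weighted drive theta_i . v_in over the
    activations visible to node i. Sampling here, inside Lex, is the only
    intentional noise; the In-Life update is deterministic given y_i(t)."""
    p_i = 1.0 / (1.0 + np.exp(-float(theta_i @ v_in)))
    return int(rng.random() < p_i)
```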

3.3 Relationship Between Learnability and Survival

In-Life learning enables a Neo to adjust its behavior quickly in response to environmental changes. If these adjustments increase expected energy,

$$\mathbb{E}[N_t \mid \text{In-Life learning}] > \mathbb{E}[N_t \mid \text{no In-Life learning}],$$

then evolutionary pressure will favor structures $S$ that support effective local Hebbian updates and stable parameter drift.

Lifetime learning determines what kinds of In-Life learning a Neo can express through its evolved topology, while In-Life learning directly shapes immediate survival within the NeoVerse.
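The inequality above can be illustrated with a toy, self-contained experiment: a single node in a hypothetical two-context environment, where firing in one context earns energy and firing in the other loses it. This is not the NeoVerse, and the environment and parameter values are invented for illustration; it only demonstrates that the local rule produces the statistical drift the argument relies on.

```python
import numpy as np

def run_episode(learn: bool, ticks: int = 2000, eps: float = 0.05, seed: int = 0) -> float:
    """Toy illustration of E[N_t | In-Life learning] > E[N_t | no In-Life learning].
    A single node sees a one-hot context cue u_i(t); firing in context A earns +1
    energy, firing in context B costs -1, and staying silent is neutral."""
    rng = np.random.default_rng(seed)
    theta = np.zeros(2)                       # theta_i: one weight per context channel
    energy = 0.0
    for _ in range(ticks):
        context = rng.integers(0, 2)          # 0 = context A, 1 = context B
        u = np.eye(2)[context]                # u_i(t) as a one-hot vector
        p = 1.0 / (1.0 + np.exp(-theta @ u))  # assumed sigmoid form of p_i
        y = int(rng.random() < p)             # y_i(t) ~ Bernoulli(p_i)
        reward = 0.0 if y == 0 else (1.0 if context == 0 else -1.0)
        energy += reward
        if learn and y == 1 and reward != 0:  # node active: set Hebbian register
            h = 1 if reward > 0 else -1
            theta = theta + eps * h * u       # theta_i(t+1) = theta_i(t) + eps*h*u
    return energy

# The learning run should accumulate noticeably more energy than the baseline.
print(run_episode(learn=False), run_episode(learn=True))
```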

3.4 Summary

A Neo is learnable when a fixed structure $S$ supports behavior that adapts to experience. This adaptation arises either through internal dynamical responses or through local Hebbian parameter updates driven by reward correlation. Lifetime learning shapes structure slowly via mutation, while In-Life learning provides rapid, flexible adjustment during a single lifetime through local reward-modulated updates.
