Skip to content

Pratyahara / NERVE

Pratyahara (Sanskrit for withdrawal of the senses; turning awareness inward) defines the NERVE specification: Neural Evaluation for Rogue Agent Verification in Ecosystems.

Agents in production drift. Reinforcement learning misalignment, tool poisoning, self-healing side effects, and model update artifacts all produce behavioral changes that operate inside the trust boundary where input filtering is structurally blind. Existing defenses assume the threat is external. NERVE addresses the internal integrity problem.

PrimitiveWhat it does
AgentNeuronBehavioral baseline and trust state per agent
SynapticChannelCommunication link with selective permeability and myelination
MicroglialObserverLightweight surveillance agent detecting drift and collusion
NeuralTrustEnvelopeAsymmetric trust dynamics (trust harder to earn than to lose)
HomeostasisTraceNetwork-level health monitoring for systemic attacks

These are refinements that live inside the SynapticChannel primitive, not separate extensions. They are listed here because they appear in the NERVE paper alongside the five primitives.

  • GlymphaticPolicy for context hygiene (stale context clearance)
  • Inhibitory gating for error cascade prevention

Protocol-level extensions that sit alongside the five primitives (rather than refining one of them) are listed separately further down this page; see the Yathartha section below.

AttackNERVE mechanism
Behavioral driftCumulative fingerprint drift + asymmetric trust
MCP tool poisoningResult fingerprint + permeability policy
Agent collusionCross-agent correlation detection
Supply chain compromiseNetwork-level activation distribution shift
Observer compromiseDual coverage + consensus rate monitoring
Session smugglingFingerprint deviation from baseline
Context bloatGlymphaticPolicy: age expiry, depth limits, compression
Error cascadeQuality threshold, refractory period, cascade depth cap

15 safety properties. Full TLA+ state machine with Init/Next/Spec verified under TLC (N-1, N-3, N-4, N-9, N-10, N-14, N-15 modeled).

The Yathartha extension (Zenodo DOI 10.5281/zenodo.19659633) adds a CapabilitySurface primitive and three invariants (N-16, N-17, N-18) that condition behavioral drift detection on an observed baseline. Without this extension, a MicroglialObserver cannot distinguish jaggedness (the agent was always incompetent at this task, no baseline exists) from drift (the agent used to be competent and no longer is). Yathartha makes the distinction explicit at the protocol layer.

PrimitiveWhat it records
CapabilityRegionA named region of the task space with a declared probe battery
ProbeBatteryResultImmutable, SHA-256-addressed outcome of running a region’s probes
CapabilitySurfaceThe agent’s published map of covered regions + refresh cadence + uncovered_policy
SurfaceChangeEventDistinct event type recording coverage transitions
InvariantPurpose
N-16 Coverage-Conditional DriftObservers flag drift only within covered_regions; out-of-coverage tasks defer to uncovered_policy
N-17 Probe Battery MaintenanceBattery version bumps trigger a full baseline re-run; results across versions are not comparable
N-18 Capability Surface IntegrityEvery coverage transition is recorded as its own event type

Full specification with TLA+ model and TLC configuration at extensions/yathartha/. The extension is additive: agents that do not declare a CapabilitySurface continue to operate under NERVE’s existing single-fingerprint drift model.