Runtime model identity, artifact identity, and signed verification infrastructure.
Each paper opened a question the previous one couldn't answer. Together they trace the same
line: a neural network's structural identity is mathematically distinct from its outputs, its weights' bytes,
and its agent credentials — and that distinction is measurable, formally verifiable, and operationally useful.
13 research papers · 4 technical notes · 4 patents · 0 retracted · All open-access on Zenodo
Reading paths
Four entry points through the corpus, by audience and purpose. Each path is three
works long.
Structural Identity. These papers
define and test the measurement primitive behind Fall Risk: a structural fingerprint observed during ordinary
computation. Start here for the scientific basis of runtime model identity.
Endpoint / API. When the model is
behind an API and weights are out of reach, the same identity can still be measured through public logprob
endpoints. These papers cover the API-side observable, its formal security properties, and its limits.
Distillation & Provenance.
Distilled models inherit some of their teacher's behavior but not its structural identity. These papers measure,
falsify, and bound what survives passive and adversarial copying — across families, scales, and training
recipes.
Governance & Compliance. How model
identity becomes admissible evidence: regulatory mappings (EU AI Act, NIST), enterprise IAM composition, and the
threat-model gap that current agent-identity standards leave open.
Artifact Identity. Verifying what is on
disk, before runtime. The boundary between artifact identity (what Trustfall Lite verifies) and runtime identity
(what Trustfall Deep verifies) is part of the claim hygiene.
Technical Notes. Operational notes
published alongside the research series: the agent-vs-model identity distinction, a gap-invariance proof for API
measurement, and a measured-substitution scenario against a live agent.
Core research
13 papers · publication order
Each paper extends a previous question. The natural reading path is in publication order —
the program's questions unfolded that way for a reason.
Neural networks have a structural fingerprint — the third pre-softmax logit gap —
that is invariant to temperature, architecture-stable across six families, and unforgeable under any
adversarial KL budget.
When the model is behind an API and the weights are out of reach, the same
identity can be measured through public logprob endpoints using PPP-residualized order-statistic geometry.
Distilled models inherit a measurable trace of their teacher; passive fine-tuning
erases the trace faster than adversarial erasure does, and same-family spoofing is geometrically
anti-aligned.
Provenance detection generalizes across teachers, students, and training protocols
— but the cosine alignment diagnostic is mandatory; scalar distance alone produces wrong answers.
An AI system's structural identity is mathematically distinct from its behavioral
character. Two models can produce identical outputs while having different identities, and the same model
produces wildly different characters under different prompts.
Inference verification proofs are only as trustworthy as the binding between the
proof and the model that actually ran. Hybrid proof-and-bridge attestation closes the gap.
Identity evidence comes in distinct classes — artifact, structural, provenance,
behavior — and substituting one for another is formally insufficient. A theorem makes the constraint legally
citable.
Structural model identity composes cleanly with JWT, SPIFFE, and existing
enterprise identity primitives. Four formal composition properties make it stack-safe.
Two models trained on identical data and architecture produce different structural
identities. Endpoint statistics cannot recover the formative trajectory.
Structural identity verification scales to 70B+ parameters. When a frontier
vendor's model lineage is disputed, only runtime measurement settles it — disclosure after the fact does
not.
Reasoning distillation produces family-dependent structural and functional
responses. Mistral, Llama, and Qwen react differently — and the structural layer can decouple entirely from
the functional layer.
Public toolchains strip a model's safety constraints while preserving observable
behavior. The structural fingerprint changes anyway — and we can detect it.
Authorizing an agent is not the same as verifying which neural network produced
its response. The 2026 identity-management products solve the first problem and assume the second is already solved.
Order-statistic gaps are invariant to log-softmax, temperature, and constant
shifts. The endpoint-verification protocol's robustness is provable, not just empirical.
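The invariance claim can be illustrated with a toy calculation. This is a hedged sketch, not the paper's protocol, and the function names are illustrative: gaps between sorted logits are unchanged by any constant shift, including the logsumexp subtraction that turns logits into log-softmax values, and temperature scaling rescales every gap by the same factor, so gap ratios survive.

```python
import math

def sorted_gaps(logits):
    """Differences between consecutive order statistics (descending)."""
    s = sorted(logits, reverse=True)
    return [a - b for a, b in zip(s, s[1:])]

logits = [3.5, 2.0, 0.2, -1.0]

# Constant shift (e.g. the logsumexp subtraction in log-softmax): gaps unchanged.
lse = math.log(sum(math.exp(x) for x in logits))
log_softmax = [x - lse for x in logits]
assert all(abs(g1 - g2) < 1e-9
           for g1, g2 in zip(sorted_gaps(logits), sorted_gaps(log_softmax)))

# Temperature scaling divides every gap by T, so gap *ratios* are preserved.
T = 0.7
scaled = [x / T for x in logits]
g, gT = sorted_gaps(logits), sorted_gaps(scaled)
ratios = [a / b for a, b in zip(g, g[1:])]
ratios_T = [a / b for a, b in zip(gT, gT[1:])]
assert all(abs(r1 - r2) < 1e-9 for r1, r2 in zip(ratios, ratios_T))
```

Because an API's logprobs are just logits after a constant logsumexp shift, any gap-based observable computed from them agrees with the raw pre-softmax gaps, which is the intuition behind the robustness claim.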
Three substitution scenarios run against a live gateway with valid agent
credentials. Three detected. Zero false accepts. Warm-path latency under seven seconds.
Trustfall Lite verifies whether a local artifact's bytes match a signed enrollment
record. It does not — and cannot — verify what runs at inference time. The boundary is the product.
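What byte-level artifact verification involves can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Trustfall Lite's implementation: the record format, the HMAC-based signature, and all names here are hypothetical.

```python
import hashlib
import hmac
import json

def enroll(artifact_bytes: bytes, signing_key: bytes) -> dict:
    """Produce a signed enrollment record for an artifact's bytes."""
    digest = hashlib.sha256(artifact_bytes).hexdigest()
    record = {"alg": "sha256", "digest": digest}
    payload = json.dumps(record, sort_keys=True).encode()
    record["sig"] = hmac.new(signing_key, payload, hashlib.sha256).hexdigest()
    return record

def verify(artifact_bytes: bytes, record: dict, signing_key: bytes) -> bool:
    """Check the record's signature, then the bytes against the signed digest."""
    body = {k: v for k, v in record.items() if k != "sig"}
    payload = json.dumps(body, sort_keys=True).encode()
    expected = hmac.new(signing_key, payload, hashlib.sha256).hexdigest()
    if not hmac.compare_digest(record["sig"], expected):
        return False
    return hashlib.sha256(artifact_bytes).hexdigest() == record["digest"]

key = b"demo-key"            # hypothetical enrollment key
weights = b"\x00weights-on-disk\x01"
rec = enroll(weights, key)
assert verify(weights, rec, key)             # bytes match the signed record
assert not verify(weights + b"x", rec, key)  # any byte change is detected
```

Note what a matching digest cannot show: it says nothing about which weights are actually loaded at inference time, which is exactly the artifact-versus-runtime boundary the text draws.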