❦
osservadore · mmxxvi
The Observer
Independent research on AI evaluation infrastructure and structural systemic risks.
The failure mode this archive describes is not model collapse, not benchmark contamination, not distribution drift. It is a functional shift of the evaluation categories themselves — boundaries persist while what they separate becomes progressively opaque.
Recent papers
April 2026
A Category-Level Failure Mode Not Captured by Distribution Metrics
Externality conditions for substrate-independent audit, derived from the 2008 Gaussian copula collapse.