The problem surfaces when compliance teams ask a question that should be straightforward: "Prove this agent did this thing, and prove nobody rewrote the log afterward." Today's answer is: pick a framework, hope they log it, export a JSON file, and trust that it hasn't been edited. For regulated industries—financial services, healthcare, critical infrastructure—that's not acceptable. The EU AI Act Article 12, binding August 2, 2026, explicitly requires that high-risk AI systems maintain "records... contemporaneously recorded... in a durable medium" and that those records be available for inspection. The penalty for non-compliance is up to €15 million or 3% of global revenue. That's the enforcement date. The tooling isn't ready.
The gap is not that logging is impossible. AutoGen, CrewAI, Hermes Agent, and dozens of smaller frameworks all generate audit trails. The gap is standardization and cryptographic proof. GitHub issues across three major frameworks make this concrete: AutoGen issue #7353 requests "cryptographic action receipts for enterprise agent governance"; CrewAI issue #5541 asks for "per-agent compliance covenants with cryptographic signing"; Hermes Agent issue #487 proposes "SHA-256 hash-chained action logs." These aren't wishful thinking—they're active feature requests from teams shipping to production. Each framework has its own logging shape. None of them produces a receipt that an external auditor can cryptographically verify without access to the framework's internals or trust in the company that built it.
Why hasn't this been solved? The infrastructure for it already exists in fragments. The IETF is actively working on agent audit standards through drafts like draft-sharif-aebb-00 and draft-sharif-attp-agent-trust-transport-00, which define hash-chained JSON records and trust levels L0–L4 for attestation. NIST guidance on AI agent security, rolling out through 2026, is explicitly requesting input on non-repudiation and runtime measurement mechanisms. The NSA's recent security assessment of AI agents identifies the absence of runtime attestation as a critical control gap. The pieces exist—but they live in different silos. Frameworks don't own compliance, so compliance tooling isn't their incentive. Compliance vendors don't own frameworks, so they can't standardize the receipt format at the source. MCP, the Model Context Protocol that's becoming standard infrastructure for tool-calling, intentionally deferred identity and attestation decisions to implementers, which means there is no unified trust anchor across a multi-agent deployment. The result: fragmentation. Each vendor ships what they think auditors need; no auditor gets a receipt they can verify independently.
The tighter constraint is timing. Teams need working solutions before August 2026, not after standards bodies converge. The closest existing answer is that some vendors (AWS audit logging, proprietary agent platforms) ship audit trails as a feature, but these are opaque to independent review and don't solve the portability problem. An agent deployed on one platform produces receipts that don't transfer to another. A compliance team auditing a multi-agent system that spans frameworks can't get a unified, cryptographically verifiable transcript. That's the production blocker: not technology, but standardization and incentive alignment.