Our Blog About Early Research Ideas
Beyond Reasoning: The Imperative for Critical Thinking Benchmarks in Large Language Models
Current evaluation frameworks for Large Language Models (LLMs) predominantly assess logical reasoning while neglecting critical thinking: questioning premises, weighing evidence, and recognizing when a problem is ill-posed. This gap becomes pressing as LLMs move from tools to colleagues in human-AI teams, demanding a fundamental reconsideration of how we evaluate machine intelligence.
The Flight Recorder for AI Agents: Toward Reproducible and Accountable Autonomy
As AI agents become autonomous decision-makers, we need "flight recorders" that capture their complete internal reasoning—inputs, neural activations, and decisions—in a deterministic, reproducible format. Such infrastructure would transform opaque agent behavior into auditable evidence, enabling both mechanistic interpretability research and accountability by allowing researchers to replay and inspect exactly what happened and why.
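To make the idea concrete, here is a minimal sketch in Python of what one "flight recorder" entry might look like. The names (AgentStepRecord, record_step) and the choice of fields are illustrative assumptions rather than an existing API, and a real recorder would store full activations and model/version metadata instead of only a hash.

# Hypothetical sketch of a flight-recorder entry for one agent decision step.
# Names and fields are assumptions for illustration, not an existing API.
import hashlib
import json
from dataclasses import dataclass, asdict
from typing import Any

@dataclass
class AgentStepRecord:
    step: int                  # index of the decision step in the trajectory
    rng_seed: int              # seed pinned so the step can be re-run deterministically
    inputs: dict[str, Any]     # prompt, tool outputs, retrieved context
    activation_digest: str     # hash of captured intermediate activations (placeholder for full capture)
    decision: dict[str, Any]   # chosen action and its parameters

    def digest(self) -> str:
        # Content hash of the whole record, usable to verify a replay byte-for-byte.
        canonical = json.dumps(asdict(self), sort_keys=True).encode()
        return hashlib.sha256(canonical).hexdigest()

def record_step(step: int, rng_seed: int, inputs: dict, activations: bytes, decision: dict) -> AgentStepRecord:
    # Capture one decision step; activations are reduced to a digest here for brevity.
    return AgentStepRecord(
        step=step,
        rng_seed=rng_seed,
        inputs=inputs,
        activation_digest=hashlib.sha256(activations).hexdigest(),
        decision=decision,
    )

# Usage: after replaying the same step with the same seed and inputs,
# compare digests to confirm the trajectory reproduced exactly.
original = record_step(0, 1234, {"prompt": "book a flight"}, b"...", {"action": "search_flights"})
replayed = record_step(0, 1234, {"prompt": "book a flight"}, b"...", {"action": "search_flights"})
assert original.digest() == replayed.digest()

The design choice worth noting is the content hash: if every step of an agent run is recorded this way, an auditor can replay the run and check digests step by step to locate exactly where a reproduced trajectory diverges from the original.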