Flamehaven.space

Writing Hub

AI governance essays, reasoning systems notes, experiment logs, and technical writing across BioAI and engineering practice.

The Two Problems No One Talks About in AI Agent Coding Pipelines
AI Governance Systems

AI agent coding pipelines fail not because models are weak, but because verification is structurally broken. This article identifies four empirically documented failure mechanisms — agreement bias, latent entanglement, echoing, and right-for-wrong-reasons — and proposes a concrete architecture: hash-chained audit records, hybrid recurrence scoring, dynamic context budgets, and evidence-first review across three independent axes. Covers multi-agent pipeline design, agentic code review, blueprint indexing, and P0–P4 governance gates.

Control, auditability, and safe boundaries
#Data Orchestration #Contextengineering #Architecture #Prompt Engineering #Mlops #AI Governance #AI Alignment #AI
Stanford. Princeton. A bioRxiv Paper. So Why Did Nobody Ask Where the Data Goes?
Scientific & BioAI Infrastructure
STEM_BIO_AI Audit Report

BioClaw processes EHR data. Its primary showcase channel is WhatsApp. We audited the repository: 60/100, Tier 2 Caution. Here is what the bioRxiv paper says that the README does not.

Evidence-aware scientific systems
#AI #AI Alignment #Biomedical #Bioinformatics #Mlops #Cognitive Science #AI Code #Data Orchestration #Agent #Code Review #Claim custody #AI Productivity #Contextengineering #Prompt Engineering
Your Bio Repo Could Get You Fined. Here Is Why We Check Every Single One.
Scientific & BioAI Infrastructure
STEM_BIO_AI Audit Report

When a bio AI repository claims HIPAA compliance but the code says otherwise, the legal exposure falls on whoever deploys it. STEM-BIO-AI evaluated yorkeccak/bio — 322 stars, modern stack, one dangerous README line. Score: 48/100. T1 Quarantine. Full audit report with score matrix, regulatory traceability, and raw machine output.

Evidence-aware scientific systems
#AI #AGI #Biomedical #Bioinformatics #Agent #AI Productivity #Architecture #Code Review
Beyond Repo Scanning: How AIRI Expanded the Risk Vocabulary in STEM BIO-AI 1.7.x
Scientific & BioAI Infrastructure
STEM-AI: Sovereign Trust Evaluator for Medical AI Artifacts

How STEM BIO-AI uses the MIT AI Risk Repository as a governed local risk-vocabulary layer without replacing deterministic repository scanning.

Evidence-aware scientific systems
#AI #AGI #AI Alignment #AI Governance #Biomedical #Bioinformatics #Cognitive Science #Open Source #AI Research #Scientific Integrity #Prompt Engineering #Software Development #Github #AI Code #Architecture #Security #Data Orchestration #Agent
When Control Becomes Authority: Calibration Governance in STEM BIO-AI 1.7.x
AI Governance Systems
STEM-AI: Sovereign Trust Evaluator for Medical AI Artifacts

Why STEM BIO-AI treats calibration as governed policy instead of a free-form score-tuning console for bio and medical AI repository audits.

Control, auditability, and safe boundaries
#Bioinformatics #Biomedical #AI #AGI #AI Alignment #AI Governance #AI Hallucination #Cognitive Science #Open Source #AI Research #AI Code #Architecture #Data Orchestration #Agent
Role Separation Is Not Verification: The Structural Failures Hidden in Your Multi-Agent Pipeline
AI Governance Systems

A research-backed breakdown of why agent role design alone does not produce reliable audits, and what actually does.

Control, auditability, and safe boundaries
#AI #AGI #Agent #AI Alignment #AI Governance #Software Development #Architecture #Data Orchestration
Prompt → RAG → MCP → Agent → Harness, and What?
Cloud & Engineering Foundations

Why the next layer in AI may be governance infrastructure, not just better agents.

Operational surfaces that survive real deployment
#AI #AGI #AI Ethics #AI Alignment #AI Governance #AI Hallucination #LLM #Cognitive Science #Developer Tools #Prompt Engineering #Software Development #AI Code #Contextengineering #Architecture #Data Orchestration
The Harness Is the Product: What the Claude Code Leak Actually Revealed About AI Agent Architecture
Cloud & Engineering Foundations

The Claude Code leak exposed more than source. It revealed that modern AI agent performance depends heavily on the harness around the model.

Operational surfaces that survive real deployment
#AI #AGI #AI Alignment #AI Governance #LLM #Deep Learning #Machine Learning #DevOps #Prompt Engineering #Software Development #Product Management #AI Code #Contextengineering #Architecture #Security #Data Orchestration
Prompt, Pray & Push: Why Your AI Agent Keeps Failing You
Cloud & Engineering Foundations

The one concept that turns expensive spaghetti into great agentic engineering.

Operational surfaces that survive real deployment
#AI #AGI #AI Alignment #AI Governance #AI Hallucination #Future of Work #LLM #Deep Learning #Machine Learning #SR9/DI2 #Cognitive Science #DevOps #Programming #AI Code #Business Strategy #Software Development #Prompt Engineering
Your Agentic Stack Has Two Layers. It Needs Three.
AI Governance Systems
Governed Reasoning

Most agentic stacks cover tools and skills, but miss intent governance. Learn why a third layer is needed to stop AI drift, scope creep, and technically correct systems heading in the wrong direction.

Control, auditability, and safe boundaries
#AI #AGI #AI Alignment #AI Governance #AI Hallucination #LLM #Deep Learning #Machine Learning #SR9/DI2 #Cognitive Science #Prompt Engineering #AI Code #Contextengineering #Architecture
AI Agents Are Poisoning Your Codebase From the Inside
Cloud & Engineering Foundations

Explore how AI-generated code can silently degrade software quality through weakened tests, rising code churn, and duplication—and how teams can prevent it with better governance.

Operational surfaces that survive real deployment
#AI #AI Ethics #AI Alignment #AI Governance #AI Hallucination #LLM #Deep Learning #Machine Learning #Developer Tools #DevOps #Programming #Prompt Engineering #Product Management #Software Development #AI Code