Flamehaven.space
bioai · PROTOTYPE · PUBLIC · Apache License 2.0

STEM-BIO-AI

Rubric-based trust evaluator for bio/medical AI repositories with dual-path evidence, governance overlay, and institutional audit outputs.

About This Work

A T4 score means strong observable evidence signals. It does not mean the repository is safe for clinical deployment — that requires independent expert validation.

ai-governance, bio-ai, clinical-ai, computational-biology, llm-evaluation, medical-ai, repository-audit, research-governance, risk-governance, safety-evaluation, trust-evaluation, open-source-audit

README Core

Bio and medical AI repositories vary enormously in evidence quality — from rigorous academic tools to marketing-grade demos that carry clinical language with no data provenance, no reproducibility path, and no clinical-use disclaimer. Manual review is slow and inconsistent.
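The three gaps named above (no data provenance, no reproducibility path, no clinical-use disclaimer) are the kind of thing a script can check for in a README. The following is a minimal sketch of that idea only. The signal names, the regular expressions, and the function `find_missing_signals` are all hypothetical, chosen for illustration; they are not the rubric or code that STEM-BIO-AI uses.

```python
import re

# Hypothetical signal patterns, assumed for illustration only.
# They are NOT the rubric STEM-BIO-AI actually applies.
SIGNAL_PATTERNS = {
    "data_provenance": re.compile(
        r"\b(dataset|data source|provenance)\b", re.IGNORECASE
    ),
    "reproducibility_path": re.compile(
        r"\b(reproduc\w*|requirements\.txt|environment\.yml|dockerfile)\b",
        re.IGNORECASE,
    ),
    "clinical_use_disclaimer": re.compile(
        r"not (intended )?for clinical use|research use only", re.IGNORECASE
    ),
}


def find_missing_signals(readme_text: str) -> list[str]:
    """Return the names of signals that have no match in the README text."""
    return [
        name
        for name, pattern in SIGNAL_PATTERNS.items()
        if not pattern.search(readme_text)
    ]


if __name__ == "__main__":
    sample = "Our model diagnoses patients. Trained on the public dataset."
    # The sample mentions a dataset but has no reproducibility
    # instructions and no clinical-use disclaimer.
    print(find_missing_signals(sample))
```

A keyword match like this only shows that a signal is mentioned, not that the underlying evidence is sound, which is why a high score still calls for independent expert validation.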

Use & Documentation

Detailed installation, commands, examples, and deeper usage notes live in the repository README and docs.

README Map

  • Why STEM BIO-AI
  • Quick Start
  • Triage Tiers
  • Scoring Model
  • Architecture
  • Output Artifacts

Key Signals

  • Demo: Hugging Face Space
  • API contract: docs/API CONTRACT.md
  • Secret handling: docs/ADVISORY SECRET HANDLING.md
  • Advisory runtime boundary: docs/ADVISORY RUNTIME.md
  • Example audits: docs/EXAMPLE AUDITS.md

Announcements

synced May 17, 2026

flamehaven01 · Announcements

When Control Becomes Authority: Calibration Governance in STEM BIO-AI 1.7.x

Control slowly becomes authority when nobody marks the boundary.

flamehaven01 · Announcements

From Score to Workflow: Turning STEM BIO-AI Into a Local Audit System

Earlier in this series, I wrote about why bio/medical AI repositories need more than benchmarks, what I learned after auditing 10 public repositories, and why an AI auditor itself needs a memory contract.

flamehaven01 · Announcements

Medical AI Repositories Need More Than Benchmarks. We Built STEM-AI to Audit Trust

If you have been paying attention to GitHub over the past six months, you have seen the pattern.