Conversation is not a confession

AI Judgment Over Human Conversation

The model may estimate patterns. It must not become the final interpreter of human speech.

Judge events, not persons No hidden risk score Context before consequence Human review for harm

What the dossier covers

This dossier organizes the report material behind the Apocalyptic AI page. It focuses on systems that score conversation for toxicity, risk, emotion, suspiciousness, trustworthiness, compliance, or future danger.

The core public boundary is that AI may help triage concrete conduct concerns, but it must not become a person-judgment engine.

Core failure modes

Context collapse

A statement is separated from its speaker, audience, history, purpose, and conversation arc.

Dialect bias

A model treats non-dominant speech as disorder, aggression, or low quality.

Identity overcapture

Protected identity terms or political vocabularies become risk proxies.

Synthetic moral authority

A probability score becomes a final verdict by institutional habit.

Reporting overcapture

A classifier label becomes an escalation event before concrete conduct exists.

Appeal failure

Users can complain but cannot see the source, the model, the rule, or the reason.

Safe design baseline

Conversation systems should publish the categories they enforce, preserve the source record, log model/version/rule identifiers, audit protected-language impact, and reserve high-impact consequences for human review.

The system must not use conversation analysis to create hidden person scores for morality, loyalty, psychological worth, or future danger.

The model may flag an event. It must not define the soul.

The archive studies symbols. It does not appoint targets. Review the Community Baseline and Editorial Policy before submitting dangerous or symbolic material.

Community Baseline / Editorial Policy