Evidence · medicine

Differential diagnosis
with a reason for every ruling

Medicine is one of the domains we built the engine against, because a wrong call costs something here and the right call has to be defensible. A patient describes symptoms. FROS checks 30,981 diagnoses against formally proven rules and returns a ranked differential with exclusion certificates showing why each diagnosis was ruled in or out. Every ruling is inspectable, and it is one piece of a larger system. See how the engine keeps a model coherent over a long task.

See it in action

FROS DDx Engine

A patient describes crushing chest pain, sweating, and nausea. FROS identifies the clinical findings, weighs all candidate diagnoses, and returns a ranked differential. It runs from the patient's description alone.

By the numbers

Verified medical knowledge

30,981

Diagnoses

Each with machine-checked rules derived from medical knowledge bases.

855

Clinical vignettes

100% hit rate on curated vignettes. 78.4% on out-of-sample USMLE / MedQA questions, engine-only.

100%

Engine precision

The engine is logically precise. Every exclusion is a proof the clinician can inspect.

How it differs

Every ruling out comes with a certificate

AI-based diagnosis

Neural models produce ranked lists with probability scores. "Myocardial infarction: 73% likely." The score arrives bare: the reasoning stays hidden, the alternatives it weighed go unrecorded, and tomorrow the same input can return a different answer.

FROS diagnosis

The engine tests each diagnosis against the patient's findings using formally verified rules. Diagnoses are ranked by how much evidence supports them. Each exclusion comes with a certificate showing exactly which findings ruled it out. The same inputs always yield the same answer, with a full trace you can inspect.

Disambiguation in medicine

Words matter in clinical text

Medical language is full of ambiguity. "Cold" can mean temperature or illness; "discharge" can mean release from hospital or bodily fluid; "positive" can mean good news or a concerning test result.

FROS resolves these ambiguities before clinical reasoning begins. The same engine that achieves 94.5% on standard NLP benchmarks processes clinical text, ensuring that downstream diagnosis operates on resolved meanings rather than surface words. Every sense assignment and every diagnosis exclusion is produced by the deterministic engine, which owns the decision path end to end. This eliminates an entire class of errors where ambiguous terms match the wrong diagnoses.

See NLP benchmarks

Operator-led pilots running now

Get pilot access Email Daniel

Differential diagnosiswith a reason for every ruling