Previously generated synthetic protocols and uploaded source protocols are available for re-processing:
| Job ID | Status | Step | Progress | Created |
|---|---|---|---|---|
| job_c4591001_clinica... | COMPLETED | DONE | 100% | 2026-03-22 07:43 |
| job_c4591001_clinica... | COMPLETED | DONE | 100% | 2026-03-21 22:34 |
| job_c4591001_clinica... | COMPLETED | DONE | 100% | 2026-03-21 21:42 |
| job_c4591001_clinica... | COMPLETED | DONE | 100% | 2026-03-21 20:27 |
| job_prot_sap_000_202... | COMPLETED | DONE | 100% | 2026-03-20 21:53 |
Current Cortex field registry: 238 fields across 15 modules. Uploads run the full Cortex pipeline by default, and the current rules are versioned from the canonical rule loaders.
Models: Cortex is controller-led.
- Tier 1: narrow deterministic extraction (~120 fields).
- Tier 2: manifest-gated learned ranking (~30 fields), currently operating in lexical fallback mode (no trained CrossEncoder bundle).
- Tier 3: universal non-system LLM extractor with bounded self-critique (~82 fields); primary model: ministral3-14b (Ollama Cloud).
- Tier 4: selective arbitration for hard fields; model: qwen3.5 (Ollama Cloud).
Zero OpenAI.
Data: Protocol PDFs are processed with Docling-first extraction. The active path is Docling text + table, yielding an evidence pack with page and line IDs. PyMuPDF is fallback-only when Docling returns no usable lines. Synthetic ground truth comes from synthetic_protocols. Artifacts remain immutable in GCS and job state in Firestore.
Process: An upload or synthetic generation creates the job. The worker runs extraction and builds the evidence pack; Cortex applies hybrid semantic zoning and field planning; bounded controller rounds then execute Tier 1, Tier 3, and selective Tier 4 before validation emits design_output_v1. Schema, Mapping, Amendment, and Validator continue downstream. Progress, step timings, token usage, and estimated cost are exposed on GET /api/v1/jobs/{id}.
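The job endpoint's payload can be summarized client-side. A minimal sketch, assuming a hypothetical response shape (the `status`, `step`, `progress`, and `usage` keys are illustrative, not the documented schema):

```python
# Hypothetical sketch: summarize a GET /api/v1/jobs/{id} payload.
# The field names below are assumptions, not the documented response schema.

def summarize_job(payload: dict) -> str:
    """Render a one-line status string from a job payload."""
    usage = payload.get("usage", {})
    return (
        f"{payload['job_id']}: {payload['status']} "
        f"(step={payload['step']}, progress={payload['progress']}%, "
        f"tokens={usage.get('total_tokens', 0)}, "
        f"cost=${usage.get('estimated_cost', 0.0):.4f})"
    )

example = {
    "job_id": "job_demo",
    "status": "COMPLETED",
    "step": "DONE",
    "progress": 100,
    "usage": {"total_tokens": 12345, "estimated_cost": 0.0421},
}
print(summarize_job(example))
```

In practice this string would be rendered from the live API response rather than a hard-coded dict.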
Synthetic scenarios (generator-native): BASELINE, DESIGN_CHALLENGE, SCHEMA_CHALLENGE, MAPPING_CHALLENGE, AMENDMENT_CHALLENGE, VALIDATOR_STRESS, FULL_STRESS, STATUS_CHALLENGE, NOISE_CHALLENGE. All run the short path (MODE_A) except VALIDATOR_STRESS and FULL_STRESS, which take the long path.
Contract-first pipeline with 238 fields across 15 modules (rules vdefs:3.4 gates:3.0). Each agent consumes upstream artifact(s) and emits one versioned JSON output. Mapping and Amendment run in parallel after Schema. Full audit trail and cost tracking per step.
_1_design.json → _2_schema.json → _3_mapping.json + _4_amendment.json → _5_validated.json
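The artifact chain above can be sketched as a sequential builder in which each step consumes only the upstream artifact, and Mapping and Amendment both branch off Schema. The step bodies here are placeholders; only the artifact names and dependency shape come from the document:

```python
# Minimal sketch of the contract-first artifact chain. Each stage emits one
# versioned JSON artifact; the dict values are placeholders for real contracts.

def run_pipeline(protocol_text: str) -> dict:
    artifacts = {}
    artifacts["_1_design.json"] = {"source_chars": len(protocol_text)}
    artifacts["_2_schema.json"] = {"from": "_1_design.json"}
    # Mapping and Amendment both consume Schema and can run in parallel.
    artifacts["_3_mapping.json"] = {"from": "_2_schema.json"}
    artifacts["_4_amendment.json"] = {"from": "_2_schema.json"}
    artifacts["_5_validated.json"] = {
        "from": ["_3_mapping.json", "_4_amendment.json"]
    }
    return artifacts
```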
Cortex processes the protocol with Docling-first extraction, builds an evidence pack, applies hybrid semantic zoning, and prepares field plans before entering bounded controller rounds. Tier 3 is the main non-system extractor, Tier 1 stays narrow, Tier 2 is manifest-gated, and Tier 4 is selective arbitration for hard fields. Validation and trace outputs stay explicit.
Deterministic normalization and CT mapping from Design records. Driven by Schema workbook rules and CDISC CT dictionary. Ownership modes: PASSTHROUGH, SINGLE-CT, UCUM, DERIVED.
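The four ownership modes amount to a deterministic dispatch per field. A hedged sketch follows: the mode names come from the document, but the lookup tables and the DERIVED rule are invented stand-ins:

```python
# Hedged sketch of ownership-mode dispatch for CT mapping. Mode names are from
# the document; the dictionaries and derivation logic are illustrative only.

CT_DICT = {"phase 3": "C15602"}               # hypothetical CDISC CT lookup
UCUM_UNITS = {"mg": "mg", "milligram": "mg"}  # hypothetical UCUM table

def map_value(mode: str, value: str) -> str:
    if mode == "PASSTHROUGH":
        return value                            # copy verbatim
    if mode == "SINGLE-CT":
        return CT_DICT.get(value.lower(), value)  # single controlled term
    if mode == "UCUM":
        return UCUM_UNITS.get(value.lower(), value)  # unit normalization
    if mode == "DERIVED":
        return value.strip().upper()            # stand-in for a derivation rule
    raise ValueError(f"unknown ownership mode: {mode}")
```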
Assembly engine: builds SDTM TS rows from schema output using the mapping matrix. Deterministic row builder (no AI). MVP: Study Definition + Study Design (31 fields).
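A deterministic, table-driven row builder of this kind can be sketched as below. The TSPARMCD codes are real SDTM examples, but the input shape and the mapping matrix are assumptions for illustration:

```python
# Sketch of a deterministic SDTM TS row builder: one row per mapped field,
# purely table-driven, no AI. Input shape and matrix are assumptions.

def build_ts_rows(study_id: str, fields: dict, matrix: dict) -> list[dict]:
    rows = []
    for field_name, tsparmcd in matrix.items():
        if field_name in fields:
            rows.append({
                "STUDYID": study_id,
                "DOMAIN": "TS",
                "TSPARMCD": tsparmcd,
                "TSVAL": fields[field_name],
            })
    return rows

rows = build_ts_rows(
    "C4591001",
    {"study_title": "A Phase 3 Study", "phase": "PHASE III TRIAL"},
    {"study_title": "TITLE", "phase": "TPHASE"},
)
```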
Computes N vs N-1 field-level diff for same protocol ID with severity tagging. Change types: ADD/MODIFY/DELETE. Severities: MAJOR (CT code changed), MINOR (SDTM but no CT), COSMETIC.
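The severity rule is a simple precedence check: a CT code change wins over an SDTM value change, which wins over anything else. A sketch, assuming hypothetical `ct_code`/`sdtm_value` keys on the per-field records:

```python
# Sketch of the N vs N-1 severity rule: MAJOR if the CT code changed,
# MINOR if an SDTM value changed without a CT change, else COSMETIC.
# The record keys (ct_code, sdtm_value) are assumptions.

def classify_change(old: dict, new: dict) -> str:
    if old.get("ct_code") != new.get("ct_code"):
        return "MAJOR"
    if old.get("sdtm_value") != new.get("sdtm_value"):
        return "MINOR"
    return "COSMETIC"
```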
Merges all upstream contracts into unified validated records with combined confidence, QC flags, and human review tasks. Two-stage confidence: verifier (design-dominant weighting) then gating (evidence + agreement + rules + extraction quality).
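The two-stage shape can be sketched as a design-dominant weighted score followed by a gate over the four signals. The weights and thresholds below are illustrative, not the shipped values:

```python
# Hedged sketch of two-stage confidence: a design-dominant verifier score,
# then gating on evidence, agreement, rules, and extraction quality.
# Weights and thresholds are illustrative assumptions.

def verifier_score(design: float, schema: float, mapping: float) -> float:
    return 0.6 * design + 0.2 * schema + 0.2 * mapping  # design-dominant

def gate(score: float, has_evidence: bool, agreement: bool,
         rules_pass: bool, extraction_ok: bool) -> str:
    if score >= 0.8 and all([has_evidence, agreement, rules_pass, extraction_ok]):
        return "AUTO_ACCEPT"
    if score >= 0.5:
        return "HUMAN_REVIEW"
    return "BLOCKED"
```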
REST API (FastAPI + Uvicorn). All endpoints under /api/v1/.
Every extracted value is tied to evidence (page, section, quote, bounding box) and preserved through all downstream contracts with full rule and model trace.
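One way to make that guarantee concrete is an immutable record that carries its evidence and trace with it. A minimal sketch; the field names are illustrative, not the actual contract schema:

```python
# Sketch of an evidence-linked value record. Field names are assumptions;
# frozen dataclasses keep the record immutable through downstream contracts.
from dataclasses import dataclass

@dataclass(frozen=True)
class Evidence:
    page: int
    section: str
    quote: str
    bbox: tuple[float, float, float, float]  # x0, y0, x1, y1

@dataclass(frozen=True)
class ExtractedValue:
    field_name: str
    value: str
    evidence: Evidence
    trace: tuple[str, ...] = ()  # rule/model IDs that touched the value
```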
Tier 3 is the universal non-system extractor, but Cortex stays evidence-driven. Deterministic rules validate outputs, Tier 1 stays narrow, and Tier 4 arbitrates hard disagreements instead of applying a fixed winner order.
Each agent reads upstream artifacts only. No downstream component re-reads raw text after Design Agent. Immutable artifacts in GCS; operational state in Firestore.
Low-confidence, conflicting, or blocked records are flagged with explanation and evidence for manual resolution. AI review tips assist but never auto-resolve.
Domain-aware batch sizing with parallel execution. Zone-filtered evidence reduces token usage by 60-80%. Full cost tracking per agent and per API call.
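The zone-filtering savings come from dropping evidence lines whose zone is irrelevant to the field plan. A sketch under assumed zone labels and line shape:

```python
# Hedged sketch of zone-filtered evidence packing: keep only lines whose zone
# matches the field plan, which is where a 60-80% token reduction can come
# from. Zone labels and the line dict shape are assumptions.

def filter_evidence(lines: list[dict], wanted_zones: set[str]) -> list[dict]:
    return [ln for ln in lines if ln["zone"] in wanted_zones]

lines = [
    {"zone": "objectives", "text": "Primary objective..."},
    {"zone": "dosing", "text": "Dose: 30 ug"},
    {"zone": "appendix", "text": "Signature page"},
    {"zone": "appendix", "text": "Abbreviations"},
    {"zone": "appendix", "text": "References"},
]
kept = filter_evidence(lines, {"objectives", "dosing"})
reduction = 1 - len(kept) / len(lines)  # fraction of lines dropped
```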
Every pipeline step, review action, and status transition is logged as an audit event with timestamps, token usage, and cost breakdown.