Platform · Autonomous Risk Intelligence

ARIN Platform

Autonomous Risk Intelligence Network — 22 specialized AI agents form a decision council that independently analyzes every entity and produces a single explainable verdict (BUY / SELL / HOLD / AVOID) with full consensus, dissent map, and confidence score. Meta-Decision Governor, Entity Memory, Learning Agent, Product Accelerators. NVIDIA NIM inference, NeMo Guardrails, GNN systemic risk mapping.

Credit

Market

Operational

Liquidity

Regulatory

Systemic

Narrative

Counterparty

Model Risk

ESG

Physical

Sentinel

Ethicist

Geopolitical

Cyber

Macro

Supply Chain

Digital Asset

Insurance

Concentration

Legal

Tail Risk

AI Agents

Verdicts

Data Sources

LLM Models

∞

Entity Memory

Launch ARIN Platform Back to Platform

System Overview

ARIN Platform is the multi-agent AI decision engine at the core of SAA Alliance infrastructure. Unlike single-model architectures, ARIN implements a council paradigm: 22 specialized risk agents — each with distinct analytical mandates, independent reasoning chains, and domain-specific calibration — process every entity in parallel and submit individual assessments to the Meta-Decision Governor.

The Governor aggregates agent outputs using Confidence-Reliability Weighted Mean consensus, tracks dissent patterns, and produces a single explainable verdict (BUY / SELL / HOLD / AVOID) with a composite confidence score, full dissent map, and regulatory audit trail. The Entity Memory module maintains persistent context across sessions — tracking how risk profiles evolve over time, enabling longitudinal analysis and drift detection. The Learning Agent monitors prediction accuracy and continuously recalibrates agent weights based on historical performance.

All inference runs through NVIDIA NIM API (DeepSeek R1, Llama 3.3 70B, Llama 3.1 8B/70B, Nemotron) with NeMo Guardrails for compliance, safety, and factuality filtering. GPT-4o serves as a fallback provider. Graph analysis leverages PyTorch Geometric and DGL for GNN-based systemic risk detection, dependency mapping, and cascade effect simulation across interconnected entities.

Agent Roster — 22 Specialized Units

Each agent operates independently with its own reasoning chain, domain-calibrated prompts, and reliability score. The council architecture ensures no single point of analytical failure.

AG-01

Credit Risk

Default probability, credit spreads, rating migration, XGBoost scoring

AG-02

Market Risk

VaR/CVaR, volatility regime, factor exposure, drawdown analysis

AG-03

Operational Risk

Process failure, fraud detection, business continuity, key person risk

AG-04

Liquidity Risk

Bid-ask spreads, redemption pressure, cash flow coverage, funding stress

AG-05

Regulatory Risk

Compliance exposure, sanction screening, OFAC/SDN, regulatory change impact

AG-06

Systemic Risk

GNN contagion mapping, cascade simulation, interconnectedness scoring

AG-07

Narrative Intelligence

Sentiment extraction, narrative drift, disinformation detection, media pressure

AG-08

Counterparty Risk

Exposure concentration, counterparty creditworthiness, netting agreements

AG-09

Model Risk

Validation of analytical assumptions, model drift detection, backtest integrity

AG-10

ESG Risk

Environmental exposure, social governance scoring, greenwashing detection

AG-11

Physical Asset Risk

Climate hazard exposure, infrastructure vulnerability, natural catastrophe

AG-12

Sentinel

Real-time alert monitoring, anomaly detection, threshold breach, early warning

AG-13

Ethicist

Ethical compliance, bias detection, reputational risk, responsible AI audit

AG-14

Geopolitical Risk

Sovereign instability, trade war impact, sanctions chain analysis, conflict zones

AG-15

Cyber Risk

CISA KEV monitoring, attack surface analysis, data breach probability, CVE tracking

AG-16

Macro-Economic

GDP/inflation forecasting, yield curve analysis, central bank policy impact

AG-17

Supply Chain Risk

Logistics disruption, supplier concentration, commodity dependency mapping

AG-18

Digital Asset Risk

On-chain analytics, DeFi protocol exposure, smart contract risk, whale tracking

AG-19

Insurance Risk

Coverage adequacy, loss ratio analysis, reinsurance exposure, catastrophe bonds

AG-20

Concentration Risk

Portfolio concentration, sector overweight, geographic correlation, HHI index

AG-21

Legal & Litigation

Pending litigation exposure, regulatory penalties, class action tracking, IP disputes

AG-22

Tail Risk & Stress

Black swan scenarios, extreme value theory, multi-factor stress testing, correlation breakdown

Meta-Decision Architecture

Meta-Decision Governor

Aggregates 22 agent assessments using Confidence-Reliability Weighted Mean. Implements Hard Routing for domain-specific queries, Zero-Outcome Freeze for insufficient data, and Dual-Publish for contested verdicts. Produces final verdict (BUY/SELL/HOLD/AVOID) with composite confidence and full dissent map.

Entity Memory

Persistent per-entity analysis database (agent_entity_memories) tracking every risk assessment across sessions. Before each new analysis, agents receive historical context: trend direction (rising/falling/stable), anomaly detection (Z-score > 2σ flags deviations), score delta vs. previous, and 7-day trend slope. After each verdict, the analysis snapshot is saved with key factors and reasoning summary. Enables longitudinal drift detection — the system knows «entity X risk has been rising for 5 consecutive analyses» before agents even begin their assessment.

Outcome Feedback Loop

Every verdict is pre-recorded in agent_outcomes with predicted score and direction. When actual outcomes arrive via POST /api/v1/system/outcome, the system computes Brier Score per agent, triggers Platt Scaling recalibration (minimum 10 outcomes per agent), and updates the adaptive coupling matrix. The consensus engine auto-loads Platt parameters with 60-second TTL caching. Red Team adversarial validation blocks verdicts with excessive error probability. Without outcomes: frozen weights. With outcomes: self-calibrating system.

RAG Knowledge Corpus

12 structured knowledge chunks across 6 domains injected into agent context before assessment: Financial Regulation (Basel III CET1/LCR/NSFR, Solvency II SCR, TCFD/ISSB S2, IFRS 9 ECL), Climate Science (IPCC AR6 tipping points, NGFS scenarios, physical risk AAL/PML), Geopolitical (conflict escalation ladder, supply chain chokepoints), Cyber Security (NIST CSF 2.0, MITRE ATT&CK), Macro Economics (recession indicators, business cycle), Supply Chain (disruption taxonomy JIT/JIC). Domain auto-matched by object type — financial entities get Basel III, climate assessments get IPCC.

Learning Agent

Continuously monitors verdict accuracy against realized outcomes. Adjusts agent reliability weights using Platt Scaling (sigmoid calibration of raw scores) and Isotonic Regression (monotonic probability mapping). Computes Expected Calibration Error (ECE) to detect overconfidence. Red Team validation — adversarial challenge based on dissent strength, error probability, and reliability metrics. Identifies degrading agents and triggers automatic weight recalibration. 5 calibration runs for adaptive coupling matrix convergence.

Historical Events Engine

52+ curated historical events across 5 domains feeding agent stress-test context: Financial crises (Asian 1997, LTCM, Dot-com, Euro Debt, Black Monday, Barings, Tequila, Argentine, Russian), Wars (Ukraine 2022, Syria, Gulf War, Taiwan Strait), Climate (Katrina, Maria, Bushfires, Pakistan Floods, Tohoku, Heatwave 2003), Pandemics (SARS, MERS, Ebola, H1N1), Cyber attacks (SolarWinds, Colonial Pipeline, NotPetya, WannaCry, Equifax). Each event includes severity, financial loss, casualties, cascade effects, lessons learned, and recovery timeline.

Closed-Loop Agent Architecture

Every assessment flows through a 6-stage closed loop. No service is isolated — data cascades from memory to knowledge to agents to consensus to feedback and back to memory:

① Load → ② Enrich

① Entity Memory Load: Before assessment, load previous analyses for the entity — trend direction, anomaly flags, historical scores. ② Knowledge Injection: RAG corpus auto-selects relevant domain knowledge (Basel III for banks, IPCC for climate) and injects into agent context. Agents start every analysis with full institutional memory.

③ Assess → ④ Consensus

③ Agent Assessments: 22 agents run in parallel with enriched context — each seeing entity history + domain knowledge. ④ Consensus Engine: Platt-calibrated confidence-weighted aggregation with dissent detection, Red Team adversarial validation, and adaptive coupling from outcome feedback. Produces DecisionObject with full provenance chain.

⑤ Record → ⑥ Save

⑤ Outcome Pre-Record: Verdict saved to outcome table. When actual results arrive (manually or via automation), Brier Score computed → Platt recalibration triggered → coupling matrix updated. ⑥ Entity Memory Save: Current analysis persisted with delta, trend, key factors. Next assessment cycle starts at ① with accumulated intelligence.

Consensus Mechanics

ARIN does not use simple majority voting. The Governor employs a multi-layered consensus protocol designed for adversarial robustness:

Hard Routing — Domain-specific queries are routed to relevant agent subsets. A credit analysis query activates Credit, Counterparty, Concentration, and Macro-Economic agents; irrelevant agents (e.g., Cyber, Supply Chain) are excluded from the voting pool to prevent noise dilution
Confidence-Reliability Weighted Mean (CRWM) — Each agent’s vote is weighted by two factors: (a) its self-reported confidence for the current query, and (b) its historical reliability score maintained by the Learning Agent. High-confidence + high-reliability votes dominate the consensus
Zero-Outcome Freeze — If no agent achieves confidence above the configurable threshold (default: 0.45), the Governor refuses to issue a verdict and flags the entity for manual review. This prevents false certainty in data-scarce scenarios
Dual-Publish Protocol — When the council is deeply split (e.g., 11 agents BUY vs. 11 agents SELL), the Governor publishes both majority and minority verdicts with separate confidence scores, explicit reasoning chains, and a «CONTESTED» flag. No forced resolution of genuine analytical disagreement
Dissent Mapping — Every verdict includes a full dissent map showing which agents agreed, disagreed, and abstained, with individual reasoning summaries. Enables regulatory audit and explainability requirements (EU AI Act, SEC, MiFID II)

Unified Integration Layer

ARIN is the convergence point for the entire SAA Alliance ecosystem. All seven production modules export analytical data to ARIN via the Export API, enabling cross-domain synthesis:

Inbound Data Sources

Risk Analyzer — VaR/CVaR, stress test results, portfolio metrics, factor decomposition
Investment Dashboard — V2 institutional reports, Report Agent outputs, sector scans, watchlist signals
News Analytics — Sentiment scores, narrative events, RiskMirror alerts, impact graph clusters
Crypto Analytics — Token ratings (v1.4), DeFi protocol snapshots, on-chain anomalies
Global Risk Platform — Country/city/enterprise risk scores, 280-factor Unified Stress Report, digital twin outputs
External Feeds — USGS, FEMA, WHO, World Bank, IMF, OFAC SDN, CISA KEV (real-time ingestion)

Output & Export

Verdict Object — JSON: verdict, confidence, dissent_map, agent_assessments[], entity_memory_ref, audit_hash
Export Storage — PostgreSQL persistence with entity binding (portfolio, symbol, ISIN, country code)
API Endpoints — POST /api/v1/unified/export, GET /api/v1/unified/verdict/{entity_id}
WebSocket Stream — Real-time verdict updates for portfolio monitoring dashboards
Audit Trail — Immutable log of every agent assessment, Governor decision, and data input for regulatory compliance

AI Engine & Infrastructure

LLM Inference

Primary: NVIDIA NIM API — DeepSeek R1 (reasoning), Llama 3.3 70B (general analysis), Llama 3.1 8B (fast routing), Llama 3.1 70B (deep analysis), Nemotron (structured extraction). Fallback: GPT-4o. Response caching (Redis), retry with exponential backoff, multi-provider load balancing. NeMo Guardrails for compliance filtering, toxicity prevention, and factuality enforcement.

Graph & ML

Graph: Neo4j (persistent graph DB), NetworkX (analysis), PyVis (visualization), PyTorch Geometric / DGL (GNN for systemic risk and contagion). ML: XGBoost (credit scoring, classification), PyTorch (deep learning). Data: pandas, numpy, polars, scipy. All compute GPU-optimized via NVIDIA CUDA.

Backend Stack

Python 3.11+, FastAPI, Uvicorn (ASGI), Pydantic 2 (strict schema validation). SQLAlchemy 2.x + asyncpg + Alembic (migrations). PostgreSQL (users, roles, verdicts, exports). Redis (cache, Entity Memory hot store). Celery (async task queue). JWT (python-jose), Passlib (bcrypt), OAuth-ready.

Frontend Stack

Next.js 14, React 18, TypeScript. Tailwind CSS (Midnight Command design system). ECharts, D3.js, Plotly, Three.js, Recharts, react-force-graph-2d/3d (decision graph visualization). Zustand (state), TanStack React Query (data fetching), Axios. Full responsive design.

Compliance & Audit

Authentication — Registration, login, JWT tokens, OAuth-ready; role-based access control (RBAC) — Admin, Analyst, Viewer; users and roles in PostgreSQL with Alembic migrations
GDPR Compliance — Data access, export (JSON/CSV), deletion (right to erasure); audit log for all data processing operations; configurable retention policies
Regulatory Audit Trail — Immutable log of every agent assessment, consensus computation, Governor decision, and data input. Designed for SEC, MiFID II, EU AI Act, and DORA compliance review
NeMo Guardrails — NVIDIA NeMo safety framework prevents hallucination, toxicity, and non-compliant outputs. All LLM responses pass through guardrail validation before reaching the verdict pipeline
Backup & Recovery — Automated PostgreSQL backups, Redis persistence, disaster recovery procedures; 99.8% uptime SLO target

API Reference

Base: /api/v1 · Auth: Bearer JWT · Format: JSON

Health — GET /health, GET /api/v1/performance/health
Agents — GET /api/v1/agents (list all 22), GET /api/v1/agents/{id} (agent detail + reliability score)
Analysis — POST /api/v1/risks/analyze — triggers full council assessment; params: entity_id, analysis_type, agent_subset (optional)
Graph — GET /api/v1/graph, GET /api/v1/graph/visualization — entity dependency graph, systemic risk clusters
Auth — POST /api/v1/auth/register, POST /api/v1/auth/login, GET /api/v1/auth/me
Unified — POST /api/v1/unified/export, GET /api/v1/unified/exports/{entity_id}, GET /api/v1/unified/verdict/{entity_id}
Entity Memory — GET /api/v1/memory/{entity_id} — historical assessments, drift alerts
Alerts — GET /api/v1/alerts — Sentinel and threshold breach notifications
LLM — POST /api/v1/llm/generate, POST /api/v1/llm/extract — direct LLM interaction for custom queries

Related — Seven Production Modules

ARIN is the central decision engine. All other SAA Alliance modules export to it for cross-domain unified verdicts: Global Risk Platform (digital twins, cascade analysis), Risk Analyzer (VaR, stress testing), Investment Dashboard (equity research), News Analytics (narrative intelligence), Crypto Analytics (digital assets), KOKON Control Company (governance, agent calibration, multi-project rollout) — see Platform Overview.

Launch ARIN Platform All Modules How It Works Contact