Phase 1 of 6
Scoping & Autonomy / Rollback Constraints
Define the fault domains in scope, the remediation latency budget, the autonomy level you are prepared to operate at, and the rollback guarantees every downstream architectural decision must respect.
Network Domains & Fault Surface
Identify network domains in scope for fault prediction and self-healing
Why This Matters
Fault signatures and remediation primitives differ sharply across RAN, transport, core, and site-power domains, and a single closed-loop model rarely transfers cleanly between them. O-RAN disaggregation in particular introduces RU/DU/CU fault modes that classical vendor-integrated SON models were never trained on. Inventorying every domain up front prevents the common failure mode of shipping a RAN-only self-healing model and having ops discover six months later that 40% of real outages originate in transport or power.
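As a concrete way to hold that inventory, here is a minimal sketch (Python; the domain names, telemetry sources, and action primitives are hypothetical, with real entries coming from the NOC fault-surface audit) of a per-domain registry that makes the shared-model-versus-dedicated-head question explicit per domain:

```python
from dataclasses import dataclass

@dataclass
class FaultDomain:
    """One fault surface in scope, with the telemetry and actions it exposes."""
    name: str
    telemetry_sources: list[str]       # streams/counters this domain emits
    action_primitives: list[str]       # remediations the loop may invoke here
    dedicated_model_head: bool = True  # False => candidate for a shared model

# Hypothetical inventory entries, illustration only.
DOMAINS = [
    FaultDomain("ran_5g_nr", ["gNB PM counters", "RRC traces"],
                ["cell reset", "carrier lock-out"]),
    FaultDomain("transport", ["IP link state", "microwave RSL"],
                ["path reroute"]),
    FaultDomain("site_power", ["rectifier alarms", "battery voltage"],
                ["generator start"], dedicated_model_head=False),
]

def shared_model_candidates(domains: list[FaultDomain]) -> list[str]:
    """Domains flagged to share a model rather than carry a dedicated head."""
    return [d.name for d in domains if not d.dedicated_model_head]
```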
Note prompts
+ Which domains share enough telemetry and action primitives to justify a shared model versus dedicated per-domain heads?
+ Have we inventoried every fault surface the NOC currently touches, including site power and HVAC?
+ Who owns the fault-to-domain attribution so we can measure model ROI per domain?
Required
Confirm which domains the closed-loop model must observe and remediate.
Select all that apply
Radio Access Network — 4G LTE eNodeB
Radio Access Network — 5G NR gNodeB (SA / NSA)
O-RAN disaggregated RU / DU / CU
Transport / backhaul / fronthaul (IP, microwave, fiber)
Baseband / BBU pool
Site power, rectifiers, batteries, HVAC
Core network (5GC / EPC) fault surfaces
Tower-level environmental and structural telemetry
Define end-to-end remediation latency budget
Why This Matters
The O-RAN Alliance defines three control-loop tiers — Non-RT RIC (>1s), Near-RT RIC (10ms–1s), and real-time RAN control (<10ms) — and placing a closed-loop remediation on the wrong tier is the most common architecture error. A sub-second remediation pipeline running on a Non-RT RIC cadence will consistently miss the cascade window, while pushing a Non-RT workload into the Near-RT RIC bus wastes its xApp budget. Latency decisions made after the pipeline is wired have 10× less leverage than latency decisions made before site topology is chosen.
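A minimal sketch of that placement rule (Python; the tier bounds are the O-RAN figures above, while the fault-class names and budgets are hypothetical): route each remediation to the slowest tier that still closes inside its latency budget.

```python
def control_loop_tier(budget_ms: float) -> str:
    """Map a detect->act->verify latency budget to the O-RAN loop tier."""
    if budget_ms < 10:
        return "real-time RAN control (<10ms)"
    if budget_ms <= 1_000:
        return "Near-RT RIC (10ms-1s)"
    return "Non-RT RIC (>1s)"

# Illustrative budgets only; derive real ones from measured cascade windows.
for fault_class, budget_ms in {"rf_interference": 200, "kpi_drift": 30_000}.items():
    print(f"{fault_class}: {control_loop_tier(budget_ms)}")
```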
Note prompts
+ Which fault classes actually need <500ms remediation versus a 10-second Non-RT loop?
+ Have we measured our current detection-to-action latency end to end, including NOC ticketing overhead?
+ What is our fallback behavior when the loop breaches its latency budget: no-op, escalate, or revert?
Required
Select the target latency for the full detect → classify → act → verify loop.
Single choice
< 100ms (URLLC / 5G MEC adjacent faults)
< 500ms (site-level self-healing target)
< 5s (fast Non-RT RIC loop)
< 60s (Non-RT RIC / SMO control loop)
Tiered by severity (mixed SLA)
Trinidy: Cloud-routed fault inference alone consumes 50–200ms of network round-trip before a score is computed, often past the point where the fault has already cascaded. Trinidy runs the full three-stage pipeline on-node with sub-500ms end-to-end remediation, surviving backhaul degradation.
Select target autonomy level on the TMF L0–L5 scale
Why This Matters
TMF IG1230 defines the autonomous-network maturity levels L0–L5, and most operators in production today sit between L2 and L3 — the model triages, proposes, and sometimes executes pre-approved runbooks. Overclaiming autonomy is a well-documented failure mode: a team targets L4 on paper, deploys at L2 in reality, and operates with no clear contract for what the model is actually allowed to do alone. The level you target should be driven by the regulatory, safety, and rollback posture, not by ML ambition.
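One way to make that contract explicit is a per-action-class approval gate. A minimal sketch follows (Python; the action classes are hypothetical), encoding the reading of the levels above: L2 means a human approves every remediation, L3 means a human is pulled in only for exceptions.

```python
from enum import IntEnum

class AutonomyLevel(IntEnum):
    """TMF IG1230 autonomous-network maturity levels."""
    L0 = 0
    L1 = 1
    L2 = 2
    L3 = 3
    L4 = 4
    L5 = 5

# Hypothetical contract: the honestly measured level per action class,
# not the program's stated aspiration.
ACTION_CLASS_LEVEL = {
    "cell_reset": AutonomyLevel.L3,
    "spectrum_reallocation": AutonomyLevel.L2,
}

def needs_human_approval(action_class: str, is_exception: bool) -> bool:
    """L2 and below always escalate; L3 escalates exceptions; L4+ runs alone."""
    level = ACTION_CLASS_LEVEL.get(action_class, AutonomyLevel.L1)  # conservative default
    if level <= AutonomyLevel.L2:
        return True
    if level == AutonomyLevel.L3:
        return is_exception
    return False
```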
Note prompts
+ What TMF autonomy level is each of our action classes actually operating at today, honestly measured?
+ What is the gap between our stated aspiration (often L4) and our contractual remediation scope?
+ Who signs off on raising the autonomy level for a given action class: network engineering, SRE, or compliance?
Required
Pick the autonomous-network maturity level that governs which decisions the model may make unsupervised.
Single choice
L1 — Assisted operations (human in every decision)
L2 — Partial autonomy (human approves remediations)
L3 — Conditional autonomy (human in loop for exceptions)
L4 — High autonomy (human on escalation only)
L5 — Full autonomy (aspirational / research only)
Define acceptable auto-remediation rate and escalation rate
Why This Matters
Well-tuned closed-loop SON systems (Nokia AVA, Ericsson Intelligent Automation Platform) typically land in the 60–80% auto-remediation band for routine fault classes, with 20–40% escalating to NOC. Under-automating leaves the NOC labor savings on the table; over-automating takes the human out of the loop for action classes where a bad decision can cascade across many cells. Framing the program around an auto-remediation budget per action class rather than a single number is the most direct way to match risk appetite to ML deployment.
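A minimal sketch of that framing (Python; the action classes and budget fractions are hypothetical): track an auto-remediation budget per action class and force escalation once a class runs over it.

```python
# Max fraction of each class the model may close without a human.
AUTO_BUDGET = {"cell_reset": 0.80, "power_failover": 0.40}
counts = {cls: {"auto": 0, "total": 0} for cls in AUTO_BUDGET}

def may_auto_remediate(action_class: str) -> bool:
    """True while the class is still under its auto-remediation budget."""
    c = counts[action_class]
    return c["total"] == 0 or c["auto"] / c["total"] < AUTO_BUDGET[action_class]

def record_outcome(action_class: str, auto_closed: bool) -> None:
    counts[action_class]["total"] += 1
    counts[action_class]["auto"] += int(auto_closed)
```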
Note prompts
+ What percentage of routine fault tickets last quarter could have been auto-remediated safely in hindsight?
+ Is our auto-remediation rate tracked per action class, or as a single aggregate?
+ Who owns the P&L line for NOC labor saved versus SLA breach cost from a bad auto-action?
Required
Quantify how much of routine fault volume the model is permitted to close without a human.
Single choice
< 40% auto-remediate (conservative — mostly escalation)
40% – 60% auto-remediate
60% – 80% auto-remediate (typical well-tuned SON)
> 80% auto-remediate (aggressive L3+ deployment)
Not currently budgeted at the action-class level
Establish MTTR reduction target versus current baseline
Why This Matters
Nokia Networks has publicly reported MTTR reductions from a 4.2-hour baseline to under 4 minutes on AVA closed-loop SON, and T-Mobile US reported a 70% reduction in customer-impacting events across six consecutive quarters on Ericsson Intelligent Automation Platform. These are public benchmarks, not ceilings — but they are the honest goalposts any program should calibrate against. An MTTR target without a current-baseline measurement is a slogan, not a commitment.
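The underlying arithmetic is simple; a minimal sketch follows (Python, with synthetic ticket durations) of per-fault-class p50/p95 MTTR and the reduction-versus-baseline computation:

```python
import statistics

def mttr_p50_p95(durations_min: list[float]) -> tuple[float, float]:
    """p50 and p95 MTTR for one fault class, in minutes."""
    cuts = statistics.quantiles(durations_min, n=100)
    return statistics.median(durations_min), cuts[94]

def reduction(baseline_min: float, current_min: float) -> float:
    return (baseline_min - current_min) / baseline_min

p50, p95 = mttr_p50_p95([12, 18, 25, 40, 240])  # synthetic durations
# The Nokia AVA figures quoted above: 4.2h down to 4min is a ~98.4% reduction.
print(f"{reduction(4.2 * 60, 4):.1%}")  # -> 98.4%
```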
Note prompts
+ What is our current p50 and p95 MTTR by fault class, and how was it measured?
+ Which peer operator benchmark (Nokia AVA, Ericsson IAP, Huawei iMaster) is most comparable to our fleet?
+ Do we have a per-fault-class MTTR dashboard the board sees, or only an aggregate ops metric?
Required
Specify the MTTR improvement the program commits to, benchmarked against today.
Single choice
50% MTTR reduction target
75% MTTR reduction target
90%+ MTTR reduction (Nokia AVA-class deployment)
No hard MTTR target — measure and improve
Not yet measured at the fault-class level
Map FCC NORS / DIRS outage-reporting obligations into the remediation flow
Why This Matters
FCC Part 4 (Network Outage Reporting System) requires US communications providers to report outages that last at least 30 minutes and affect 900,000 or more user-minutes, plus special-office and 911 outages on shorter thresholds; FCC DIRS is the separate disaster-information reporting regime activated during hurricanes and major events. A closed-loop system that auto-remediates can mask a reportable outage if the logging and duration accounting is not wired through the remediation flow. Regulators have been explicit that automation does not dissolve reporting obligations.
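A minimal sketch of the general-outage test described above (Python; the special-office and 911 thresholds are separate and not modeled here). The key wiring point: duration must be the raw fault duration, not the post-remediation residual.

```python
def nors_general_outage_reportable(users_affected: int,
                                   raw_outage_minutes: float) -> bool:
    """FCC Part 4 general test: >=30 minutes AND >=900,000 user-minutes."""
    user_minutes = users_affected * raw_outage_minutes
    return raw_outage_minutes >= 30 and user_minutes >= 900_000

# 30,000 users for 30 minutes hits exactly 900,000 user-minutes: reportable
# even if auto-remediation collapsed the visible impact to seconds.
assert nors_general_outage_reportable(30_000, 30)
```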
Note prompts
+ Does our closed-loop logging capture the raw outage duration even when remediation collapses it to seconds?
+ Which auto-action classes could, if they failed silently, conceal a NORS-reportable outage?
+ Who on the regulatory team has signed off on our NORS flow interaction with auto-remediation?
Required
Confirm which auto-remediation outcomes trigger FCC Part 4 outage reporting and ensure the model flow respects the obligation.
Select all that apply
FCC NORS — 30-minute outages affecting 900k+ user-minutes
FCC NORS — airport / 911 / special-office outages
FCC DIRS — active hurricane / disaster reporting
State PUC outage reporting overlay
International regulatory outage reporting (CRTC, Ofcom, BNetzA, ACMA)
No reportable outages in scope
Define rollback guarantee for every auto-remediation class
Why This Matters
ETSI GS ZSM 002 and the O-RAN WG2 Non-RT RIC architecture both treat reversibility as a first-class property of closed-loop operations: every automated action must have a defined, tested rollback path. A program that can auto-apply but cannot auto-revert effectively concentrates tail risk — one bad decision becomes a multi-hour incident because the reversal path is undefined. The time-to-revert is usually a more honest proxy for deployment maturity than the auto-remediation rate.
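A minimal sketch of reversibility as a first-class property (Python; the apply/revert callables and the guarantee value are hypothetical): every action carries its rollback and its revert-time guarantee, and every revert is timed so a p95 can be reported per action class.

```python
import time

class ReversibleAction:
    """Pairs a forward remediation with a tested rollback and a revert SLA."""

    def __init__(self, name: str, apply_fn, revert_fn, max_revert_s: float):
        self.name = name
        self.apply_fn = apply_fn
        self.revert_fn = revert_fn
        self.max_revert_s = max_revert_s
        self.revert_samples: list[float] = []  # feed a p95 revert dashboard

    def apply(self) -> None:
        self.apply_fn()

    def revert(self) -> None:
        start = time.monotonic()
        self.revert_fn()
        elapsed = time.monotonic() - start
        self.revert_samples.append(elapsed)
        if elapsed > self.max_revert_s:
            raise RuntimeError(
                f"{self.name}: revert took {elapsed:.3f}s, "
                f"breaching the {self.max_revert_s}s guarantee")

# Hypothetical usage: a cell reset with a 10-second revert guarantee.
action = ReversibleAction("cell_reset", lambda: None, lambda: None, 10.0)
action.apply()
action.revert()
```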
Note prompts
+ For every action class we have in production, is there a tested rollback with a measured p95 revert time?
+ What fraction of our action classes have no rollback path today, and why?
+ Has rollback been drilled under network stress, or only in clean lab conditions?
Required
Specify the maximum time to revert an auto-action and the conditions that force reversion.
Single choice
< 1s rollback (atomic config revert)
< 10s rollback (Near-RT RIC action revert)
< 60s rollback (Non-RT RIC / SMO-mediated)
< 5min rollback (NOC-assisted)
Rollback is best-effort / not guaranteed
Trinidy: Rollback must run inside the same on-node control loop that applied the action; a cloud-routed rollback inherits the original latency problem in reverse. Trinidy keeps the forward action and its reversal on the same site-resident runtime.
Confirm deployment topology for the inference plane
Required
Select the physical and logical deployment target for the closed-loop pipeline.
Single choice
Site-resident edge (cell-site router / DU sleeve)
Regional aggregation point (metro / MEC)
Central Non-RT RIC / SMO cluster
Operator private cloud / VPC in-region
Public cloud managed inference
Hybrid — site-edge inference + central training
Trinidy: For sub-500ms remediation with backhaul-tolerant survivability, cloud inference is physically incompatible. Trinidy is the on-site inference substrate: site-resident for RAN fault classes, regional-aggregation-resident for cross-site correlation, both on the same deployment fabric.
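The arithmetic behind that claim, as a minimal sketch (Python; the per-stage budgets are illustrative, and the transit figure is the mid-range of the 50–200ms cost quoted earlier):

```python
BUDGET_MS = 500
on_node_stages_ms = {"detect": 50, "classify": 120, "act": 150, "verify": 100}
cloud_round_trip_ms = 150  # mid-range of the 50-200ms transit cost

on_node_total = sum(on_node_stages_ms.values())           # 420ms
print(on_node_total <= BUDGET_MS)                         # True: fits on-node
print(on_node_total + cloud_round_trip_ms <= BUDGET_MS)   # False: cloud hop busts it
```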
Confirm data sovereignty and residency constraints for telemetry
Required
Map equipment, subscriber-adjacent, and configuration telemetry to jurisdictional residency requirements.
Select all that apply
EU GDPR — telemetry must remain in EU
UK GDPR — UK residency required
National lawful-intercept data cannot leave country
India / Brazil / China localization rules
Equipment-vendor telemetry-sharing contract limits
No cross-border data flow permitted for any CP/UP telemetry
Cross-border permitted under SCCs / approved vendors
Trinidy: EU GDPR, country-level lawful-intercept rules, and operator-specific equipment telemetry contracts all constrain cloud-hosted inference. Trinidy keeps telemetry, model scoring, and audit logging entirely within the operator's own perimeter: no cross-border data flow for any fault decision.
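A minimal sketch of a residency policy table (Python; the region tags and telemetry classes are hypothetical): each telemetry class maps to the jurisdictions where it may be stored or scored, and the check runs before any fault decision leaves the perimeter.

```python
RESIDENCY = {
    "subscriber_adjacent": {"eu-de", "eu-fr"},    # GDPR: must remain in EU
    "lawful_intercept":    {"in-country"},        # never crosses the border
    "equipment_pm":        {"eu-de", "us-east"},  # per vendor contract terms
}

def placement_allowed(telemetry_class: str, region: str) -> bool:
    """Deny by default: unknown telemetry classes may not be placed anywhere."""
    return region in RESIDENCY.get(telemetry_class, set())

assert placement_allowed("lawful_intercept", "in-country")
assert not placement_allowed("subscriber_adjacent", "us-east")
```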