Platform ·Atlas Control Tower

Atlas — The Autonomous Control Tower

The System of Action — not another dashboard.

✦ 50–60% reduction in control-tower manpower at Latin America's largest airline group; a measured shift from reactive to proactive operations.

Dhruv Agrawal

Co-Founder & COO

The problem it solves

A conventional control tower is a wall of red rows. A shipment is late — a row turns red. A temperature excursion — a row turns red. A customer calls — a row turns red. Then someone picks up a phone. The industry has spent fifteen years perfecting the view and almost no years closing the loop. Operators end up staffing the dashboard — twenty, forty, sometimes a hundred people reading red rows, calling carriers, messaging depots, filing incidents, writing post-mortems. The cost isn’t just headcount; it’s the gap between detection and resolution, where SLA breaches live. By the time someone has read the red row, opened the case, found the carrier contact, got a promise back, and typed the update into three systems — the customer has already called, the claim has already landed, the truck has already idled for another forty minutes at the dock. Atlas exists to close that loop. Detection, decision, and action — in the same platform, on the same event.

What it is

Atlas is Shipsy’s Autonomous Control Tower — the System of Action that orchestrates the AgentFleet agents (Clara, Nexa, Vera, Astra) across a single event-driven surface. Where a traditional control tower shows you that a shipment is late, Atlas detects the lateness, classifies the incident, picks the next-best action, dispatches the right agent to execute it, and records the outcome — in a closed loop. It is built on an event bus, not a pollable database. Every scan, every GPS ping, every temperature reading, every driver app event becomes a first-class signal. Policies, contracts, SLAs, and business rules become executable playbooks. What’s new-age about it: Atlas doesn’t replace the operator — it replaces the operator’s waiting. The person who used to read red rows now reviews exceptions. The person who used to type into three systems now approves or overrides. The rest is closed automatically.

Core capabilities

Capability	What it does
Event-driven architecture	Kafka-backed bus. Every shipment event, GPS ping, scan, POD, temp reading, carrier API callback is a first-class signal. No polling, no stale dashboards — the whole platform reacts in real time.
Unified operational surface	Shipments, trips, drivers, customers, carriers, exceptions, cost — one pane, not twelve. Filtered, faceted, and contextually grouped. The operator sees the problem and the handle in one screen.
Closed-loop incident management	Detect → classify → decide → execute → verify → log. Each stage is instrumented, each handoff is audited, each outcome feeds tomorrow’s playbooks. 17+ auto-detected incident types out of the box.
Next-best-action engine	For each incident type, a playbook ranks possible actions by expected outcome (SLA recovery, cost, customer impact) and either executes autonomously or offers the top three to a human supervisor.
Agent orchestration	Atlas is the conductor — it decides when to hand an incident to Clara (CX), Astra (re-route), Nexa (settlement), or Vera (dispute). Agent handoffs are event-driven, not workflow-scripted.
Policy and SLA engine	Hard SLAs, soft SLAs, carrier contracts, customer-specific rules — declared as data, enforced as code. A breach doesn’t just turn red; it triggers the enforcement action defined in the contract.
Stress-factor and incident coupling	Detects when multiple lightweight anomalies combine into a high-risk situation — a driver running hot on hours, a peak-window surge, a weather alert, a carrier scorecard dip — and escalates before any single threshold trips.
Live KPI dashboards with BI	Real-time KPI boards for each persona (depot supervisor, regional ops, CxO), plus no-code BI for historical analysis. Same data, same definitions, same source of truth.
White-label tracking and merchant portal	Customer-facing tracking pages, merchant portals, and webhook feeds run off the same event bus — so the public view is never out of sync with the internal view.
Carrier and 3PL integration bench	240+ carrier integrations, with lightweight mobile-app fallback for small local providers — so adding a new last-mile partner takes days, not weeks.
Autonomous post-mortem	Every closed incident auto-generates the timeline, the RCA, and the dispatched playbook. The operator reviews, not writes.
Three-tier autonomy governance	Configurable per incident type and per customer — from “suggest only” to “autonomous with guardrails.” Ops leadership owns the tier map; compliance owns the audit trail.

How it works

Atlas is an event bus with an intelligence layer on top. Every source system — driver app, GPS, scanner, WMS, ERP, carrier API — pushes events onto the bus. The Signal Processor enriches, deduplicates, and classifies. The Incident Engine runs detection rules and ML classifiers to promote signals into named incidents (late arrival, temperature excursion, failed delivery, dock dwell anomaly). The Playbook Orchestrator picks the right playbook for the incident type, consults the Policy Engine (SLAs, contracts, autonomy tier), and dispatches work — either directly (webhook, API call, driver-app nudge) or by handing off to a sister agent. The Closure Verifier watches for the resolution event and writes the outcome back into the training ledger.

graph TB A[Driver App] --> B[Event Bus] C[GPS / Telemetry] --> B D[Scanners / Hub Ops] --> B E[WMS / ERP] --> B F[Carrier APIs] --> B B --> G[Signal Processor] G --> H[Incident Engine] H --> I[Playbook Orchestrator] I --> J[Policy Engine] J --> K{Autonomy Tier} K -->|Tier 3| L[Agent Handoff] K -->|Tier 2| M[Suggest and Approve] K -->|Tier 1| N[Supervisor Queue] L --> O[Clara / Astra / Nexa / Vera] O --> P[Closure Verifier] M --> P N --> P P --> Q[Training Ledger]

In execution, the value of Atlas shows up in the time between events. A single late arrival becomes a sequence of actions — proactive customer comms, re-route planning, carrier scorecard update, dispute pre-audit — all fired in seconds, not hours.

sequenceDiagram participant DRV as Driver App participant ATL as Atlas participant CLA as Clara participant AST as Astra participant NEX as Nexa participant CUS as Customer DRV->>ATL: ETA drift +35 min (stop 4) ATL->>ATL: incident.classify (late_arrival, SLA risk) ATL->>ATL: playbook.select (customer-not-present-prevention) ATL->>CLA: proactive comms task (stop 4 and downstream) CLA->>CUS: WhatsApp ETA update + confirm presence CUS-->>CLA: reschedule to evening slot ATL->>AST: re-plan stops 4-9 (remove 4, insert evening slot) AST-->>ATL: new sequence published ATL->>NEX: pre-audit trip cost vs contract NEX-->>ATL: within rate card, no dispute ATL->>ATL: incident.close + write ledger

Proven outcomes

Customer type and scale	Outcome
Latin America’s largest airline group, USD 12–13B revenue, 500,000+ tonnes transported	~50–60% reduction in control-tower manpower; ~25% reduction in vehicle preparation/loading time; 5–10% working-hour savings; measured shift from reactive to proactive ops
Southeast Asia’s largest air-logistics network, 164 hubs across APAC	20+ hours/week ops savings; single dashboard for orders, rider performance, weekly business reviews
A leading national postal operator powering 200+ countries	12–18%+ cost reduction; 25% reduction in manual workload; 90% first-attempt delivery rate
A leading UK newspaper distributor, £1.1B revenue, 24,000 retail locations	10–15% reduction in package processing time; 10–12% driver productivity increase; 10–15% NPS lift
One of Asia’s largest quick-commerce arms, 5M+ orders/month, 200+ dark stores	~82% reduction in COD loss; ~30% decrease in RTO; ~21% reduction in cost-per-delivery

Integrations

Atlas is the substrate — integrations are how it listens and speaks:

ERP — Oracle, SAP, Microsoft Dynamics, NetSuite
WMS and YMS — Manhattan, Blue Yonder, SAP EWM, custom via REST
Driver and fleet telemetry — Wialon, Wheelseye, native Driver App, third-party GPS providers
Carrier APIs — 240+ integrations including IndiGo cargo, regional LTL, hyperlocal 3PLs, international parcel networks
CX and CRM — Salesforce Service Cloud, Zendesk, Freshdesk, WhatsApp Business API, Voice over SIP
Quality and compliance — Veeva QMS, LIMS, cold-chain temperature loggers
Data platforms — Snowflake, BigQuery, Databricks, Azure Data Lake — every event is stream-exported

Deployment

Phase 1 — Discovery (weeks 1–2). Event inventory, incident taxonomy, SLA and contract ingestion. We name the top 15 incident types — the ones that drive 80% of the control-tower workload today — and define the target playbook for each.

Phase 2 — Configuration (weeks 2–6). Event bus wiring (source systems pushed, not polled), policy engine loaded with SLAs and contracts, playbook library configured, agent handoffs wired. Shadow mode runs alongside the incumbent control tower.

Phase 3 — Pilot (weeks 4–8). One region or lane goes live. Tier 1 autonomy (suggest only) for the first two weeks. Tier 2 (autonomous for low-risk incident types) unlocks once plan-vs-actual scoring is within tolerance.

Phase 4 — Scale (weeks 10–16). Progressive regional rollout, Tier 3 autonomy for mature incident types, manpower re-deployment (control-tower operators become exception analysts). Governance dashboard in the hands of ops leadership.

Most enterprises reach steady-state in 12–16 weeks. The 50–60% manpower reduction typically lands by month four. Success criteria are pre-agreed — manpower deployed, mean time to detection, mean time to resolution, SLA adherence, working hours saved. Governance is weekly in the first quarter, monthly thereafter.

Security and compliance

SOC 2 Type II, ISO 27001, GDPR-ready data handling
21 CFR Part 11, GDP, and GMP-aligned audit trails for pharma and cold-chain deployments
Full audit trail on every incident — signals, classification, playbook, actions, approvers, outcome
Three-tier confidence and autonomy scoring, configurable per incident type and per customer
Human-in-the-loop gating for high-risk actions — cross-border re-routes, cost-policy breaches, cold-chain exception overrides
Data residency options — EU, India, Middle East, APAC regional hosting
Number masking, OTP/POD verification, QR-COD, geofencing, fraud detection primitives built in

Case study callouts

Southeast Asia’s largest air-logistics network · 164 hubs

A single dashboard replaced fragmented hub-level visibility across 164 hubs and 207 fleets. 20+ hours/week ops savings and a 20% driver productivity increase — driven by Atlas’s event-driven architecture orchestrating Astra planning and Clara CX in real time.

→ Read the full case study

One of Asia’s largest quick-commerce arms · 5M+ orders/month

Atlas’s policy engine reduced COD loss by ~82%, RTO by ~30%, and cost-per-delivery by ~21%. During the Big Billion Days peak, the same surface handled a ~37% month-over-month jump without staffing up the control tower.

→ Read the full case study

Frequently Asked Questions

Is Atlas a control tower or a TMS?

Neither, and both. A TMS plans and executes trips; a control tower watches them. Atlas is the System of Action that binds the two — it detects, decides, and dispatches across whatever TMS, WMS, and carrier systems you already have. Customers who adopt Shipsy's TMS get tighter agent handoffs; customers who keep their existing TMS get Atlas as an orchestration layer on top.

How long does deployment typically take?

Most enterprises go live in 12–16 weeks, with a pilot region in weeks 4–8. The 50–60% manpower reduction lands by month four in mature deployments. Peak-season readiness in 90 days.

What incident types are covered out of the box?

Seventeen-plus, with the top fifteen covering ~80% of control-tower workload: late arrival, failed delivery, customer-not-present, dock dwell, temperature excursion, carrier SLA breach, driver no-show, attendance deficit, address-quality failure, COD variance, ETA drift, NDR-prone, stress-factor combinations, and more. Customers extend the taxonomy with custom incident types via the policy engine.

How do you handle exception cases and human-in-the-loop?

Three autonomy tiers, configurable per incident type. Tier 1 — Atlas suggests, operator confirms. Tier 2 — autonomous for low-risk incidents, operator confirms for high-risk. Tier 3 — autonomous end-to-end with policy guardrails. Ops leadership owns the tier map; every action is audited.

Does Atlas replace control-tower operators?

It replaces control-tower *staffing*. Operators shift from reading red rows to managing exceptions, tuning playbooks, and owning governance. In mature deployments, a team of forty becomes a team of fifteen exception analysts — same throughput, higher-value work. Retention goes up.

How does Atlas handle peaks (Black Friday, BBD, festival seasons)?

Peaks are when closed-loop orchestration earns its keep. A major quick-commerce arm absorbed a ~37% month-over-month jump during Big Billion Days without adding control-tower headcount — because the incremental load was autonomously resolved, not queued for a human.

Can it plug into our existing BI and data lake?

Yes. Every event is stream-exported to Snowflake, BigQuery, Databricks, or a custom data lake. The KPI definitions match — no reconciliation between the control-tower view and the executive dashboard. Historical analytics runs on the same event stream.

What does the pricing model look like?

Shipsy does not publish pricing publicly. Engagements are scoped by network size (shipments, drivers, hubs, carriers) and the depth of Atlas modules deployed. Talk to us via [Request a demo](/request-a-demo) and we'll share a commercial proposal within two business days.

How does Atlas work with sister agents like Clara and Astra?

Atlas is the conductor. Each agent subscribes to a set of event types and a set of incident classes. When the playbook for a late-arrival incident fires, Atlas sends a proactive-comms task to Clara, a re-plan task to Astra, and a pre-audit task to Nexa — in parallel, with policy-aware sequencing. The agents don't call each other directly; they handshake through Atlas's event bus and playbook orchestrator. This is what makes the loop closable without hard-coding workflows.

What does a typical governance review look like?

Weekly in the first quarter, monthly thereafter. The dashboard shows autonomy-tier usage by incident type, mean time to detect/resolve, SLA adherence, and cost-per-incident trend. Ops leadership approves tier changes; compliance reviews the audit trail sample. Most customers tune their tier map every month in year one and every quarter in year two.

Can Atlas coexist with our existing SAP TM / Oracle OTM / Manhattan deployment?

Yes. Atlas reads events from the incumbent TMS/WMS via their native APIs or a lightweight adapter, and writes actions back the same way. In sidecar mode, Atlas is the orchestration brain on top of the existing stack. Most customers eventually collapse planning and CX into the Shipsy agents once they see the handoff speed, but that is a choice, not a requirement.