Astral
Back to blog

Research · 13 min read

What 1,000 Drones Need to Coordinate: UWB Is Non-Negotiable Above 100

June 9, 2026

Scale a drone swarm from 32 to 1,000 units and something breaks. Not gradually — it breaks hard. Camera-only coordination falls 15.8 percentage points below the omniscient baseline at 1,000 drones and produces 8× more collisions: 3,620 versus 452 in the same mission. That is not a tuning problem. It is an architectural one. UWB ranging is non-negotiable above roughly 100 drones.

Four Sensing Configurations

  • Camera-only: onboard vision for neighbor detection and obstacle avoidance.
  • UWB-only: ultra-wideband radio ranging — precise distances, independent of lighting or occlusion.
  • Hybrid (camera + UWB): both modalities combined.
  • Omniscient baseline: perfect ground-truth position knowledge — the ceiling.

At 32 drones the differences are negligible. Camera-only achieves 57.5% coverage, nearly matching the 57.6% omniscient ceiling. This is the regime where most drone swarm demos live. The problem is not visible yet.

By 200 drones, camera-only has opened a 19.2 percentage-point gap (40.0% vs. 59.2%, p<0.001). UWB and Hybrid stay within 1–2 pp of omniscient at every scale. The separation is structural: cameras cannot maintain reliable neighbor awareness when mutual occlusion becomes the norm. UWB ranges through everything.

The Urban Penalty

Urban environments impose a 14–32 percentage-point coverage penalty versus open forest across all swarm sizes and sensing configurations. Two effects compound: visual range is effectively halved (30 m → 15 m in city clutter), and 22% of the map is physically occupied by buildings — coverage saturates at ~28% regardless of mission time.

At 1,000 drones, city environments produce 8–11× more collisions than forest under identical sensing. Camera-only in a city at scale is not just suboptimal — it is operationally unacceptable.

Hybrid Squad Coordination: Centralized Performance at 3.3× Less Overhead

In 16-drone, 8-target experiments:

  • Centralized: 95.0±6.1% detection, 3.15 s reaction, 192 msg/s.
  • Decentralized: 85.0±5.0% detection, 0.44 s reaction.
  • Hybrid squad-based: 95.0±6.1% detection (matching centralized), 1.08 s reaction, 58 msg/s — 3.3× less communication overhead.

For a 1,000-drone fleet: centralized at 192 msg/s per node generates ~192,000 messages/s fleet-wide; hybrid brings that to ~58,000 msg/s. At 200 bytes per message, that is the difference between 38 Mbps and 11 Mbps of coordination traffic before any payload data.

Resilience

GPS denial: with UWB present, coverage drops only 1.3 pp in city. Fleets carrying only GPS lose this fallback entirely.

Communications denial: any architecture that relies on shared state collapses to camera-only performance. UWB hardware provides no benefit when the ranging data cannot be shared.

75% drone attrition: forest coverage holds at 87.7%; city drops to 63.8%. Urban operations require 30–40% more drones to maintain equivalent resilience under attrition.

VLMs on Edge Hardware

SmolVLM-2B on Orin Nano (210 ms latency, 5.06 GB VRAM) provides a 25 pp detection advantage under total comms denial in forest (62.5% vs. 37.5%, p<0.05). In city environments, this advantage disappears: building occlusion prevents the scene context VLM reasoning depends on. Spend those resources on UWB hardware and better mesh radio for urban operations.

Requirements Summary

100-drone fleet: UWB on every airframe (~$20–50/unit), hybrid squad organization (squads of 8–16), mesh radio for 58+ msg/s per drone, GPS primary / UWB fallback.

1,000-drone fleet: UWB without exception, tiered coordination hierarchy (squad → sector → global), 30–40% reserve capacity for urban attrition, comms-denial firmware on every unit.

Full methodology and per-configuration data: Scaling the Separation Principle.