V2V Energy-Safe Swarm Explorer · network battery safety

Each robot is bound by an individual safety constraint: its current state of charge must leave enough budget to return to a central depot. With vehicle-to-vehicle (V2V) energy exchange, the constraint relaxes from "depot is reachable" to "some peer with surplus is reachable, and they can ferry me home." Explore how the effective coverage, task throughput, and fragility shift as the safety property moves from individual to network.

© 2026 Theodore P. Pavlic · MIT License

Worker V2V on
t0.0 s
live0
pinned0
stranded0
lost0

Tasks

Active
0
Tasks done
0
Tasks expired
0
Safety vetoes
0
Edges (j → i)
Help (active)
0
Feasible
0
Reachable
0

Robots

Max reach
Past solo-reach
0%
V2V-dependent
0
Mean SoC
0%
Peak reach
Mean reach
Time at outer 80%
0%
Time past solo
0%

SoC distribution

Reference implementations

Starter templates: Levy-walk foraging, depot return, metered V2V transfer on contact, and the one-hop safety filter. They omit the widget's multi-hop chain reachability and the refueler role — just enough scaffolding to extend on your own.
The safety condition, formally

Let bi be robot i's state of charge, pi its position, α the energy cost per unit distance, and d(·,·) a distance metric on the workspace (Euclidean in this simulator). Robot i is directly safe when its reachable disk contains the depot:

bi ≥ α · d(pi, depot)

Robot i is V2V-safe via peer j when both clauses hold:

bi ≥ α · d(pi, pj) (reachability) bi + bj ≥ α · (d(pi, pj) + 2 · d(pj, depot)) (surplus sufficiency)

The first clause is a pure proximity condition: i can physically drive to j's position. The second is an energy-balance condition: once at the rendezvous, j's surplus sj = bj − α · d(pj, depot) covers i's deficit δi(j) = α · (d(pi, pj) + d(pj, depot)) − bi. The first gives the broad reachability graph (red edges in the visual); the conjunction gives the strict help-feasibility subgraph (blue edges). The gap between them is "wasted proximity" — peers i could reach but who can't save it.

The conditions are per-pair: there is no general "helper" property. A given j may save some is with small deficits while being unable to save others. Rather than counting "helper robots," we count the directed j → i edges that could feasibly provide help (the "Feasible" count) and the subset of them that are actually being acted on (the "Help (active)" count). That is, each feasible edge is a directed j → i edge representing a viable hand-off from a robot with enough surplus to one whose deficit is small enough to cover in the field. Then, each "Help (active)" edge is a feasible edge that has been committed to — i is en route to its pinned helper j, where the transfer completes.

The condition is the same whether j stays deployed after the in-place exchange (default) or accompanies i home (ferry toggle), since j reserves α · d(pj, depot) for itself either way. Operationally, when i commits to j, j is pinned: it stops wandering and waits in place until i arrives and the transfer completes. This keeps the rendezvous deterministic; without it, helpers can drift away mid-approach.

The pairwise test above is the one-hop condition, and it is exactly what the three reference implementations compute: a robot is safe if it is directly safe, or if some single peer j satisfies both clauses. (Their check bi − α·d(pi,pj) + sj ≥ α·d(pj,depot) rearranges to the surplus-sufficiency clause verbatim.) With V2V off it collapses to the direct condition alone, which is why conservative solo motion never strands a robot.

The widget itself runs the multi-hop generalization. Every frame it assigns each robot a safety depth by breadth-first search over the help graph: depth 0 is the direct-safe set; a robot is depth d if it can reach a depth-(d−1) peer, take that peer's surplus, and from there reach the depot or another already-safe peer — i.e. a whole chain i → j → k → … → depot. This is the recursive, fixed-point form of "helper" on the help graph, exactly where a network-CBF or graph-theoretic invariant would live.

One caveat is worth stating plainly, because it is the whole reason V2V trades safety for reach: that multi-hop set is an optimistic, instantaneous estimate. It is recomputed from a frozen snapshot each frame and assumes every peer holds its position and donates its full surplus — assumptions the wandering peers break the moment anyone moves or drains, which is what lets a robot over-commit to a chain that then dissolves beyond its solo reach. Only the direct-safe set (depth 0) is a true invariant; the one-hop set is nearly one when the helper is a station-holding refueler. Turning the multi-hop layer into a guaranteed charge-safety certificate — via reserved donors, stationary anchors, or a margin sized for worst-case peer drift — is the open problem, not something this simulator claims to have solved. Its job is to show that such chain strategies reach measurably more tasks, not to prove they are safe.

How motion is filtered

When free of return commitments, each robot does a Levy walk: alternating long ballistic transits (step lengths drawn from a heavy-tailed Pareto distribution, μ ≈ 1.8) with bouts of area-restricted local search. There is no global task sensing — robots only discover task sites by physically passing within contact range. The motion is filtered by the safety condition: the proposed step must keep the robot inside its safe set — just the direct-safe set when V2V is off, or the full multi-hop safe set (any robot that can chain home through a sequence of peers, per the breadth-first safety-depth pass described above) when V2V is on. The reference implementations use the simpler one-hop version of this set. When the desired step would leave the safe set, the desired direction is blended toward the depot until it lies inside.

When SoC drops below a threshold, the robot commits to a return: it heads straight for the depot if possible, otherwise toward the nearest helper that can feasibly cover its deficit. On reaching a helper, the helper transfers just enough charge to make the receiver directly safe (with a small margin), then heads home. Task engagement burns extra energy; if engaging would push the robot off the safe set, it abstains and the encounter counts as a missed opportunity. The robot then ignores that particular task until it recharges (so a single under-budget encounter is counted only once, not every frame the robot lingers nearby).

Refuelers are the deliberate exception to this forage-everywhere default. When the Refueler role is enabled, a designated fraction of the swarm is pulled out of the Levy walk and assigned to hold station at fixed shell radii (a single ring at 0.5× solo-reach, or inner and outer rings at 0.33× and 0.67×). Depending on the motion mode they hold position exactly, drift tangentially around their shell, or roam on a short leash and engage only the tasks that come within it. The point is to turn ad hoc, position-dependent peer help into a dependable spatial scaffold: a worker that runs low near a shell can count on a donor being parked there to dock with, instead of hoping another forager happens to be carrying surplus nearby.

What to look for

With V2V off, the team is bounded by the smallest reachable disk that still contains the depot — a hard individual constraint. With V2V on, the coverage expands because peers near the depot have surplus to spend on saving others. Turn on the help-feasibility layer and watch a directional structure emerge: peers near the depot with high SoC accumulate outgoing edges (they can save many), while peripheral peers with low SoC accumulate incoming edges (they have many potential saviors). There's no fixed split into "donor" and "recipient" robots — the same robot may simultaneously show feasibility edges in both directions. The current SoC of a robot and its distance from the depot help decide which way it leans. That structure is also coupled and fragile: when a peer's surplus shrinks — it engaged a task, or drifted away — its outgoing feasibility edges silently vanish, potentially stranding dependents that were counting on it. The motion filter keeps each robot individually safe but does nothing to preserve those edges, so network-level V2V safety can't be taken for granted; guaranteeing it would mean filtering motion against the help graph as a whole — exactly where a network-CBF invariant would live.

Read the trade off the metric panels rather than the animation. Whether V2V actually bought range shows up in Peak reach and Mean reach — the all-time and time-averaged farthest distance any robot held, as a multiple of solo-reach — together with Time past solo and Time at outer 80%: a value above 1.0×, or any nonzero time past solo, means a robot held ground no individual could have sustained alone. The cost shows up opposite it, in Tasks done versus Tasks expired — charge spent ferrying and time spent pinned is throughput not spent closing tasks — and in Safety vetoes, the encounters a robot declined because engaging would have left it unsafe (the "missed opportunities" from the motion filter). Mean SoC and the SoC distribution panel show the energy state underneath all of it: the harder V2V is working, the more charge should spread apart — depleted robots out at the periphery on the low end, topped-up donors near the depot on the high end.

To compare regimes cleanly, let the cumulative metrics build up over a minute or so, then change one thing and rerun on a comparable task stream: toggle Worker-to-worker V2V to isolate what peer help adds, or enable the Refueler role to ask whether a dedicated, spatially-fixed donor class outperforms ad hoc worker-to-worker transfer.

Related Work

Related work

The literature this work draws on splits into two strands. The first is the biological precedent — social insect colonies that have organized cooperative energy transfer around a central nest for a great deal longer than engineering has had vehicles. The second is the engineering literature on formal safety frameworks for multi-robot operations, which is gradually catching up to that precedent without yet stitching the network-safety invariant fully together.

Biological inspiration: trophallaxis, spatial foraging, self-assembly, rescue behavior, and central-place theory

Energy circulates in ant colonies through mouth-to-mouth liquid exchange (trophallaxis), in-nest storage castes (repletes), and spatially structured forager deployment — and, when the collective demands it, individuals abandon the primary task altogether to become living structure, as in army-ant bridges and walls or fire-ant rafts, or to retrieve a trapped or injured nestmate from the field. These are the structural elements this work is trying to formalize for robotic swarms. The papers below establish that natural precedent and motivate the heterogeneous "refueler" mode in particular, which asks whether peer-to-peer energy transfer needs division of labor (in role or in space) — and whether that division might arise on its own, from an agent's position in the help graph, rather than being assigned in advance. Underneath all of it sits central-place foraging theory — the optimal-foraging account of how the distance to a home or replenishment point modulates the value of foraging beyond it. That point need not be the depot: a helper met in the field can act as a mobile, relational central place, so the same distance-modulates-value logic may recur at more than one scale rather than attaching to the depot alone. And when those mobile helpers fall out of reach and a worker is left stranded or injured, some species answer with a further structural layer — precisely directed rescue by specialized first responders, a retrieval mode that activates only after ordinary sharing has failed.

Trophallaxis and colony-level energy regulation

Greenwald, E., Baltiansky, L., & Feinerman, O. (2018). Individual crop loads provide local control for collective food intake in ant colonies. eLife, 7, e31730. doi:10.7554/eLife.31730
WhatReal-time fluorescent tracking of liquid food in every ant of a Camponotus sanctus colony, showing that foragers unload to many receivers in small portions and that the collective food intake rate emerges from local trophallactic interactions modulated by crop fullness.
InspirationThe biological counterpart of the V2V donation rule: an agent with surplus distributes to multiple peers in proportion to their deficit, and the colony-level outcome (steady inflow) is an emergent property of these local energy transfers — closely paralleling how this simulator's V2V edges propagate reachability.
Baltiansky, L., Frankel, G., & Feinerman, O. (2023). Emergent regulation of ant foraging frequency through a computationally inexpensive forager movement rule. eLife, 12, e77659. doi:10.7554/eLife.77659
WhatShows that the linear relationship between foraging frequency and colony hunger emerges from a simple movement rule rather than from a complex internal decision, with the forager's spatial trajectory inside the nest mediating the connection between her crop state and her exit decision.
InspirationDemonstrates that nontrivial colony-level energy-budget regulation can emerge from purely local, geometry-based individual rules — the same design philosophy underlying the per-robot V2V safety filter used here.

Spatial cooperative foraging and stationary relays

Denton, K. K., & Nonacs, P. (2018). Habitat complexity and predictability effects on finding and collecting food when ants search as cooperative groups. Animal Behaviour, 141, 77–84. doi:10.1016/j.anbehav.2018.05.004
WhatExperimental study of Argentine ant (Linepithema humile) cooperative search showing colonies sacrifice independent finding ability in favor of aggregating to amplify response to discovered food.
InspirationThe earliest empirical evidence cited by Nonacs's group for the structural insight that a colony may benefit from concentrating a stationary fraction of agents at intermediate locations to serve as encounter relays — directly analogous to placing refueler robots at intermediate shells.
Nonacs, P. (2025). Ant foraging: optimizing self-organization as a solution to a traveling salesman problem. Oecologia, 207, 73. doi:10.1007/s00442-025-05720-5
WhatEvolutionary-optimization (genetic algorithm) simulation of ant-like agents foraging across spatial networks of varying connectivity and modularity, compared against L. humile experiments. Explicitly articulates that at the group level it may be optimal to concentrate a stationary fraction of foragers at locations where they meet successful returning agents (citing Denton & Nonacs 2018).
InspirationThe closest published statement of the "refueler shell" hypothesis — that a heterogeneous spatial allocation (stationary intermediaries plus mobile foragers) is a candidate optimal strategy for cooperative groups searching beyond the reach of any single agent.
Hunt, E. R., Franks, N. R., & Baddeley, R. J. (2020). The Bayesian superorganism: externalized memories facilitate distributed sampling. Journal of the Royal Society Interface, 17(167), 20190848. doi:10.1098/rsif.2019.0848
WhatExperimental evidence and MCMC-inspired model showing that Temnothorax albipennis colonies use cuticular hydrocarbon footprints as a shared external memory that prevents redundant exploration of the same space.
InspirationThe genus (Temnothorax) most often invoked in conjectures about externally-deployed repletes as intermediate refueling stations. Documented uses of externalized substrate in this genus are spatial-memory markers rather than energy depots; whether comparable structural use of agents themselves as relays exists in this taxon remains an open question that this simulator's refueler mode is partly meant to motivate.
Plowes, N. J. R., Johnson, R. A., & Hölldobler, B. (2013). Foraging behavior in the ant genus Messor (Hymenoptera: Formicidae: Myrmicinae). Myrmecological News, 18, 33–49. doi:10.25849/myrmecol.news_018:033
WhatA review of foraging behavior across the seed-harvesting genus Messor, with Fig. 4 showing that M. pergandei colonies form daily foraging columns whose direction can shift from day to day, terminating in a distal fan of individual foragers at a new angular position each cycle.
InspirationA M. pergandei foraging column is a transient spatial bridge from nest to foraging fan, forming in a chosen direction, persisting briefly, then dissolving and reforming elsewhere — the closest biological analog to the V2V chains that form, fail, and reform between the depot and the region where the work is happening.

Self-assembled structures and task diversion

Reid, C. R., Lutz, M. J., Powell, S., Kao, A. B., Couzin, I. D., & Garnier, S. (2015). Army ants dynamically adjust living bridges in response to a cost–benefit trade-off. Proceedings of the National Academy of Sciences, 112(49), 15113–15118. doi:10.1073/pnas.1512241112
WhatField experiments showing Eciton army ants build living bridges from their own bodies to shortcut gaps in the foraging trail; the bridges lengthen, widen, and migrate with traffic and terrain, and ants stay locked in place only while the structure is used, dissolving it once the cost of immobilized workers outweighs the benefit of the shortcut — all without any individual sensing the global cost or benefit.
InspirationThe clearest template for what the simulator does not yet do: individuals diverting from the primary task into a stationary, load-bearing role, held there by ongoing use and released when no longer worthwhile. It suggests V2V workers could self-select into refuelers — or into a load-bearing "phalanx" that ferries energy outward from the depot — based on their position and the traffic through them, rather than being designated in advance.
Graham, J. M., Kao, A. B., Wilhelm, D. A., & Garnier, S. (2017). Optimal construction of army ant living bridges. Journal of Theoretical Biology, 435, 184–198. doi:10.1016/j.jtbi.2017.09.017
WhatA mathematical framework extending the Reid et al. cost–benefit picture, converting the benefit (shorter trail) and the cost (workers pulled from the foraging pool) into the common currency of colony foraging rate and predicting optimal bridge position and formation across obstacle geometries.
InspirationThe optimization-theoretic companion: it shows the divert-into-structure decision has a well-posed optimum — committed builders versus freed foragers — exactly the local cost–benefit calculus a self-organized refueler allocation would have to approximate.
Baudier, K. M., & Pavlic, T. P. (2020). Incidental interactions among Neotropical army-ant colonies are met with self-organized walls of ants (Hymenoptera: Formicidae). Myrmecological News, 30, 251. doi:10.25849/myrmecol.news_030:251
WhatDocuments transient, double-layered living walls that army ants self-assemble at incidental encounters with other raiding colonies (and non-prey ants such as leafcutters), establishing avoidance by diverting workers into a defensive boundary structure built from individuals that would otherwise be raiding.
InspirationA second self-assembly modality — a wall rather than a bridge — triggered and sustained by local encounters. That encounter-driven onset is the cue analog the simulator could use to recruit refuelers exactly where help-demand (encounter rate in the feasibility graph) runs highest.
Mlot, N. J., Tovey, C. A., & Hu, D. L. (2011). Fire ants self-assemble into waterproof rafts to survive floods. Proceedings of the National Academy of Sciences, 108(19), 7669–7673. doi:10.1073/pnas.1016658108
WhatShows Solenopsis invicta fire ants link their bodies within minutes into a buoyant, water-repellent raft, with individuals taking and holding structural positions that trade their own mobility for the survival of the collective.
InspirationThe canonical case of individuals becoming load-bearing structure; together with the army-ant bridges and walls it establishes self-assembly from diverted individuals as a widespread social-insect solution, motivating an emergent rather than a pre-assigned refueler caste.

Rescue behavior in social insects

Nowbahari, E., Scohier, A., Durand, J.-L., & Hollis, K. L. (2009). Ants, Cataglyphis cursor, use precisely directed rescue behavior to free entrapped relatives. PLoS ONE, 4(8), e6573. doi:10.1371/journal.pone.0006573
WhatFirst experimental demonstration that desert ants (then Cataglyphis cursor, now C. piliscapa) free trapped nestmates with a precisely targeted behavioral sequence — sand digging, limb pulling, and biting at the nylon snare itself — and discriminate sharply, offering aid only to nestmates and not to heterocolonial, heterospecific, or motionless test stimuli.
InspirationEstablishes rescue as a real, distinct, and discriminative form of altruism in social insects — structurally separate from foraging or defense. The biological precedent for a dedicated rescue layer in the swarm control stack, in which a stranded robot is retrieved rather than abandoned when V2V chain safety has already failed.
Hollis, K. L., & Nowbahari, E. (2022). Cause, development, function, and evolution: Toward a behavioral ecology of rescue behavior in ants. Learning & Behavior, 50(3), 329–338. doi:10.3758/s13420-022-00515-7
WhatTinbergen-four-questions synthesis of two decades of rescue research in C. piliscapa, integrating the precise behavioral sequence, when in an ant's life rescue capability emerges, the survival benefit measured at the colony level, and the heritability and patriline structure that make rescue a specialized first-responder role rather than a generic helping behavior.
InspirationFrames rescue as a heritable specialization with its own developmental schedule, not a side-effect of being available. Directly motivates the design choice between assigning a rescue role in advance and letting it emerge as a learned specialization in a heterogeneous swarm — the same question that arises for the refueler caste here.
Frank, E. T., Schmitt, T., Hovestadt, T., Mitesser, O., Stiegler, J., & Linsenmair, K. E. (2017). Saving the injured: Rescue behavior in the termite-hunting ant Megaponera analis. Science Advances, 3(4), e1602187. doi:10.1126/sciadv.1602187
WhatField study showing that M. analis workers injured during termite raids — typically having lost an extremity or with termites clinging to them — are carried back to the nest by nestmates and recover; two mandibular-gland compounds (dimethyl disulfide and dimethyl trisulfide) released by the injured ant trigger retrieval, and a model estimates the behavior sustains a ~29% larger colony.
InspirationA structurally different rescue regime from Cataglyphis: not extraction from a trap but retrieval of a depleted or disabled worker from the field, initiated by a chemical signal from the patient. The closer biological analog to a stranded robot, and a template for patient-initiated rescue routed through broadcast advertising rather than relying on V2V as the sole survival mode.
Frank, E. T., Wehrhahn, M., & Linsenmair, K. E. (2018). Wound treatment and selective help in a termite-hunting ant. Proceedings of the Royal Society B, 285(1872), 20172457. doi:10.1098/rspb.2017.2457
WhatExtends the M. analis rescue picture with triage and post-rescue care: lightly injured workers are retrieved and then receive intense allogrooming at the wound (which cuts 24-hour mortality from ~80% to ~10%), whereas heavily injured workers (five or more extremities lost) are not rescued — and the decision is regulated not by the helper but by the injured ant's own unresponsiveness, so the patient effectively self-selects.
InspirationEstablishes the bounded-rescue half of the design space: not every depleted agent must be saved. Rescue is governed by a patient-side cost-benefit signal, making it tractable as feedback-controlled service rather than a hard safety invariant — the empirical foundation for the weaker "eventually rescued within bounded time" framing of battery safety, as a counterpart to the chain-safe-at-all-times invariant the V2V filter enforces today.

Central-place foraging theory

Olsson, O., Brown, J. S., & Helf, K. L. (2008). A guide to central place effects in foraging. Theoretical Population Biology, 74(1), 22–33. doi:10.1016/j.tpb.2008.04.005
WhatA general patch-use model of central place foraging that catalogs how distance from the central place raises foraging costs, giving-up densities, and quitting harvest rates, with the effects amplified by metabolic cost, predation risk, and load-carrying cost.
InspirationThe formal account of why a fixed return point makes value decay with distance — the biological statement of the solo-reach boundary, and the baseline that V2V and refuelers are designed to push outward by relocating the central place.
Stephens, D. W., & Krebs, J. R. (1986). Foraging Theory. Princeton University Press.
WhatThe foundational synthesis of optimal foraging theory, including the central-place and patch-residence models that formalize the travel-cost-versus-gain trade-offs facing a forager tied to a home base.
InspirationThe canonical source for the currency-and-constraint framing borrowed here: net energetic gain maximized subject to a return-trip budget.

Engineering literature related to energy-safe multi-robot operations

Much of the engineering side of this problem sits at the intersection of three established themes that each address a piece of it without quite stitching the network-safety invariant together: (1) energy as a state to be safety-managed in multi-robot operations, including the closely-related rendezvous-recharging and peer-to-peer power-transfer literatures; (2) network properties as motion-controlled invariants, where the canonical case is communication connectivity preserved by the proximity graph; and (3) the recent methodological lift of CBFs from individual to graph-structured safety. The gap between these themes — a graph-theoretic, energy-aware extension of CBF safety where the invariant lives on the help graph rather than on individual robots — is the natural research opening, with canonical anchors in Ames et al. 2017 (CBF foundations, extended for high-relative-degree, adaptive, and event-triggered safety by the Xiao–Belta and Xiao–Belta–Cassandras line), Notomista–Ruf–Egerstedt 2018 (battery-as-CBF), Mathew–Smith–Waslander 2015 (rendezvous-recharging, building on the Bullo–Frazzoli–Pavone–Savla–Smith 2011 dynamic-vehicle-routing foundation and the Cassandras–Lin–Ding 2013 / Pourazarm–Cassandras 2016 energy-aware extensions), and Nikoletseas–Raptis–Raptopoulos 2017 (P2P energy balance among mobile peers). Two further strands are included because they bear on questions the safety framing alone does not: bio-inspired central-place and multiple-place foraging, which treats the depot itself as something that can be replicated or relocated, and encounter-driven self-assembly, which shows how an agent can latch into a structural role from purely local cues — the route by which refuelers might emerge rather than be assigned.

Battery-aware control barrier functions

Ames, A. D., Xu, X., Grizzle, J. W., & Tabuada, P. (2017). Control barrier function based quadratic programs for safety-critical systems. IEEE Transactions on Automatic Control, 62(8), 3861–3876. doi:10.1109/TAC.2016.2638961 · arXiv:1609.06408
WhatFoundational paper unifying control Lyapunov functions and control barrier functions in a real-time QP, providing the forward-invariance machinery on which nearly all subsequent multi-robot CBF work is built.
GapSingle-agent and safety-set-agnostic — no notion of inter-agent resource sharing or a network-level safe set.
Ames, A. D., Coogan, S., Egerstedt, M., Notomista, G., Sreenath, K., & Tabuada, P. (2019). Control barrier functions: theory and applications. 18th European Control Conference (ECC), 3420–3431. doi:10.23919/ECC.2019.8796030
WhatCanonical tutorial survey of CBFs covering safe sets, higher-relative-degree extensions, robustness and multi-robot examples.
GapSurveys individual-agent safety conditions and at most pairwise inter-robot constraints; does not address resource (energy) transfer or graph-structured invariants.
Notomista, G., Ruf, S., & Egerstedt, M. (2018). Persistification of robotic tasks using control barrier functions. IEEE Robotics and Automation Letters, 3(2), 758–763. doi:10.1109/LRA.2018.2789848
WhatIntroduces "persistification": augment each robot's state with its battery level and use a CBF to keep the robot in the set "can still reach a charger." This is essentially the per-robot depot-safety condition that the V2V network condition relaxes.
GapEach robot's CBF is private — there is no mechanism for one robot to donate reachability to another, so the invariant is exactly the individual constraint this work generalizes.
Notomista, G., & Egerstedt, M. (2021). Persistification of robotic tasks. IEEE Transactions on Control Systems Technology, 29(2), 756–767. doi:10.1109/TCST.2020.2978913 · arXiv:1903.05810
WhatJournal extension formalizing persistification, including coverage tasks and charging-station coordination via CBFs.
GapStill assumes fixed charging infrastructure and per-robot battery safety; explicitly handles mutually exclusive use of a single charging station rather than mutual energy donation.
Wang, L., Ames, A. D., & Egerstedt, M. (2017). Safety barrier certificates for collisions-free multirobot systems. IEEE Transactions on Robotics, 33(3), 661–674. doi:10.1109/TRO.2017.2659727
WhatDevelops decentralized pairwise CBFs in QP form for guaranteed collision-free multi-robot motion, with the canonical centralized-to-decentralized decomposition argument.
GapPairwise constraints are symmetric "avoid each other" — they do not encode cooperative-help feasibility nor anything depending on a third point (the depot).
Fouad, H., & Beltrame, G. (2022). Energy autonomy for robot systems with constrained resources. IEEE Transactions on Robotics, 38(6), 3675–3693. doi:10.1109/TRO.2022.3175438
WhatApplies CBFs to enforce both energy sufficiency (no robot runs out of battery) and coordination constraints (mutually exclusive use of a shared charging station) for a multi-robot system with limited charging infrastructure, with real-robot experiments demonstrating the approach.
GapThe closest existing work applying CBF safety guarantees to per-robot energy levels in a multi-robot setting. The gap is the transfer modality: robots queue for a fixed station rather than donating energy peer-to-peer in the field, so the help-graph structure and the α·distance donor feasibility condition are absent.

High-order, adaptive, and event-triggered control barrier functions

Xiao, W., & Belta, C. (2022). High-order control barrier functions. IEEE Transactions on Automatic Control, 67(7), 3655–3662. doi:10.1109/TAC.2021.3105491 · arXiv:1903.04706
WhatGeneralizes the standard CBF construction to safety constraints of arbitrary relative degree by recursively defining auxiliary barrier functions whose forward invariance implies the original constraint, with penalty and parameterization methods to maintain QP feasibility under tight control bounds.
GapSingle-agent and unstructured at the swarm level. Invariants that depend on a third location (the depot) or on the existence of a feasible help relation between robots fall outside the construction without an explicit network extension.
Xiao, W., Belta, C., & Cassandras, C. G. (2022). Adaptive control barrier functions. IEEE Transactions on Automatic Control, 67(5), 2267–2281. doi:10.1109/TAC.2021.3074895 · arXiv:2002.04577
WhatAdds time-varying penalty and relaxation functions inside the HOCBF construction so the QP remains feasible under time-varying control bounds and noisy dynamics, with the penalty functions themselves stabilized by control Lyapunov functions on auxiliary states.
GapAddresses feasibility under disturbed dynamics for a single robot's own safety set. The network of who-can-help-whom that V2V introduces, and the feasibility of donor surplus as a controlled invariant, are not in the formulation.
Xiao, W., Belta, C., & Cassandras, C. G. (2023). Event-triggered control for safety-critical systems with unknown dynamics. IEEE Transactions on Automatic Control, 68(7), 4143–4158. doi:10.1109/TAC.2022.3202088 · arXiv:2103.15874
WhatReformulates the safety-critical QP as an event-triggered scheme in which the controller is updated only at events defined by sensed error states relative to adaptive affine dynamics, with explicit conditions guaranteeing the CBF constraint between events — eliminating the need for continuous online model identification while preserving safety.
GapThe closest methodological precedent for a hybrid V2V setting, since charging is discrete and rendezvous-triggered. But the events here are dynamics-error events for a single robot's own safety, not network events such as "a feasible helper has entered range" — the swarm-level safety set the V2V filter defends is not yet inside this framework.

Persistent monitoring with battery / refueling constraints

Smith, S. L., Schwager, M., & Rus, D. (2012). Persistent robotic tasks: monitoring and sweeping in changing environments. IEEE Transactions on Robotics, 28(2), 410–426. doi:10.1109/TRO.2011.2174493 · arXiv:1102.0603
WhatSeminal LP-based speed controller for keeping a "dirt field" bounded with mobile sensors on closed paths; defines what "persistent" formally means for multi-robot coverage.
GapAssumes inexhaustible energy — robots never need to return home — so does not engage the depot-safety constraint at all.
Mathew, N., Smith, S. L., & Waslander, S. L. (2015). Multirobot rendezvous planning for recharging in persistent tasks. IEEE Transactions on Robotics, 31(1), 128–142. doi:10.1109/TRO.2014.2380593
WhatPlans routes for a separate team of charging robots that rendezvous with workers to recharge them mid-mission; formulates the meeting-location optimization as a TSP variant.
GapChargers are dedicated tankers; there is no peer-to-peer help among workers, and the safety condition is implicit in the schedule rather than enforced as a closed-loop network invariant.
Mitchell, D., Corah, M., Chakraborty, N., Sycara, K., & Michael, N. (2015). Multi-robot long-term persistent coverage with fuel-constrained robots. IEEE International Conference on Robotics and Automation (ICRA), 1093–1099. doi:10.1109/ICRA.2015.7139312
WhatFormulates persistent coverage as a discrete optimization with multiple fueling depots, requiring each target to be within L/2 of a depot.
GapDepot infrastructure is static and per-robot; the "L/2" reachability condition is exactly the individual safety invariant we generalize to a graph condition.
Asghar, A. B., Sundaram, S., & Smith, S. L. (2025). Multi-robot persistent monitoring: minimizing latency and number of robots with recharging constraints. IEEE Transactions on Robotics, 41, 236–252. doi:10.1109/TRO.2024.3502497 · arXiv:2303.08935
WhatApproximation algorithm for finding the minimum-fleet schedule meeting latency and discharge-time constraints around a single recharging depot.
GapRecharging depot is exclusive and stationary, and energy flows only depot-to-robot — there is no V2V transfer and no notion of one robot's surplus extending another's safe operating envelope.
Latif, E., Gui, Y., Munir, A., & Parasuraman, R. (2021). Energy-aware multi-robot task allocation in persistent tasks. arXiv preprint arXiv:2112.15282. arXiv:2112.15282
WhatA distributed, energy-conscious task-allocation scheme for continuous (unlimited-foraging) missions that schedules exploration, collection, and trips to a fixed charging station to keep robots alive while maximizing resources returned to a collection bin.
GapSurvivability comes from who-does-what-and-when scheduling against fixed charging infrastructure, not from peer transfer; workers cannot extend one another's range and there is no closed-loop motion-safety invariant — though the foraging-with-a-home framing is the same one used here.
Cassandras, C. G., Lin, X., & Ding, X. (2013). An optimal control approach to the multi-agent persistent monitoring problem. IEEE Transactions on Automatic Control, 58(4), 947–961. doi:10.1109/TAC.2012.2225539
WhatFoundational multi-agent persistent monitoring: cooperating agents control their motion to minimize an uncertainty metric over a mission space; in 1D the optimum reduces to a sequence of switching points and dwell times, analyzed as a hybrid system via Infinitesimal Perturbation Analysis (IPA) for on-line gradient-based optimization.
GapAgents are assumed inexhaustible, so the depot-return constraint and any notion of helper-extended reachability are outside the framework. Included here as the IPA-on-hybrid-systems backbone on which the energy-aware extensions in this subsection build.
Pourazarm, S., Cassandras, C. G., & Wang, T. (2016). Optimal routing and charging of energy-limited vehicles in traffic networks. International Journal of Robust and Nonlinear Control, 26(6), 1325–1350. doi:10.1002/rnc.3409
WhatRoutes energy-constrained vehicles through a network of charging nodes to minimize total elapsed (travel + charging) time, decomposing the single-vehicle mixed-integer nonlinear program into tractable sub-problems and extending to multi-vehicle congestion via flow-based formulations.
GapEnergy lives on a road graph with discrete recharging at fixed nodes; there is no peer-to-peer transfer, no continuous-state robot pose, and no closed-loop motion-safety invariant. The recharging decision is a planned route stop, not a real-time donation between agents that can each choose to be helper or helped.
Bullo, F., Frazzoli, E., Pavone, M., Savla, K., & Smith, S. L. (2011). Dynamic vehicle routing for robotic systems. Proceedings of the IEEE, 99(9), 1482–1504. doi:10.1109/JPROC.2011.2158181
WhatCanonical survey of Dynamic Vehicle Routing (DVR) for robotic systems: stochastic-demand spatial queueing, scaling laws for vehicle count and wait time, single- and multi-vehicle policies, and decentralized variants.
GapAt this level of abstraction vehicles are available-on-demand servers without intrinsic energy state — demand is exogenous and the vehicle is the resource, not vice versa. Refueling and helper-mediated reachability are absent by design; included as the upstream methodological foundation on which the Cassandras and Smith energy-aware extensions build.
Sindi, Y., Pipe, T., Winfield, A., Dogramadzi, S., & Melhuish, C. (2008). A feasibility study for energy autonomy in multi robot search and rescue operations. In L. Marques et al. (Eds.), Advances in Mobile Robotics: Proceedings of the 11th International Conference on Climbing and Walking Robots, 1146–1153. doi:10.1142/9789812835772_0137
WhatProposes and simulates an energy supply chain concept for search-and-rescue: a line of small robots relays power from a depot to a leading robot operating beyond its solo range, deriving a mathematical cost function for the energy overhead of sustaining the chain at varying depths.
GapThe chain is fixed and pre-deployed with a single purpose (keep the leader alive), not a freely moving swarm where any robot may become helper or receiver depending on encounter geometry. The cost-function framing is complementary to the formal chain-safety invariant the V2V filter aims to enforce, but the two have not been connected.
Kumar, N., Lee, J. J., Rathinam, S., Darbha, S., Sujit, P. B., & Raman, R. (2025). The persistent robot charging problem for long-duration autonomy. IEEE Robotics and Automation Letters, 10(3), 2191–2198. doi:10.1109/LRA.2024.3524897
WhatFormulates optimal recharging-schedule design for a heterogeneous fleet as an Integer Linear Programming problem, minimizing charging-station utilization (or equivalently maximizing servicing time) subject to all robots' periodic recharge requirements, and evaluates the formulation against a reference scheduling algorithm.
GapScheduling-centric: the problem is to determine initial charge states and sequencing so robots can share a fixed station without conflict. There is no peer transfer between field robots and no closed-loop motion-safety invariant — the energy constraint is handled in planning rather than in real-time control.

Mobile charging stations / tanker robots

Zebrowski, P., & Vaughan, R. T. (2005). Recharging robot teams: a tanker approach. 12th International Conference on Advanced Robotics (ICAR), 803–810. doi:10.1109/ICAR.2005.1507500
WhatFoundational "tanker" idea: a dedicated robot ferries energy to workers via breadcrumb-trail navigation, prefiguring much of the later mobile-charging literature.
GapThe tanker is a specialized agent whose only job is energy delivery; workers cannot help each other, so the network never re-organizes the safety set as the V2V scheme does.
Kundu, T., & Saha, I. (2021). Mobile recharger path planning and recharge scheduling in a multi-robot environment. IEEE/RSJ IROS. doi:10.1109/IROS51168.2021.9636078 · arXiv:2102.12296
WhatSMT-based hypercycle planning to schedule rendezvous between mobile-recharger robots and worker robots.
GapChargers are dedicated and assumed never-empty in the relevant horizon — the planning problem is offline and does not maintain a continuous-time network safety invariant.
Notomista, G., Mayya, S., Emam, Y., Kroninger, C., Bohannon, A., Hutchinson, S., & Egerstedt, M. (2022). A resilient and energy-aware task-allocation framework for heterogeneous multirobot systems. IEEE Transactions on Robotics, 38(1), 159–179. doi:10.1109/TRO.2021.3102379 · arXiv:2105.05586
WhatCombines CBF-style energy constraints with online task allocation for heterogeneous teams, surviving robot dropouts via slack-variable prioritization.
GapEnergy is consumed but not transferable — robots reallocate tasks, not battery, so the failure mode is task abandonment rather than peer rescue.

Energy trophallaxis: concept, simulation, and hardware

Ngo, T. D., & Schiøler, H. (2006). An approach to sociable robots through self-distributed energy. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2192–2199. doi:10.1109/IROS.2006.282559
WhatIntroduces the concept of robot energy trophallaxis — biologically inspired from social-insect food sharing — as a mechanism for sustaining a population of mobile robots beyond individual battery limits, with preliminary simulations of battery-exchange behavior among co-located peers.
GapThe exchange requires physical co-location for battery swap (not wireless or wired continuous transfer), and the energy-sharing rule is not linked to a formal safety invariant or reachability condition. The closest ancestor to the V2V filter concept presented here.
Ngo, T. D., & Schiøler, H. (2007). Randomized robot trophallaxis: from concept to implementation. IEEE International Conference on Systems, Man and Cybernetics (SMC). doi:10.1109/ICSMC.2007.4414153
WhatFormalizes the randomized-encounter model for robot-to-robot energy sharing: robots encounter each other stochastically (proportional to density and motion), exchange energy on contact, and the aggregate effect on population survival is simulated and analyzed.
GapThe sharing rule is triggered by spatial proximity without regard to whether the receiver actually needs the energy or whether the donor has enough surplus to return safely — the two conditions the V2V one-hop filter makes explicit.
Ngo, T. D., Raposo, H., & Schiøler, H. (2007). Potentially distributable energy: towards energy autonomy in large populations of mobile robots. IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA). doi:10.1109/CIRA.2007.382875
WhatExtends the trophallaxis concept to large robot populations, introducing the notion of "potentially distributable energy" — the surplus energy that is in principle available for sharing across the population — and modeling how population-level energy autonomy scales with robot count and exchange rate.
GapPopulation-level analysis without per-robot safety conditions: distributable surplus is computed globally rather than maintained as a local invariant at each robot, so no individual guarantee of safe return is provided.
Ngo, T. D., Raposo, H., & Schiøler, H. (2008). Multiagent robotics: toward energy autonomy. Artificial Life and Robotics, 12(1), 47–52. doi:10.1007/s10015-007-0440-1
WhatJournal synthesis of the energy-autonomy concept across the preceding conference papers: formal definition of energy autonomy for multi-robot systems, analysis of the conditions under which a population can sustain itself indefinitely through trophallaxis, and comparison of centralized versus decentralized sharing policies.
GapEnergy autonomy is defined at the population level (the collective does not run out) rather than at the individual level (each robot can always return). The individual-safety dimension — the core of the V2V reachability invariant — is not addressed.
Ngo, T. D., & Schiøler, H. (2008). Truly autonomous robots: hardware design for an energy trophallactic robot. Artificial Life and Robotics, 12(1), 335–345. doi:10.1007/s10015-007-0495-z
WhatDescribes the CISSbot — a physical robot platform designed for energy trophallaxis via autonomous battery exchange. Each robot carries a battery pack in a mechatronic cradle that can be removed and passed to a neighbor; the paper details the mechanical, electrical, and power-management design required to make physical energy handoff reliable between autonomous mobile robots.
GapPhysical battery exchange is a fundamentally different (and mechanically more demanding) transfer modality than the wireless or wired continuous-current transfer assumed in this simulator — exchange is discrete and slow, requiring physical docking. The work establishes that peer energy transfer in real hardware is feasible, but the latency and reliability constraints differ substantially from the instantaneous transfer model used here.
Ngo, T. D., & Schiøler, H. (2006). Sociable robots through self-maintained energy. International Journal of Advanced Robotic Systems, 3(4), 295–302. doi:10.5772/5724
WhatJournal companion to the IROS 2006 conference paper: extends the CISSbot framework with a probabilistic formulation of self-maintained energy, detailed hardware architecture of the battery-exchange cradle, and experimental validation of the decision function that governs whether a robot should donate, accept, or decline an energy exchange.
GapAs with the conference paper, energy sufficiency is evaluated at the population level via the probabilistic model; there is no per-robot return-to-depot invariant and no distance-dependent feasibility cost on the donation decision.
Kim, J.-O., & Moon, C. (2015). A vision-based wireless charging system for robot trophallaxis. International Journal of Advanced Robotic Systems, 12, 177. doi:10.5772/62043
WhatA vision-guided wireless charging system enabling one mobile robot to recharge another: a transmitter robot uses a camera and Bayesian pose estimator to align its coil with a marker on the receiver, achieving precise inductive alignment without physical docking and validating the approach across several receiver robot types.
GapSolves the hardware rendezvous problem (coil alignment for wireless transfer between mobile peers) but the charging decision is manual or pre-scripted — there is no autonomous protocol governing when a robot should seek a donor, what donor surplus threshold triggers acceptance, or whether the donor can afford to give without stranding itself.
Witkowski, M. (2007). Energy sharing for swarms modeled on the common vampire bat. Adaptive Behavior, 15(3), 307–328. doi:10.1177/1059712307082092
WhatModels energy sharing in robot swarms on the reciprocal food-sharing behavior of vampire bats (Desmodus rotundus): robots forage for fuel, encounter each other stochastically, and share energy under a bat-inspired reciprocity policy; simulations quantify the survival benefit and the effect of group size, sharing thresholds, and defectors.
GapThe sharing policy is triggered by encounter proximity and reciprocity history without a geometric feasibility condition — a donor robot does not check whether it retains enough charge to return safely before giving. The population-survival framing also means there is no individual-level guarantee analogous to the V2V safety invariant.

Long-term autonomy and energy management in multi-robot systems

Matusiak, M., Paanajärvi, J., Appelqvist, P., Elomaa, M., Vainio, M., Ylikorpi, T., & Halme, A. (2009). A novel marsupial robot society: towards long-term autonomy. In H. Asama et al. (Eds.), Distributed Autonomous Robotic Systems 8, 523–532. Springer, Berlin, Heidelberg. doi:10.1007/978-3-642-00644-9_46
WhatIntroduces the Marsubot Society: a heterogeneous multi-robot system in which a large "parent" robot carries smaller child robots, deploys them into the field, and recovers them for recharging when their batteries are low. The system is designed for indefinitely long autonomous operation, treating energy access as the central constraint on long-term capability.
GapEnergy is managed through deployment/recovery cycles (the parent as a mobile depot) rather than peer-to-peer transfer between field robots. Long-term autonomy is achieved by structural role separation (parent vs. child) rather than by an emergent help graph — the approach is closer to the refueler-role architecture than to V2V sharing among equals.
Mulgaonkar, Y., & Kumar, V. (2014). Autonomous charging to enable long-endurance missions for small aerial robots. Proceedings of SPIE, 9083, 90831S. doi:10.1117/12.2051111
WhatDemonstrates fully autonomous multi-MAV operation over 9.5 hours through scheduled contact-based recharging at a fixed station, with automated vision-guided landing and swapping among a pool of vehicles to maintain continuous task coverage.
GapLong-endurance autonomy is achieved by scheduling and station access, not peer transfer; the energy architecture is centralized (fixed depot) and the robots do not share with each other in the field. The vision-guided docking work is relevant to precision rendezvous for V2V contact transfer but the charging decision itself is not safety-critical in the same way.
Kim, J.-H. (2019). A robot system maintained with small-scale distributed energy sources. Energies, 12(20), 3851. doi:10.3390/en12203851
WhatProposes sustaining a robot system from spatially distributed small-scale renewable energy sources (e.g. small solar panels scattered over a large area) rather than a single central depot, analyzing the energy-collection and distribution logistics when no single node can supply the full fleet.
GapEnergy sources are fixed infrastructure nodes; robots travel to sources rather than transferring energy among themselves. The distributed-source framing is relevant to the "depot need not be singular" opening in the synthesis but does not address peer transfer between field robots.
Ziouzios, D., Baras, N., Dasygenis, M., & Tsanaktsidis, C. (2025). Fair and energy-efficient charging resource allocation for heterogeneous UGV fleets. Computers, 14(11), 473. doi:10.3390/computers14110473
WhatFormulates charging-resource allocation for a heterogeneous fleet of UGVs at a large photovoltaic park as a joint fairness and energy-efficiency optimization, balancing equitable access to charging across robot types against minimizing total mission energy cost.
GapThe problem is allocation to fixed charging infrastructure, not peer transfer; robots queue for stations rather than sharing with each other. The fairness formulation is directly relevant downstream: when refueler robots are the constrained resource in a V2V swarm, the same equity considerations arise — which field robots get served first, and whether the allocation is fair across a heterogeneous fleet.

Central-place and multiple-place foraging in robot swarms

Hecker, J. P., & Moses, M. E. (2015). Beyond pheromones: evolving error-tolerant, flexible, and scalable ant-inspired robot swarms. Swarm Intelligence, 9(1), 43–70. doi:10.1007/s11721-015-0104-z
WhatThe Central-Place Foraging Algorithm (CPFA): a seed-harvester-ant-inspired swarm that explores outward from a single nest and returns resources to it, with search and recruitment parameters tuned by a genetic algorithm for error tolerance, flexibility, and scalability.
GapCoordinates search and retrieval around a single fixed nest with no energy budget — robots never run out of charge, so the central place is a collection point, not a survival constraint, and there is no peer energy transfer.
Lu, Q., Hecker, J. P., & Moses, M. E. (2018). Multiple-place swarm foraging with dynamic depots. Autonomous Robots, 42(4), 909–926. doi:10.1007/s10514-017-9693-2
WhatExtends the CPFA to multiple collection zones and to dynamic depots that relocate toward the centroid of discovered resources, cutting travel cost and congestion; explicitly motivated by polydomous ants and other animals that keep several home sites.
GapMakes the central place plural and mobile — the spatial-infrastructure analog of the refueler shells — but the depots are collection points for retrieved resources, not energy sources, so there is still no battery state, no return-to-charge reachability, and no peer-to-peer transfer.

Encounter-driven self-assembly and role switching in robot swarms

Kannapiran, S., Oikonomou, E., Chu, A., Berman, S., & Pavlic, T. P. (2025). Ant-inspired walling strategies for scalable swarm separation: reinforcement learning approaches based on finite state machines. arXiv preprint arXiv:2510.22524. arXiv:2510.22524
WhatTwo decentralized, ant-inspired controllers that keep heterogeneous swarm subgroups spatially separated while running concurrent tasks: a finite-state-machine controller whose encounter-triggered transitions latch robots into a rigid wall, and an FSM-plus-Deep-Q-Network variant that learns to optimize separation through emergent buffer zones. Modeled on the temporary walls army ants raise to keep foraging trails from interfering.
GapThe latched role is a separation wall, not an energy structure, so no battery, transfer, or depot enters. But it is the closest robotic precedent for the mechanism an emergent refueler caste would need — a local, encounter- or position-dependent trigger that converts a mobile worker into a held-station structural agent. Here that trigger would key off the help-feasibility graph, latching workers into refuelers (or into an energy-carrying "phalanx" reaching outward from the depot) where demand is highest.

Peer-to-peer wireless power transfer in mobile networks

Nikoletseas, S., Raptis, T. P., & Raptopoulos, C. (2017). Wireless charging for weighted energy balance in populations of mobile peers. Ad Hoc Networks, 60, 1–10. doi:10.1016/j.adhoc.2017.03.005
WhatDesigns distributed protocols by which battery-limited mobile peers wirelessly exchange energy with each other (no special chargers) to reach a weighted energy-balance state.
GapThe protocol is purely about energy distribution among peers — there is no motion-safety coupling, no spatial cost (α·d) for transfer, and no depot.
Madhja, A., Nikoletseas, S., Raptopoulos, C., & Tsolovos, D. (2016). Energy-aware network formation in peer-to-peer wireless power transfer. ACM MSWiM, 43–50. doi:10.1145/2988287.2989166
WhatDefines protocols for forming a P2P energy-exchange network with target energy distributions over agents.
GapNetwork formation is logical and communicational, not physical-motion-coupled — agents do not have to drive to each other to transfer.
Chakraborty, P., Parker, R., Hoque, T., Cruz, J., Du, L., Wang, S., & Bhunia, S. (2022). Addressing the range anxiety of battery electric vehicles with charging en route. Scientific Reports, 12, 5588. doi:10.1038/s41598-022-08942-2
WhatPeer-reviewed treatment of "P2C2": a cloud-coordinated peer-to-peer on-the-move EV charging system in which surplus-charge EVs share energy with low-charge EVs while in motion, supplemented by mobile charging stations.
GapOptimizes throughput and halts in an EV highway setting with centralized cloud matching; there is no formal safety invariant ("every car can still reach a station") and no notion of cooperative rendezvous-and-transfer as a depot-safety relaxation.
Daymude, J. J., Richa, A. W., & Weber, J. W. (2021). Bio-inspired energy distribution for programmable matter. 22nd International Conference on Distributed Computing and Networking (ICDCN), 86–95. doi:10.1145/3427796.3427835
WhatA self-stabilizing distributed algorithm for energy distribution in active programmable matter (the amoebot model): the system draws from a single external source reachable by at least one module and relays charge module-to-module to keep the whole collective powered, with a sharing rule loosely inspired by metabolite signaling in Bacillus subtilis biofilms.
GapThe closest sibling in spirit — one source feeding a collective through peer relay — but energy flows along a fixed lattice of adjacent modules rather than via agents crossing space, so there is no α·distance transfer cost and no return-to-depot reachability. It is the structural-graph analog of depot-plus-V2V without the spatial-mobility safety dimension added here.
Arai, F., & Satoh, Y. (2021). A power sharing modular robot with power packet technology. 23rd European Conference on Power Electronics and Applications (EPE ECCE Europe). doi:10.23919/EPE21ECCEEurope50061.2021.9570442
WhatProposes a modular robot architecture in which electric power is distributed among modules via a "power packet" bus — discrete, tagged packets of energy routed through a shared bus with arbitration logic — providing fault tolerance when any single module's power source fails.
GapModules are physically connected in a fixed assembly; power packets flow through a shared bus, not across free space between mobile robots. The discretized-energy framing is conceptually related to battery-exchange trophallaxis but the transfer medium is wired and the topology is static.
Sanada, R., Satoh, Y., & Arai, F. (2024). Power-sharing mechanism for a modular robot using power packet technology. IEEE TENCON 2024. doi:10.1109/TENCON61640.2024.10902811
WhatExtends the power packet bus architecture to multi-source modular robots, implementing a power-sharing mechanism that balances load across modules with different state-of-charge levels and demonstrates bus arbitration in a physical prototype.
GapAs with Arai & Satoh 2021, the transfer is over a physically connected bus in a static modular assembly. The contribution is in the arbitration and balancing logic for the discrete-packet modality, which does not carry over directly to the mobile-robot setting where robots must move to rendezvous before any transfer occurs.
Sarin, A., & Avestruz, A.-T. (2020). Code division multiple access wireless power transfer for energy sharing in heterogeneous robot swarms. IEEE Access, 8, 132121–132133. doi:10.1109/ACCESS.2020.3010202
WhatProposes CDMA-modulated wireless power transfer enabling multiple robots in a heterogeneous swarm to exchange energy simultaneously while in motion — without a centralized controller — by encoding each transfer channel with a distinct spreading code; hardware demonstration with four robots achieving selective, scalable peer energy transfer.
GapThe contribution is in the physical-layer multi-access protocol, not the decision layer: which robot should donate to which, when, and whether the donor can afford to give without sacrificing its own return capability are outside the scope. Provides the transfer infrastructure on which a V2V safety filter could operate.

Energy/charging-aware EV routing and V2V power transfer

Alvaro-Hermana, R., Fraile-Ardanuy, J., Zufiria, P. J., Knapen, L., & Janssens, D. (2016). Peer-to-peer energy trading with electric vehicles. IEEE Intelligent Transportation Systems Magazine, 8(3), 33–44. doi:10.1109/MITS.2016.2573178
WhatActivity-based model and aggregator that lets EVs trade surplus energy peer-to-peer when parked; demonstrates substantial energy-cost reductions for participating drivers.
GapTransactions are stationary (parked) and purely economic; there is no rendezvous logistics and no analogue of the α·distance feasibility constraint between mobile prosumers.
Xu, Y., Alderete Peralta, A., & Balta-Ozkan, N. (2024). Vehicle-to-vehicle energy trading framework: a systematic literature review. Sustainability, 16(12), 5020. doi:10.3390/su16125020
WhatSystematic literature review of 61 articles categorizing the V2V energy-trading literature by mechanism, infrastructure, and economic model.
GapThe surveyed literature centres on grid-scale economics and protocol design; essentially no work treats V2V transfer as a closed-loop safety filter for a mobile autonomous swarm.
Zhang, Y., Chen, Y., Xie, R., Lin, G., Chen, X., & Cheng, X. (2025). A bidirectional AGV wireless charging system capable of vehicle-to-vehicle wireless charging with shared bridge arms. IEEE Journal of Emerging and Selected Topics in Power Electronics. doi:10.1109/JESTPE.2025.3593595
WhatProposes a multistage, multimode V2V wireless power transfer system for industrial AGV fleets in which one AGV can supply energy to another via shared bridge-arm hardware; four operating modes (one-to-one, one-to-two, two-to-one, relay) are demonstrated on a prototype achieving 94.7% efficiency at 2400 W.
GapThe hardware operates in controlled industrial settings with known AGV positions and pre-planned transfer modes; there is no autonomous decision protocol for when to initiate transfer, no mobility cost on the transfer decision, and no safety invariant on donor state-of-charge. Provides directly relevant circuit-level precedent for robot-to-robot wireless transfer.
Alghawi, M., & Mounsef, J. (2024). Overview of vehicle-to-vehicle energy sharing infrastructure. IEEE Access, 12, 54567–54589. doi:10.1109/ACCESS.2024.3388088
WhatSurvey of V2V energy-sharing architectures in the automotive domain, covering hardware topologies, communication protocols, grid interaction, and economic incentive structures for peer energy exchange between electric vehicles.
GapAutomotive V2V sharing is stationary or low-speed and economically motivated; the survey does not address mission-critical mobile robots where transfer decisions are safety constraints rather than economic choices, and the distance-dependent feasibility cost of travelling to a donor is absent.
Chatterjee, P., Majumder, P., & Das, S. K. (2025). V2VDisCS: Vehicle to vehicle distributed charge sharing in intelligent transportation systems. IEEE Transactions on Intelligent Transportation Systems, 26(4), 4960–4974. doi:10.1109/TITS.2025.3534025
WhatA distributed peer charge-sharing protocol for EVs in a transportation network: vehicles broadcast state-of-charge, negotiate transfers via a decentralized matching protocol, and exchange energy through opportunistic encounters, with analysis of fairness, convergence, and throughput.
GapThe distributed matching logic and fairness analysis are the most conceptually transferable elements to the robot-swarm setting. The gap is that EVs in a road network have fixed routes and no depot-return safety constraint; the protocol does not encode a donor feasibility condition equivalent to the V2V one-hop filter.

Connectivity-preserving multi-agent control

Olfati-Saber, R. (2006). Flocking for multi-agent dynamic systems: algorithms and theory. IEEE Transactions on Automatic Control, 51(3), 401–420. doi:10.1109/TAC.2005.864190
WhatThe reference framework: three algorithms (free-flocking, obstacle-avoiding, leader-tracking) defined on a proximity graph, with provable cohesion and migration without leaders.
GapThe safety invariant of interest is geometric (collision-free flocking), not resource-based; potential fields penalize disconnection, but no energy budget enters the analysis.
Jadbabaie, A., Lin, J., & Morse, A. S. (2003). Coordination of groups of mobile autonomous agents using nearest-neighbor rules. IEEE Transactions on Automatic Control, 48(6), 988–1001. doi:10.1109/TAC.2003.812781
WhatFirst rigorous proof that heading consensus emerges in Vicsek-type mobile agents under joint connectivity of the neighbor-graph sequence.
GapConnectivity is a sufficient condition for convergence, not a controlled invariant; the shared resource is information, not energy.
Zavlanos, M. M., Egerstedt, M. B., & Pappas, G. J. (2011). Graph-theoretic connectivity control of mobile robot networks. Proceedings of the IEEE, 99(9), 1525–1540. doi:10.1109/JPROC.2011.2157884
WhatSurvey-style synthesis of centralized and distributed algorithms that keep the algebraic (Fiedler) connectivity of a proximity graph above zero throughout motion.
GapMaintains topological connectivity, not energy reachability — there is no asymmetric edge condition like "j can help i" depending on a third location (depot).
Sabattini, L., Chopra, N., & Secchi, C. (2013). Decentralized connectivity maintenance for cooperative control of mobile robotic systems. International Journal of Robotics Research, 32(12), 1411–1423. doi:10.1177/0278364913499085
WhatDistributed control law that estimates and bounds the second-smallest Laplacian eigenvalue to preserve global connectivity online.
GapThe invariant is purely graph-spectral (λ₂ > 0); no relation to robot energy or to a privileged depot vertex.
Ong, P., Capelli, B., Sabattini, L., & Cortes, J. (2023). Nonsmooth control barrier function design of continuous constraints for network connectivity maintenance. Automatica, 156, 111209. doi:10.1016/j.automatica.2023.111209
WhatEncodes connectivity-maintenance as a continuous nonsmooth CBF constraint compatible with arbitrary nominal multi-robot controllers.
GapClosest methodological precedent — connects CBF and connectivity literatures — but the safe set is again topological connectivity, not energy reachability through cooperative peer-to-peer transfer.
Alamdar, K. G., & Petrović, T. (2025). Decentralized battery-aware connectivity maintenance for multi-UAV missions. IEEE Access, 13, 83738–83751. doi:10.1109/ACCESS.2025.3569206
WhatExtends a classical algebraic-connectivity maintenance controller for multi-UAV teams by folding each agent's battery state into the connectivity control input, so that depleting agents bias the team's motion toward preserving the communication graph.
GapThe controlled invariant is still communication connectivity, with battery only weighting it — energy is a constraint on staying connected, not a transferable resource, and there is no depot-return reachability or peer-to-peer donation.

MANET / VANET network-invariant maintenance through motion

Spanos, D. P., & Murray, R. M. (2004). Robust connectivity of networked vehicles. 43rd IEEE Conference on Decision and Control, 3, 2893–2898. doi:10.1109/CDC.2004.1428904
WhatEarliest formal treatment of "vehicle motion must preserve a graph-theoretic communication property" — uses a robust-connectivity measure and gradient flows over vehicle positions.
GapThe graph property is wireless link existence, not energy-help-feasibility; vehicles have no energy budget.
Konak, A. (2017). A distributed multi-agent algorithm for topology control in mobile ad-hoc networks. 18th International Conference on Advanced Robotics (ICAR), 244–249. doi:10.1109/ICAR.2017.8023525
WhatA decentralized algorithm in which autonomous agent-nodes reposition themselves to keep a mobile ad-hoc network connected, proactively augmenting the topology to forestall partitions without any central coordinator.
GapThe invariant shepherded through motion is wireless-link connectivity; there is no energy state, transfer, or depot — the agents steer the communication graph rather than an energy budget.

Networked / distributed CBFs and graph-level safety

Zhang, S., So, O., Garg, K., & Fan, C. (2025). GCBF+: a neural graph control-barrier-function framework for distributed safe multi-agent control. IEEE Transactions on Robotics, 41, 1533–1552. doi:10.1109/TRO.2025.3530348 · arXiv:2401.14554
WhatIntroduces graph CBFs parameterized by GNNs; trained with 8 agents and scales to over 1000-agent systems with hardware validation on Crazyflie swarms.
GapGraph CBFs are learned for collision avoidance — the graph structure is the proximity graph and the invariant is geometric, not energetic.
Mestres, P., Nieto-Granda, C., & Cortes, J. (2024). Distributed safe navigation of multi-agent systems using control-barrier-function-based controllers. IEEE Robotics and Automation Letters, 9(7), 6760–6767. doi:10.1109/LRA.2024.3414268 · arXiv:2402.06195
WhatDistributed synthesis where each agent solves a local CBF-QP that converges to the centralized solution of a state-dependent network optimization with safety constraints.
GapNetworked safety here means collision avoidance under formation-maintenance; no resource-coupling between agents.
Li, J., Wang, C., Li, B., Wei, Q., Song, L., & Huang, D. (2025). Maintaining connectivity in coverage control: a distributed algebraic connectivity estimation approach using control barrier functions. Aerospace Systems, 1–11. doi:10.1007/s42401-025-00424-3
WhatDistributed coverage control that preserves persistent connectivity by treating algebraic connectivity (the Fiedler value of the communication Laplacian) as a control barrier function, holding it positive while a Voronoi partition optimizes area coverage.
GapThe CBF-protected safe set is topological (λ₂ > 0); the conserved resource is the communication graph, not energy, and no depot or return-home reachability appears — the same graph-CBF machinery this work redirects onto an energy invariant.

Synthesis of gaps. Despite the natural precedent above, the engineering literature has yet to formalize an analogous safety framework. Across the engineering categories above, no surveyed work simultaneously provides (a) every robot performing both the primary task and donating energy, (b) safety as a closure property of the energy-aware proximity graph rather than of individual agents, and (c) the CBF / forward-invariance machinery lifted from a single-agent state variable to a network invariant. That conjunction is the gap a contribution should aim to fill.

The Ngo–Schiøler energy-trophallaxis line (2006–2008) is the closest prior body of work in the mobile-robot-swarm setting: bio-inspired peer energy transfer among mobile robots, population-level energy-autonomy conditions, and a physical battery-exchange platform (the CISSbot) — extended by the contemporaneous Witkowski 2007 vampire-bat simulation and the Kim & Moon 2015 vision-guided wireless docking work. What none of it provides is a per-robot safety condition — distributable surplus is tracked at the population level, not guaranteed to each individual as a return-to-depot invariant. Daymude et al. 2021 is the closest structural sibling: a self-stabilizing distributed algorithm in which one source feeds a collective through module-to-module relay — the graph-relay analog of depot-plus-V2V — but energy flows along a fixed lattice with no spatial mobility or distance-dependent transfer cost. Sindi et al. 2008 establish a cost-function analysis for a fixed energy-relay chain serving a leading robot, the closest precursor to the chain-safety geometry formalized here. On the control side, Fouad & Beltrame 2022 is the closest existing work applying CBFs directly to per-robot battery sufficiency in a multi-robot system, but robots queue for a fixed station rather than transferring peer-to-peer in the field. The closest methodological precedent for lifting that to a network invariant is Ong–Capelli–Sabattini–Cortes 2023 (CBF on a network connectivity property); the closest precedent for the hybrid, rendezvous-triggered character of V2V is the Xiao–Belta–Cassandras event-triggered CBF line; the closest physical precedent for P2P energy transfer in motion is Chakraborty et al. 2022 and the Sarin & Avestruz 2020 CDMA-WPT hardware, with the Ngo CISSbot as the earlier discrete-exchange analog. Combining these — per-robot reachability invariant, energy as the conserved network resource, depot as the privileged sink — is the territory.

Four further openings sit just past it:

  • Relocatable depot. Central-place foraging theory, the multiple-/dynamic-depot robot algorithms, and the Marsubot Society (Matusiak et al. 2009, where a parent robot acts as a mobile depot) all point toward replicating or relocating the privileged sink — which is what the refueler shells begin to do.
  • Emergent refueler role. The refueler role is assigned in advance here, whereas the self-assembly and encounter-driven walling literatures suggest it could instead emerge — a worker latching into a held-station refueler, or into an energy-carrying phalanx reaching outward from the depot, from nothing more than its position in the help-feasibility graph and the traffic through it.
  • Layered safety. The safety invariant need not be absolute: precision rescue in Cataglyphis and patient-initiated retrieval in Megaponera treat stranded individuals as recoverable failures rather than invariant breaches, pointing toward a framing in which the chain-safe V2V condition is the strong-safety mode and a separate rescue layer handles its failures within bounded time.
  • Fairness in refueler allocation. Once refueler robots are the constrained resource rather than a fixed charging station, the question of equitable service across a heterogeneous fleet becomes live. Ziouzios et al. 2025 treat this for fixed-station UGV fleets; the analogous problem for a V2V swarm — which field robot gets the next available refueler, and whether the allocation is fair across robot types with different energy demands — is open.

None of these are represented in the simulator, and none need to be, but all are where the literature points.

© 2026 Theodore P. Pavlic · MIT License