Abstract
This study explores LLM-guided reward shaping for multi-agent formation control with collision avoidance. The LLMs iteratively generate and refine reward functions from high-level task objectives and observed failure modes. Evaluation across 600 simulated and real-world episodes demonstrates that the approach reaches target formations with 35.2% fewer training iterations and improves the collision-free success rate by 28.4% compared with manually designed rewards.
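A minimal sketch of the generate-train-refine loop the abstract describes, assuming injected callables for the LLM query and the RL training/evaluation step; the function names, prompt format, and stopping rule are illustrative placeholders, not the paper's implementation.

```python
from typing import Callable, Optional

def refine_reward(
    llm: Callable[[str], str],                      # assumed LLM interface: prompt -> reward code
    train_and_evaluate: Callable[[str], Optional[str]],  # assumed RL hook: reward code -> failure summary
    task_spec: str,
    max_iterations: int = 5,
) -> str:
    """Iteratively generate and refine a reward function with an LLM.

    `train_and_evaluate` trains agents with the candidate reward and
    returns a textual failure summary, or None once episodes complete
    collision-free. Both callables are hypothetical interfaces.
    """
    reward_code = ""
    failures = "none yet"
    for _ in range(max_iterations):
        prompt = (
            f"Task: {task_spec}\n"
            f"Previous reward function:\n{reward_code or '(none)'}\n"
            f"Observed failures: {failures}\n"
            "Write a Python function reward(state, action) -> float that "
            "rewards reaching the target formation and penalizes collisions."
        )
        reward_code = llm(prompt)                   # generate or refine the reward code
        failures = train_and_evaluate(reward_code)  # observe remaining failure modes
        if failures is None:                        # stop once runs are collision-free
            break
    return reward_code
```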

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Copyright (c) 2026 Wei Jie Tan, Xiu Fen Li, Jun Hao Ng