Glider soaring via reinforcement learning in the field - Institut Pasteur
Journal article, Nature, Year: 2018

Glider soaring via reinforcement learning in the field

Abstract

Soaring birds often rely on ascending thermal plumes (thermals) in the atmosphere as they search for prey or migrate across large distances. The landscape of convective currents is rugged and shifts on timescales of a few minutes as thermals constantly form, disintegrate or are transported away by the wind. How soaring birds find and navigate thermals within this complex landscape is unknown. Reinforcement learning provides an appropriate framework in which to identify an effective navigational strategy as a sequence of decisions made in response to environmental cues. Here we use reinforcement learning to train a glider in the field to navigate atmospheric thermals autonomously. We equipped a glider of two-metre wingspan with a flight controller that precisely controlled the bank angle and pitch, modulating these at intervals with the aim of gaining as much lift as possible. A navigational strategy was determined solely from the glider’s pooled experiences, collected over several days in the field. The strategy relies on on-board methods to accurately estimate the local vertical wind accelerations and the roll-wise torques on the glider, which serve as navigational cues. We establish the validity of our learned flight policy through field experiments, numerical simulations and estimates of the noise in measurements caused by atmospheric turbulence. Our results highlight the role of vertical wind accelerations and roll-wise torques as effective mechanosensory cues for soaring birds and provide a navigational strategy that is directly applicable to the development of autonomous soaring vehicles.
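The decision loop described in the abstract (sense two cues, adjust the bank angle at intervals, learn from pooled experience) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' method: the one-dimensional Gaussian "thermal", the lateral drift proportional to bank angle, the five bank-angle levels, the sign-based discretization of the cues, and the choice of tabular Q-learning with an epsilon-greedy policy. All function names are hypothetical.

```python
import math
import random
from collections import defaultdict

BANK_ANGLES = (-30, -15, 0, 15, 30)  # degrees; assumed discretization
ACTIONS = (-1, 0, 1)                 # lower, keep, or raise the bank angle by one level


def lift(x):
    """Toy vertical wind speed: a Gaussian thermal centred at x = 0 (assumption)."""
    return math.exp(-x * x / 8.0)


def sign(v, eps=1e-9):
    """Return -1, 0 or 1 depending on the sign of v."""
    return (v > eps) - (v < -eps)


def cues(x, prev_lift):
    """Two discretized cues: change in lift (stand-in for vertical wind
    acceleration) and lift difference across the wings (stand-in for roll torque)."""
    accel = sign(lift(x) - prev_lift)
    torque = sign(lift(x + 0.5) - lift(x - 0.5))
    return accel, torque


def step(x, bank_idx, action):
    """Apply an action. In this toy model the lateral drift per step is
    proportional to the bank angle; positions are clipped to [-10, 10]."""
    bank_idx = min(max(bank_idx + action, 0), len(BANK_ANGLES) - 1)
    x = x + BANK_ANGLES[bank_idx] / 15.0
    x = min(max(x, -10), 10)
    return x, bank_idx


def td_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning update on the table q."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


def train(episodes=200, horizon=50, epsilon=0.1, seed=0):
    """Pool experience over many short toy flights and return the Q-table."""
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        x, bank_idx = rng.uniform(-6, 6), 2  # start level, at a random offset
        state = cues(x, lift(x)) + (bank_idx,)
        for _ in range(horizon):
            if rng.random() < epsilon:       # explore
                action = rng.choice(ACTIONS)
            else:                            # exploit the current estimate
                action = max(ACTIONS, key=lambda a: q[(state, a)])
            new_x, bank_idx = step(x, bank_idx, action)
            reward = lift(new_x)             # reward: lift at the new position
            next_state = cues(new_x, lift(x)) + (bank_idx,)
            td_update(q, state, action, reward, next_state)
            x, state = new_x, next_state
    return q
```

After `q = train()`, the greedy choice for a state such as `(1, -1, 2)` can be read off with `max(ACTIONS, key=lambda a: q[((1, -1, 2), a)])`. The sketch only shows the shape of a cue-to-action table; it makes no claim about real glider dynamics or about the policy reported in the article.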
Main file
Reddy2018_AcceptedVersion.pdf (7.72 MB)
Origin: Files produced by the author(s)

Dates and versions

pasteur-02914599, version 1 (12-08-2020)

Identifiers

HAL Id: pasteur-02914599
DOI: 10.1038/s41586-018-0533-0

Cite

Gautam Reddy, Jérôme Wong Ng, Antonio Celani, Terrence J Sejnowski, Massimo Vergassola. Glider soaring via reinforcement learning in the field. Nature, 2018, 562 (7726), pp.236-239. ⟨10.1038/s41586-018-0533-0⟩. ⟨pasteur-02914599⟩

Collections

PASTEUR TDS-MACS
152 views
675 downloads
