Reinforcement learning based robust control algorithms for coherent pulse stacking

Abulikemu Abuduweili; Jie Wang; Bowei Yang; Aimin Wang; Zhigang Zhang

doi:10.1364/OE.426906

Opt Express. 2021 Aug 2;29(16):26068-26081. doi: 10.1364/OE.426906.

PMID: 34614920

Abstract

For fast and robust control of the delay lines in coherent pulse stacking, we combined stochastic parallel gradient descent with momentum (SPGDM) and the soft actor-critic (SAC) algorithm into a single algorithm, SAC-SPGDM. Simulations show that, for 7-stage pulse stacking, the algorithm finds the optimal delay-line positions within 25 steps, so that all 128 pulses are coherently stacked.
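
To make the SPGDM component concrete, the following is a minimal sketch of a generic two-sided stochastic parallel gradient descent update with a heavy-ball momentum term, applied to a toy objective. It is an illustration under stated assumptions, not the authors' implementation: the function names, the cosine-based "efficiency" objective, and the values of `gain`, `beta`, and `sigma` are all hypothetical, and the SAC part of SAC-SPGDM is not shown. The toy problem is also not expected to reproduce the 25-step figure reported above.

```python
import math
import random


def toy_stacking_efficiency(delays, targets):
    """Hypothetical stand-in for stacking efficiency: 1.0 when every delay matches its target."""
    return sum(math.cos(d - t) for d, t in zip(delays, targets)) / len(delays)


def spgd_momentum(objective, start, steps=400, gain=10.0, beta=0.5, sigma=0.1, seed=0):
    """Maximize `objective` with two-sided SPGD plus a heavy-ball momentum term."""
    rng = random.Random(seed)
    u = list(start)
    velocity = [0.0] * len(u)
    history = [objective(u)]
    for _ in range(steps):
        # Perturb all control variables in parallel by a random +/- sigma.
        delta = [sigma * rng.choice((-1.0, 1.0)) for _ in u]
        j_plus = objective([ui + di for ui, di in zip(u, delta)])
        j_minus = objective([ui - di for ui, di in zip(u, delta)])
        dj = j_plus - j_minus
        # dj * delta is a stochastic gradient estimate; momentum accumulates it.
        velocity = [beta * vi + gain * dj * di for vi, di in zip(velocity, delta)]
        u = [ui + vi for ui, vi in zip(u, velocity)]
        history.append(objective(u))
    return u, history


# Assumed toy setup: 7 delay stages with arbitrary target positions (radians of phase).
target_rng = random.Random(1)
targets = [target_rng.uniform(-1.5, 1.5) for _ in range(7)]
objective = lambda delays: toy_stacking_efficiency(delays, targets)
final_delays, history = spgd_momentum(objective, [0.0] * 7)
print(f"efficiency: {history[0]:.3f} -> {history[-1]:.3f}")
```

Each iteration needs only two evaluations of the objective regardless of the number of delay lines, which is the usual reason SPGD-type methods are attractive for multi-stage stacking control.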