Welcome to Journal of University of Chinese Academy of Sciences,Today is

Journal of University of Chinese Academy of Sciences ›› 2022, Vol. 39 ›› Issue (4): 543-550.DOI: 10.7523/j.ucas.2020.0045

• Research Articles • Previous Articles     Next Articles

Downlink power allocation scheme for LEO satellites based on deep reinforcement learning

ZHANG Huaming1,2, LI Qiang1   

  1. 1. Shanghai Institute of Microsystem & Information Technology, Chinese Academy of Sciences, Shanghai 201800, China;
    2. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received:2020-06-22 Revised:2020-09-02 Online:2022-07-15

Abstract: Most of the current satellite resource allocation schemes are designed for geosynchronous orbit satellites. In view of the highly dynamic characteristics and limitation of frequency and power resources in LEO satellites, a power allocation algorithm based on deep reinforcement learning is proposed. First of all, we model the LEO satellite power allocation scenario, and introduce a time slot division scheme to simplify the dynamic characteristics model of the LEO satellite. Then a power allocation policy is proposed based on deep reinforcement learning algorithm which can reduce the co-channel interference by adjusting the power value of the subcarriers in each beam of a single LEO satellite, thus improving the spectral efficiency of the LEO satellite. Simulation results illustrate that the proposed algorithm can converge and reach a stable state in a relatively short time. Under the condition of constant total power, this scheme can effectively improve the throughput of a single LEO satellite. The spectral efficiency based on deep reinforcement learning algorithm is significantly higher than that of water-filling algorithm and Q-learning algorithm.

Key words: LEO satellite, spectrum efficiency, power allocation, deep reinforcement learning

CLC Number: