Learning Humanoid Robot Running Skills through Proximal Policy Optimization | IEEE Conference Publication | IEEE Xplore