Notations in reinforcement learning