Batch (Offline) Reinforcement Learning