A Reinforcement Learning-Based Scheduler For Minimizing Casualties Of A Military Drone Swarm