Decision-Making with Non-Markovian Rewards: Multi-Agent Learning and Applications to Medical Diagnostics