Memory for Prediction: A Transformer-based Integrative Account of Sentence Processing