Using max_position_embeddings instead of max_sequence_length to standardise with HF 6595bef verified lgcharpe commited on Mar 6