Using max_position_embeddings instead of max_sequence_length to standardise with HF 9894fa3 verified lgcharpe commited on Mar 6