Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
TensorFlow and Flax checkpoints are not affected, and can be loaded within PyTorch architectures using the from_tf and from_flax kwargs for the from_pretrained method to circumvent this issue.