Mistral - Large Scale Language Modeling Made Easy

_images/mistral_components.png

Mistral combines Hugging Face 🤗, DeepSpeed, and Weights & Biases , with additional tools, helpful scripts, and documentation to facilitate:

  • training large models with multiple GPU’s and nodes

  • incorporating new pre-training datasets

  • dataset preprocessing

  • monitoring and logging of model training

  • performing evaluation and measuring bias