Databricks

Fine-tuning Llama 3.1 with Long Sequences

2024-9-19

We are excited to announce that Mosaic AI Model Training now supports the full context length of 131K tokens when fine-tuning the Meta...