---
title: README
emoji: 💻
colorFrom: gray
colorTo: yellow
sdk: static
pinned: false
---

This is the official space for the paper "Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation".

[![Conference](https://img.shields.io/badge/NAACL-2025-4b44ce)](https://2025.naacl.org/)
[![arXiv](https://img.shields.io/badge/arXiv-paper-b31b1b.svg)](https://arxiv.org/abs/2504.17025v1)
[![License: CC BY-NC-SA 4.0](https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg)](https://creativecommons.org/licenses/by-nc-sa/4.0/)
[![Hugging Face Collection](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Collection_Mistral-FCD21D)](https://huggingface.co/collections/SemanticAlignment/mistral-7b-v01-adapted-679243206cec8a21f75435dd)
[![Hugging Face Collection](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Collection_Llama-FCD21D)](https://huggingface.co/collections/SemanticAlignment/llama-31-adapted-67924314d8957c78a3e7bcaf)

This space is shared among the three Italian institutions that led the work:

- [SapienzaNLP](https://nlp.uniroma1.it/) -- Sapienza University of Rome
- [Istituto di Scienza e Tecnologie dell'Informazione "A. Faedo"](https://www.isti.cnr.it/it/) -- CNR Pisa
- [Istituto di Linguistica Computazionale "A. Zampolli"](https://www.ilc.cnr.it/) -- CNR Pisa