---
title: README
emoji: 💻
colorFrom: gray
colorTo: yellow
sdk: static
pinned: false
---

<div align="center">

<img src="https://github.com/Andrew-Wyn/images/blob/master/sava/italian_adapt-img.jpg?raw=true" width="400" style="border-radius:10%"/>

<br>

This is the official space for the paper: "Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation"

[![Conference](https://img.shields.io/badge/NAACL-2025-4b44ce)](https://2025.naacl.org/)
[![arXiv](https://img.shields.io/badge/arXiv-paper-b31b1b.svg)](https://arxiv.org/abs/2504.17025v1)
[![License: CC BY-NC-SA 4.0](https://img.shields.io/badge/License-CC%20BY--NC--SA%204.0-lightgrey.svg)](https://creativecommons.org/licenses/by-nc-sa/4.0/)
[![Hugging Face Collection](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Collection_Mistral-FCD21D)](https://huggingface.co/collections/SemanticAlignment/mistral-7b-v01-adapted-679243206cec8a21f75435dd)
[![Hugging Face Collection](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Collection_Llama-FCD21D)](https://huggingface.co/collections/SemanticAlignment/llama-31-adapted-67924314d8957c78a3e7bcaf)
</div>

This space is shared by the three Italian institutions that led the work:

- Sapienza University of Rome
- Istituto di Scienza e Tecnologie dell'Informazione "A. Faedo" -- CNR Pisa
- Istituto di Linguistica Computazionale "A. Zampolli" -- CNR Pisa