UniRef50 is built by clustering UniRef90 seed sequences that have at least 50% sequence identity to, and 80% overlap with, the longest sequence in the
khairi abidi
khairi
AI & ML interests
Language Modeling, Protein Language Modeling, Protein Annotation
Recent Activity
updated
a model
about 5 hours ago
khairi/SmolLM2-135M-Instruct
updated
a dataset
3 days ago
khairi/swissprot-instructions
published
a dataset
3 days ago
khairi/swissprot-instructions
Organizations
Collections
3
models
35

khairi/SmolLM2-135M-Instruct
Text Generation
•
Updated
•
8

khairi/Kaggle-Model-Instruct
Updated

khairi/Llama-3.2-1B-Instruct
Updated

khairi/GraphFLAN-Small
Text2Text Generation
•
Updated
•
12

khairi/Llama-3.2-1B-Instruct-ProtLang
Updated

khairi/SmolLM2-360M
Updated

khairi/SmolLM2-135M
Updated

khairi/Gemma3-4B
Updated

khairi/Llama-3.2-1B
Updated

khairi/protein-t5
Text2Text Generation
•
Updated
•
12
datasets
34
khairi/swissprot-instructions
Viewer
•
Updated
•
1.26M
•
47
khairi/flan-link-prediction
Viewer
•
Updated
•
46.3k
•
30
khairi/protein-function-annotation
Viewer
•
Updated
•
400k
•
33
khairi/ptlm-dataset
Viewer
•
Updated
•
606k
•
13
khairi/ptlm-tiny-dataset
Viewer
•
Updated
•
20.5k
•
29
khairi/uniref50-10
Viewer
•
Updated
•
3M
•
34
khairi/uniref50-9
Viewer
•
Updated
•
2.68M
•
18
khairi/uniref50-8
Viewer
•
Updated
•
3.4M
•
18
khairi/uniref50-7
Viewer
•
Updated
•
3.86M
•
23
khairi/uniref50-6
Viewer
•
Updated
•
14.3M
•
17