Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
15
Nyaribari Reuben
foscraft
Follow
ims-ke's profile picture
Iammcqwory's profile picture
2 followers
ยท
43 following
foscraft
foscraft
AI & ML interests
LLMs, VLMs , Vision
Recent Activity
liked
a model
about 1 month ago
mistralai/Mistral-7B-Instruct-v0.3
liked
a model
about 1 month ago
mistralai/Mistral-7B-v0.1
reacted
to
DavidGF
's
post
with ๐ฅ
6 months ago
๐ Celebrating One Year of #SauerkrautLM with Two Groundbreaking Releases! We're thrilled to announce the release of SauerkrautLM-v2-14b in two specialized versions: https://huggingface.co/VAGOsolutions/SauerkrautLM-v2-14b-SFT and https://huggingface.co/VAGOsolutions/SauerkrautLM-v2-14b-DPO. Built on the robust Qwen2.5-14B foundation, these models represent a significant leap forward in multilingual AI capabilities. ๐ฌ Technical Breakthroughs: ๐ Innovative three-phase Fine-Tuning approach ๐ Two-step Spectrum SFT + one-step Spectrum DPO optimization phase for enhanced performance ๐ Balance of German and English language capabilities ๐ Advanced function calling - almost on par with Claude-3.5-Sonnet-20240620 ๐ฉ๐ช German Language Excellence: What sets this release apart is our unique achievement in simultaneously improving both German and English capabilities. Through our specialized training approach with over 1.2B tokens across two phases, we've managed to: ๐ Enhance German language understanding and generation (SFT Version > DPO Version) ๐ Maintain authentic German linguistic nuances ๐ Improve cross-lingual capabilities ๐ Preserve cultural context awareness ๐ Training Innovation: Our three-phase approach targeted specific layer percentages (15%, 20% and 25%) with carefully curated datasets, including: ๐ Mathematics-focused content (proprietary classifier-selected) ๐ High-quality German training data ๐ Specialized function calling datasets ๐ Premium multilingual content ๐ Community Contribution: We're also releasing two new datasets in a few days: 1๏ธโฃ SauerkrautLM-Fermented-GER-DPO: 3,300 high-quality German training samples 2๏ธโฃ SauerkrautLM-Fermented-Irrelevance-GER-DPO: 2,000 specialized samples for optimized function call irrelevance handling Thank you to our incredible community and partners who have supported us throughout this journey. Here's to another year of AI innovation!ย ๐
View all activity
Organizations
foscraft
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 models
about 1 month ago
mistralai/Mistral-7B-Instruct-v0.3
Text Generation
โข
Updated
Aug 21, 2024
โข
783k
โข
โข
1.68k
mistralai/Mistral-7B-v0.1
Text Generation
โข
Updated
Jul 24, 2024
โข
546k
โข
3.78k
liked
a model
8 months ago
openai-community/gpt2
Text Generation
โข
Updated
Feb 19, 2024
โข
11.9M
โข
2.72k
liked
4 models
9 months ago
tiiuae/falcon-rw-1b
Text Generation
โข
Updated
Jul 12, 2023
โข
84.7k
โข
108
TheBloke/wizard-vicuna-13B-GGML
Updated
Jun 9, 2023
โข
143
TheBloke/orca_mini_3B-GGML
Updated
Jun 25, 2023
โข
60
TheBloke/Llama-2-7B-Chat-GGUF
Text Generation
โข
Updated
Oct 14, 2023
โข
92.2k
โข
469
liked
a model
10 months ago
AdamCodd/donut-receipts-extract
Image-to-Text
โข
Updated
Jan 11
โข
20
โข
34
liked
a Space
10 months ago
Runtime error
4
4
Donut Base Finetuned Kuzushiji
๐ก
liked
a dataset
10 months ago
naver-clova-ix/cord-v1
Viewer
โข
Updated
Jul 14, 2022
โข
1k
โข
365
โข
11
liked
2 models
10 months ago
naver-clova-ix/donut-base
Image-to-Text
โข
Updated
Aug 13, 2022
โข
59.7k
โข
209
microsoft/DialoGPT-medium
Text Generation
โข
Updated
Feb 29, 2024
โข
211k
โข
370
liked
a dataset
about 1 year ago
HuggingFaceFW/fineweb
Viewer
โข
Updated
Jan 31
โข
25B
โข
854k
โข
2.14k
liked
a model
over 1 year ago
spacy/en_core_web_trf
Token Classification
โข
Updated
Jun 13, 2024
โข
195
โข
47
liked
a model
about 2 years ago
databricks/dolly-v2-12b
Text Generation
โข
Updated
Jun 30, 2023
โข
2.3k
โข
1.95k