Sthenno
sthenno
AI & ML interests
To contact me: [email protected]
Recent Activity
reacted to sometimesanotion's post with 👍 about 6 hours ago
The capabilities of the new Qwen 3 models are fascinating, and I am watching that space!
My experience, however, is that context management is vastly more important with them. If you use a client with a typical session log with rolling compression, a Qwen 3 model will start to generate the same messages over and over. I don't think that detracts from them. They're optimized for a more advanced MCP environment. I honestly think the 8B is optimal for home use, given proper RAG/CAG.
In typical session chats, Lamarck and Chocolatine are still my daily drivers. I worked hard to give Lamarck v0.7 a sprinkling of CoT from both DRT and Deepseek R1. While those models have been surpassed on the leaderboards, in practice I still really enjoy their output.
My projects are focusing on application and context management, because that's where the payoff in improved quality is right now. But should a mix of finetunes be needed to get just the right blend, my recipes are standing by.
new activity about 6 hours ago
sthenno-com/miscii-14b-0218: Do I need to purchase an API to use this?
liked a model 5 days ago
shuttleai/shuttle-3.5
sthenno's activity
Do I need to purchase an API to use this?
1
#3 opened 3 days ago by Andy2390

Improve language tag
1
#5 opened 11 days ago by lbourdois

Fusion vs. SLERP?
1
10
#2 opened 2 months ago by sometimesanotion

Adding Evaluation Results
#2 opened 2 months ago by sthenno

Tends to refuse to generate xx content
1
4
#1 opened 2 months ago by Cran-May
Adding Evaluation Results
#2 opened 3 months ago by sthenno

Adding Evaluation Results
#4 opened 3 months ago by sthenno

Adding Evaluation Results
#4 opened 3 months ago by sthenno

This merge makes sense
1
4
#1 opened 3 months ago by sometimesanotion

Adding Evaluation Results
#1 opened 3 months ago by sthenno

Adding Evaluation Results
#1 opened 3 months ago by sthenno

Adding Evaluation Results
#2 opened 3 months ago by sthenno

Adding Evaluation Results
#1 opened 3 months ago by sthenno

Extra SLERP parameters
1
7
#1 opened 4 months ago by sometimesanotion

Nuslerp parameters?
1
2
#1 opened 3 months ago by sometimesanotion

Quant Request
2
#1 opened 5 months ago by mt114514
We sincerely invite your participation, model fine-tuners.
1
3
#3 opened 4 months ago by win10

Adding Evaluation Results
2
#1 opened 4 months ago by sthenno

Quant Request
1
1
#2 opened 5 months ago by Cran-May
Adding Evaluation Results
#1 opened 6 months ago by leaderboard-pr-bot
