Commit History

Update README.md
371669a
verified

Luigi committed on

update readme
076c1f2

Luigi committed on

add Qwen2.5-Omni-3B & MiMo-7B-RL
6a4537b

Luigi committed on

disable models unrunnable on hf spaces
7c5f318

Luigi committed on

switch Phi-4-mini-Instruct from unsloth to microsoft
23b3848

Luigi committed on

extend max tokens
d730ffe

Luigi committed on

adjust layout
4911925

Luigi committed on

add "Granite-4.0-Tiny-Preview" model
09d9700

Luigi committed on

add phi-4 reasoning and phi-4-mini reasoning
68e6569

Luigi committed on

bugfix: set device to xpu by mistake
1242438

Luigi committed on

add all qwen3 variants
2882063

Luigi committed on

user can define search timeout
e2ee907

Luigi committed on

give 5 seconds for web search to gather results
c00d442

Luigi committed on

support thinking models and stream their thoughts to the display
8c3c2b9

Luigi committed on

do not preview prompt at error return from chat response
c09049b

Luigi committed on

inject assistant placeholder at right time
12dd3f3

Luigi committed on

disable L142 which is not needed
3c176a1

Luigi committed on

fix bug in prompt preview display
41ee8bf

Luigi committed on

add prompt preview for debug
5fc0117

Luigi committed on

fix: prevent self-talking issue by using tokenizer chat_template formatting
960db60

Luigi committed on

bugfix to Error: "str" object has no attribute "pad_token_id"
889f080

Luigi committed on

add taiwan elm 1.1b & 270m instruct
c8399e3

Luigi committed on

add type in qwen3 0.6b repo id
76d4d60

Luigi committed on

add qwen3
fe395ab

Luigi committed on

Add Smollm2 360m instruct fine-tuned on TaiwanChat
7308211

Luigi committed on

keep debug message
37f7787

Luigi committed on

add debug to show web search result
a2f07a4

Luigi committed on

give 1 second for web search to grab data
9ad3ffd

Luigi committed on

inject web search result if web search enabled
bc257ff

Luigi committed on

refactor(app): improve streaming, background search, dtype fallback, and cleanup
293686e

Luigi committed on

bugfix: not using pipeline for response generation
939895d

Luigi committed on

Add original SmolLM2 135M Instruct for comparison
423dc1a

Luigi committed on

Add SmolLM2-135M-Instruct-TaiwanChat
38fcc03

Luigi committed on

Add SmolLM2-135M TaiwanChat
0d642b7

Luigi committed on

Update README.md
34cf84a
verified

Luigi committed on

default to gemma-3-4b
88a6a62

Luigi committed on

model repo_id typo fix
89372fa

Luigi committed on

enable web search by default
6235e63

Luigi committed on

remove tinyllama which has bad response quality
a22cf42

Luigi committed on

make streaming response
5ea073d

Luigi committed on

flatten history before it goes into the prompt
ef361b0

Luigi committed on

better management on system prompt
5f6306a

Luigi committed on

add accelerate
5ed3cb3

Luigi committed on

use chat pipeline instead of model and tokenizer individually
ac8e9cc

Luigi committed on

bugfix to padding-related issues
f248fec

Luigi committed on

add attention mask
b6b3940

Luigi committed on

Clean model description
4731160

Luigi committed on

pin torch to 2.4.0
4c6b4c5

Luigi committed on

add sentencepiece tokenizer
4afc958

Luigi committed on

update requirements
51e3e3c

Luigi committed on