doberst commited on
Commit
039d5b0
·
verified ·
1 Parent(s): 730fa72

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ inference: false
4
+ base_model: Qwen/Qwen2.5-Coder-7B-Instruct
5
+ base_model_relation: quantized
6
+ tags: [green, llmware-chat, p7, ov]
7
+ ---
8
+
9
+ # qwen2.5-coder-7b-instruct-npu-ov
10
+
11
+ **qwen2.5-coder-7b-instruct-npu-ov** is an OpenVino int4 quantized version of [Qwen2.5-Coder-7B-Instruct](https://www.huggingface.co/Qwen/Qwen2.5-Coder-7B-Instruct), providing a fast inference implementation, optimized for AI PCs using Intel NPU.
12
+
13
+ This is from the latest release series from Qwen, and is an excellent chat/instruct coding assistant model.
14
+
15
+ ### Model Description
16
+
17
+ - **Developed by:** Qwen
18
+ - **Quantized by:** llmware
19
+ - **Model type:** qwen2.5
20
+ - **Parameters:** 7 billion
21
+ - **Model Parent:** Qwen/Qwen2.5-Coder-7B-Instruct
22
+ - **Language(s) (NLP):** English
23
+ - **License:** Apache 2.0
24
+ - **Uses:** Chat, general-purpose LLM
25
+ - **Quantization:** int4
26
+
27
+
28
+ ## Model Card Contact
29
+
30
+ [llmware on github](https://www.github.com/llmware-ai/llmware)
31
+
32
+ [llmware on hf](https://www.huggingface.co/llmware)
33
+
34
+ [llmware website](https://www.llmware.ai)