大模型部署¶
LLM-TPU 项目简介¶
LLM-TPU 包含了各类开源生成式 AI 模型的移植部署例程,其中以 LLM 为主,也包含 Stable Diffusion(AI 绘画)。
项目仓库链接:LLM-TPU。
例程清单¶
| Model | INT4 | INT8 | FP16/BF16 | Huggingface Link |
|---|---|---|---|---|
| Baichuan2-7B | ✔ | LINK | ||
| ChatGLM3-6B | ✔ | ✔ | ✔ | LINK |
| CodeFuse-7B | ✔ | ✔ | LINK | |
| DeepSeek-6.7B | ✔ | ✔ | LINK | |
| Falcon-40B | ✔ | ✔ | LINK | |
| Phi-3-mini-4k | ✔ | ✔ | ✔ | LINK |
| Qwen-7B | ✔ | ✔ | ✔ | LINK |
| Qwen-14B | ✔ | ✔ | ✔ | LINK |
| Qwen-72B | ✔ | LINK | ||
| Qwen1.5-0.5B | ✔ | ✔ | ✔ | LINK |
| Qwen1.5-1.8B | ✔ | ✔ | ✔ | LINK |
| Llama2-7B | ✔ | ✔ | ✔ | LINK |
| Llama2-13B | ✔ | ✔ | ✔ | LINK |
| LWM-Text-Chat | ✔ | ✔ | ✔ | LINK |
| Mistral-7B-Instruct | ✔ | ✔ | LINK | |
| Stable Diffusion | ✔ | LINK | ||
| Stable Diffusion XL | ✔ | LINK | ||
| WizardCoder-15B | ✔ | LINK | ||
| Yi-6B-chat | ✔ | ✔ | LINK | |
| Yi-34B-chat | ✔ | ✔ | LINK |