AI 模型库
Qwen3-0.6B
text-generationQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun
gpt2
text-generation在此处测试整体生成能力:https://transformer.huggingface.co/doc/gpt2-large
Qwen2.5-7B-Instruct
text-generationQwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qw
Qwen2.5-1.5B-Instruct
text-generationQwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了一系列基础语言模型和指令微调语言模型,参数量从0.5亿到720亿不等。相较于Qwen2,Qwen2.5带来了以下改进:
Qwen3-8B
text-generationQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun
DeepSeek-V3.2
text-generation<!-- markdownlint-disable first-line-h1 --> <!-- markdownlint-disable html --> <!-- markdownlint-disable no-duplicate-header -->
Qwen3-4B-Instruct-2507
text-generationWe introduce the updated version of the **Qwen3-4B non-thinking mode**, named **Qwen3-4B-Instruct-2507**, featuring the following key enhancements:
Llama-3.1-8B-Instruct
text-generationtext-generation
Qwen2.5-3B-Instruct
text-generationQwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了从0.5到720亿参数规模的多个基础语言模型和指令微调语言模型。相较于Qwen2,Qwen2.5带来了以下改进:
opt-125m
text-generationOPT 首次在《开放预训练Transformer语言模型》中被提出,并于2022年5月3日由Meta AI在metaseq的代码库中首次发布。
gpt-oss-20b
text-generationTry gpt-oss · Guides · Model card · OpenAI blog
Llama-3.2-1B-Instruct
text-generationtext-generation
tiny-Qwen2ForCausalLM-2.5
text-generation这是一个为TRL库中的单元测试构建的最小模型。
Qwen3-32B
text-generationQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun
Qwen2.5-0.5B-Instruct
text-generationQwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了一系列基础语言模型和指令微调语言模型,参数量从0.5亿到720亿不等。与Qwen2相比,Qwen2.5带来了以下改进:
dolphin-2.9.1-yi-1.5-34b
text-generation由Eric Hartford、Lucas Atkins、Fernando Fernandes及Cognitive Computations共同策划并训练
Qwen2-1.5B-Instruct
text-generationQwen2是Qwen大语言模型系列的新版本。针对Qwen2,我们发布了一系列基础语言模型和指令微调语言模型,参数量从0.5亿到720亿不等,其中包括一个混合专家模型。本仓库包含指令微调后的15亿参数Qwen2模型。
gpt-oss-120b
text-generationTry gpt-oss · Guides · Model card · OpenAI blog
DeepSeek-R1
text-generationDeepSeek-R1 <!-- markdownlint-disable first-line-h1 --> <!-- markdownlint-disable html --> <!-- markdownlint-disable no-duplicate-header -->
Qwen3-4B
text-generationQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun