AI Model Library

Sort by: Downloads · Favorites · Latest | Filter: text-generation

Qwen3-0.6B

text-generation
Qwen · Qwen/Qwen3-0.6B

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun

18,271,115 1454

gpt2

text-generation
openai-community · openai-community/gpt2

Test the full generation capabilities here: https://transformer.huggingface.co/doc/gpt2-large

15,977,391 3414

Qwen2.5-7B-Instruct

text-generation
Qwen · Qwen/Qwen2.5-7B-Instruct

Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qw

14,018,215 1443

Qwen2.5-1.5B-Instruct

text-generation
Qwen · Qwen/Qwen2.5-1.5B-Instruct

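Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Compared with Qwen2, Qwen2.5 brings the following improvements: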

12,078,264 900

Qwen3-8B

text-generation
Qwen · Qwen/Qwen3-8B

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun

11,444,026 1266

DeepSeek-V3.2

text-generation
deepseek-ai · deepseek-ai/DeepSeek-V3.2

11,120,947 1621

Qwen3-4B-Instruct-2507

text-generation
Qwen · Qwen/Qwen3-4B-Instruct-2507

We introduce the updated version of the Qwen3-4B non-thinking mode, named Qwen3-4B-Instruct-2507, featuring the following key enhancements:

10,724,588 1052

Llama-3.1-8B-Instruct

text-generation
meta-llama · meta-llama/Llama-3.1-8B-Instruct

9,374,277 5995

Qwen2.5-3B-Instruct

text-generation
Qwen · Qwen/Qwen2.5-3B-Instruct

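Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Compared with Qwen2, Qwen2.5 brings the following improvements: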

8,755,192 657

opt-125m

text-generation
facebook · facebook/opt-125m

OPT was first introduced in "Open Pre-trained Transformer Language Models" and first released in the metaseq repository on May 3, 2022 by Meta AI.

8,598,631 437

gpt-oss-20b

text-generation
openai · openai/gpt-oss-20b

7,217,171 4596

Llama-3.2-1B-Instruct

text-generation
meta-llama · meta-llama/Llama-3.2-1B-Instruct

6,896,720 1399

tiny-Qwen2ForCausalLM-2.5

text-generation
trl-internal-testing · trl-internal-testing/tiny-Qwen2ForCausalLM-2.5

This is a minimal model built for unit tests in the TRL library.

6,506,866 6

Qwen3-32B

text-generation
Qwen · Qwen/Qwen3-32B

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun

5,968,332 692

Qwen2.5-0.5B-Instruct

text-generation
Qwen · Qwen/Qwen2.5-0.5B-Instruct

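Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Compared with Qwen2, Qwen2.5 brings the following improvements: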

5,858,017 510

dolphin-2.9.1-yi-1.5-34b

text-generation
dphn · dphn/dolphin-2.9.1-yi-1.5-34b

Curated and trained by Eric Hartford, Lucas Atkins, Fernando Fernandes, and Cognitive Computations

4,703,403 63

Qwen2-1.5B-Instruct

text-generation
Qwen · Qwen/Qwen2-1.5B-Instruct

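Qwen2 is the new series of Qwen large language models. For Qwen2, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters, including a Mixture-of-Experts model. This repository contains the instruction-tuned 1.5B-parameter Qwen2 model.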

4,474,021 162

gpt-oss-120b

text-generation
openai · openai/gpt-oss-120b

4,387,264 4768

DeepSeek-R1

text-generation
deepseek-ai · deepseek-ai/DeepSeek-R1

3,681,237 13326

Qwen3-4B

text-generation
Qwen · Qwen/Qwen3-4B

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun

3,491,372 610