AI Model Library

0. TL;DR 1. Model Details 2. Usage 3. Uses 4. Bias, Risks, and Limitations 5. Training Details 6. Evaluation 7. Environmental Impact 8. Citation 9. Model Card Authors

2,082,419 1073 transformers

siglip-so400m-patch14-384

zero-shot-image-classification

google · google/siglip-so400m-patch14-384

SigLIP model pre-trained on WebLi at resolution 384x384. It was introduced in the paper Sigmoid Loss for Language Image Pre-Training by Zhai et al. and first released in this repository.

2,080,535 674 transformers

Qwen3-Embedding-4B

feature-extraction

Qwen · Qwen/Qwen3-Embedding-4B

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen

2,076,631 262 sentence-transformers

stable-diffusion-xl-base-1.0

text-to-image

stabilityai · stabilityai/stable-diffusion-xl-base-1.0

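SDXL consists of an ensemble-of-experts pipeline for latent diffusion: first, the base model generates (noisy) latents, which are then further processed by a refinement model specialized for the final denoising steps (available at: https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0/).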

2,075,935 7702 diffusers

Qwen2.5-14B-Instruct-AWQ

text-generation

Qwen · Qwen/Qwen2.5-14B-Instruct-AWQ

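Qwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 billion to 72 billion parameters. Compared with Qwen2, Qwen2.5 brings the following improvements: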

2,055,940 35 transformers

chronos-bolt-tiny

time-series-forecasting

autogluon · autogluon/chronos-bolt-tiny

🚀 **Update Feb 14, 2025**: Chronos-Bolt models are now available on Amazon SageMaker JumpStart! Check out the tutorial notebook to learn how to deploy Chronos endpoints for production use in a few lin

2,045,801 13

gpt2-large

text-generation

openai-community · openai-community/gpt2-large

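Table of Contents - Model Details - How to Get Started with the Model - Uses - Risks, Limitations and Biases - Training - Evaluation - Environmental Impact - Technical Specifications - Citation Information - Model Card Authors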

2,042,727 349 transformers

inclusively-reformulation-it5

E-MIMIC · E-MIMIC/inclusively-reformulation-it5

This model is an Italian sequence-to-sequence model, fine-tuned from IT5-large for the task of inclusive language reformulation.

2,036,303 2 transformers

vitpose-plus-base

keypoint-detection

usyd-community · usyd-community/vitpose-plus-base

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation and ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation. It obtains 81.1 AP on MS COCO Keypoint test-d

2,032,582 31 transformers

Qwen3-ASR-1.7B

automatic-speech-recognition

Qwen · Qwen/Qwen3-ASR-1.7B

The Qwen3-ASR family includes Qwen3-ASR-1.7B and Qwen3-ASR-0.6B, which support language identification and ASR for 52 languages and dialects. Both leverage large-scale speech training data and the str

2,021,550 793

Depth-Anything-V2-Small-hf

depth-estimation

depth-anything · depth-anything/Depth-Anything-V2-Small-hf

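Depth Anything V2 is trained on 595K synthetic labeled images and 62M+ real unlabeled images, providing the most capable monocular depth estimation (MDE) model, with the following features: - finer-grained details than Depth Anything V1 - more robust than Depth Anything V1 and SD-based models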

2,015,953 31 transformers

pythia-70m-deduped

text-generation

EleutherAI · EleutherAI/pythia-70m-deduped

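The *Pythia Scaling Suite* is a collection of models developed to facilitate interpretability research (see the paper). The suite contains two sets of eight models, with sizes 70M, 160M, 410M, 1B, 1.4B, 2.8B, 6.9B, and 12B. Each size has two models: one trained on the Pile, and the other trained on the P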

1,963,084 28 transformers

Qwen3.5-2B

image-text-to-text

Qwen · Qwen/Qwen3.5-2B

Note: This repository contains model weights and configuration files for the post-trained model in the Hugging Face Transformers format. These artifacts are compatible with Hugging Face Tra

1,941,060 269 transformers

e5-base-v2

sentence-similarity

intfloat · intfloat/e5-base-v2

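Text Embeddings by Weakly-Supervised Contrastive Pre-training. Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, Furu Wei, arXiv 2022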

1,940,618 155 sentence-transformers

DeepSeek-V4-Pro

text-generation

deepseek-ai · deepseek-ai/DeepSeek-V4-Pro

DeepSeek-V4: Towards Highly Efficient Million-Token Context Intelligence

1,339,144 3838 transformers
