AI 模型库

共 个模型

fairface_age_image_detection

image-classification
dima806 · dima806/fairface_age_image_detection

Detects age group with about 59% accuracy based on an image.

6,233,949 73

Qwen3.5-4B

image-text-to-text
Qwen · Qwen/Qwen3.5-4B

> [!Note] > This repository contains model weights and configuration files for the post-trained model in the Hugging Face Transformers format. > > These artifacts are compatible with Hugging Face Tra

6,181,955 525

DFN5B-CLIP-ViT-H-14-378

apple · apple/DFN5B-CLIP-ViT-H-14-378

一个在DFN-5B上训练的CLIP(对比语言-图像预训练)模型。 数据过滤网络(DFN)是用于自动过滤大规模未整理数据池的小型网络。 该模型基于从430亿未整理图文对(128亿图像)中筛选出的50亿张图像进行训练。

6,071,841 109

paraphrase-multilingual-mpnet-base-v2

sentence-similarity
sentence-transformers · sentence-transformers/paraphrase-multilingual-mpnet-base-v2

sentence-transformers/paraphrase-multilingual-mpnet-base-v2

6,055,304 460

Qwen3-32B

text-generation
Qwen · Qwen/Qwen3-32B

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun

5,968,332 692

chronos-2

time-series-forecasting
autogluon · autogluon/chronos-2

Chronos-2 **Chronos-2** is a 120M-parameter, encoder-only time series foundation model for zero-shot forecasting. It supports **univariate**, **multivariate**, and **covariate-informed** tasks within

5,898,361 14

Wan_2.2_ComfyUI_Repackaged

Comfy-Org · Comfy-Org/Wan_2.2_ComfyUI_Repackaged

示例:https://comfyanonymous.github.io/ComfyUI_examples/wan22/

5,887,459 705

Qwen2.5-0.5B-Instruct

text-generation
Qwen · Qwen/Qwen2.5-0.5B-Instruct

Qwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了一系列基础语言模型和指令微调语言模型,参数量从0.5亿到720亿不等。与Qwen2相比,Qwen2.5带来了以下改进:

5,858,017 510

Qwen3-Embedding-0.6B

feature-extraction
Qwen · Qwen/Qwen3-Embedding-0.6B

The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen

5,778,172 1008

WanVideo_comfy

Kijai · Kijai/WanVideo_comfy

WanVideo的合并与量化模型,源自此处:

5,761,123 2304

gemma-4-E4B-it

any-to-any
google · google/gemma-4-E4B-it

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

5,585,425 971

Qwen3-VL-8B-Instruct

image-text-to-text
Qwen · Qwen/Qwen3-VL-8B-Instruct

Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.

5,445,377 898

whisper-large-v3

automatic-speech-recognition
openai · openai/whisper-large-v3

Whisper is a state-of-the-art model for automatic speech recognition (ASR) and speech translation, proposed in the paper Robust Speech Recognition via Large-Scale Weak Supervision by Alec Radford et

4,998,671 5669

segmentation

voice-activity-detection
pyannote · pyannote/segmentation

语音活动检测

4,834,943 676

vit-base-patch16-224

image-classification
google · google/vit-base-patch16-224

Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224, and fine-tuned on ImageNet 2012 (1 million images, 1,000 classes) at resolution 22

4,780,326 958

dolphin-2.9.1-yi-1.5-34b

text-generation
dphn · dphn/dolphin-2.9.1-yi-1.5-34b

由Eric Hartford、Lucas Atkins、Fernando Fernandes及Cognitive Computations共同策划并训练

4,703,403 63

bert-base-multilingual-cased

fill-mask
google-bert · google-bert/bert-base-multilingual-cased

基于掩码语言建模(MLM)目标,在维基百科规模最大的前104种语言上预训练的模型。该模型首次发表于此论文,并在此仓库中首次发布。此模型区分大小写:例如,它能够识别"english"与"English"之间的差异。

4,498,947 587

Qwen2-1.5B-Instruct

text-generation
Qwen · Qwen/Qwen2-1.5B-Instruct

Qwen2是Qwen大语言模型系列的新版本。针对Qwen2,我们发布了一系列基础语言模型和指令微调语言模型,参数量从0.5亿到720亿不等,其中包括一个混合专家模型。本仓库包含指令微调后的15亿参数Qwen2模型。

4,474,021 162

mxbai-embed-large-v1

feature-extraction
mixedbread-ai · mixedbread-ai/mxbai-embed-large-v1

The crispy sentence embedding family from Mixedbread.

4,425,778 795

gpt-oss-120b

text-generation
openai · openai/gpt-oss-120b

Try gpt-oss · Guides · Model card · OpenAI blog

4,387,264 4768