AI 模型库

共 个模型

spkrec-ecapa-voxceleb

speechbrain · speechbrain/spkrec-ecapa-voxceleb

Speaker Verification with ECAPA-TDNN embeddings on Voxceleb

2,447,877 325

Llama-3.2-3B-Instruct

text-generation
meta-llama · meta-llama/Llama-3.2-3B-Instruct

text-generation

2,441,920 2124

TRELLIS-image-large

image-to-3d
microsoft · microsoft/TRELLIS-image-large

<!-- Provide a quick summary of what the model is/does. -->

2,435,939 641

esmfold_v1

facebook · facebook/esmfold_v1

ESMFold是一种基于ESM-2骨干网络的最先进的端到端蛋白质折叠模型。它不需要任何查找或多序列比对步骤,因此无需依赖任何外部数据库即可进行预测。这使得其推理速度显著快于AlphaFold。

2,425,269 50

blip-image-captioning-base

image-to-text
Salesforce · Salesforce/blip-image-captioning-base

BLIP:面向统一视觉-语言理解与生成的语言-图像预训练引导方法

2,399,250 852

all-distilroberta-v1

sentence-similarity
sentence-transformers · sentence-transformers/all-distilroberta-v1

all-distilroberta-v1 这是一个句子变换器模型:它将句子和段落映射到768维的稠密向量空间,可用于聚类或语义搜索等任务。

2,392,130 42

chronos-2-small

time-series-forecasting
autogluon · autogluon/chronos-2-small

This is the _small_ variant of the Chronos-2 model with 28M parameters. For usage and details on the Chronos-2 model, please refer to https://huggingface.co/autogluon/chronos-2.

2,339,445 4

Qwen2.5-Coder-7B-Instruct

text-generation
Qwen · Qwen/Qwen2.5-Coder-7B-Instruct

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 bil

2,328,991 705

Qwen3-0.6B-FP8

text-generation
Qwen · Qwen/Qwen3-0.6B-FP8

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun

2,318,955 59

whisper-small

automatic-speech-recognition
openai · openai/whisper-small

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data, Whisper models demonstrate a strong ability to generalise to many

2,293,475 556

Qwen3.6-27B

image-text-to-text
Qwen · Qwen/Qwen3.6-27B

> [!Note] > This repository contains model weights and configuration files for the post-trained model in the Hugging Face Transformers format. > > These artifacts are compatible with Hugging Face Tra

2,273,063 1231

Qwen3-VL-Embedding-2B

sentence-similarity
Qwen · Qwen/Qwen3-VL-Embedding-2B

The **Qwen3-VL-Embedding** and **Qwen3-VL-Reranker** model series are the latest additions to the Qwen family, built upon the recently open-sourced and powerful Qwen3-VL foundation model. Specifically

2,267,689 398

Gemma-4-31B-IT-NVFP4

text-generation
nvidia · nvidia/Gemma-4-31B-IT-NVFP4

描述: Gemma 4 31B IT 是由 Google DeepMind 构建的开放多模态模型,支持文本和图像输入,能够将视频作为帧序列进行处理,并生成文本输出。该模型旨在为推理、智能体工作流、编程和多模态理解提供前沿性能。

2,262,752 470

Bio_ClinicalBERT

fill-mask
emilyalsentzer · emilyalsentzer/Bio_ClinicalBERT

《公开可用的临床BERT嵌入》论文包含四种独特的临床BERT模型:基于BERT-Base(`cased_L-12_H-768_A-12`)或BioBERT(`BioBERT-Base v1.0 + PubMed 200K + PMC 270K`)初始化,并在所有MIMIC笔记或仅出院小结上进行训练。

2,251,720 428

chatterbox

text-to-speech
ResembleAI · ResembleAI/chatterbox

**09/04 🔥 Introducing Chatterbox Multilingual in 23 Languages!**

2,233,576 1578

Qwen2.5-0.5B

text-generation
Qwen · Qwen/Qwen2.5-0.5B

Qwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了一系列基础语言模型和指令微调语言模型,参数规模从0.5亿到720亿不等。相较于Qwen2,Qwen2.5带来了以下改进:

2,227,722 401

gte-multilingual-base

sentence-similarity
Alibaba-NLP · Alibaba-NLP/gte-multilingual-base

**gte-multilingual-base** 模型是 GTE(通用文本嵌入)模型系列中的最新成员,具备以下关键特性:

2,221,782 359

vit-base-patch16-224-in21k

image-feature-extraction
google · google/vit-base-patch16-224-in21k

Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224. It was introduced in the paper An Image is Worth 16x16 Words: Transformers for Ima

2,218,067 406

OmniVoice

text-to-speech
k2-fsa · k2-fsa/OmniVoice

&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;

2,212,436 840

nsfw-image-detection-384

image-classification
Marqo · Marqo/nsfw-image-detection-384

__NOTE: Like all models, this one can make mistakes. NSFW content can be subjective and contextual, this model is intended to help identify this content, use at your own risk.__

2,198,756 52