AI Model Library
all-MiniLM-L6-v2
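sentence-similarity: This is a sentence-transformers model: it maps sentences and paragraphs to a 384-dimensional dense vector space and can be used for tasks such as clustering or semantic search.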
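As a rough illustration of how 384-dimensional sentence vectors support semantic search, the sketch below ranks candidate vectors by cosine similarity to a query vector. It does not load the model: the vectors are deterministic pseudo-random stand-ins, and `fake_embedding`, `cosine_similarity`, and `rank_by_similarity` are hypothetical helper names introduced only for this example. In practice the vectors would come from the model itself.

```python
import math
import random

DIM = 384  # embedding size stated for all-MiniLM-L6-v2


def fake_embedding(seed, dim=DIM):
    """Assumption: stand-in for a real sentence embedding (pseudo-random vector)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query, corpus):
    """Return (corpus index, score) pairs, most similar first."""
    scores = [(i, cosine_similarity(query, vec)) for i, vec in enumerate(corpus)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Three stand-in "document" vectors.
corpus = [fake_embedding(seed) for seed in (1, 2, 3)]

# The query is a lightly perturbed copy of corpus[1], mimicking a paraphrase.
noise = fake_embedding(99)
query = [x + 0.1 * n for x, n in zip(corpus[1], noise)]

ranking = rank_by_similarity(query, corpus)
print(ranking[0][0])  # index of the closest corpus vector
```

Clustering works on the same vectors: items whose embeddings have high cosine similarity are grouped together.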
Qwen3-VL-2B-Instruct
image-text-to-text: Meet Qwen3-VL, the most powerful vision-language model in the Qwen series to date.
bert-base-uncased
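fill-mask: A model pretrained on English text using a masked language modeling (MLM) objective. It was introduced in the original paper and first released in the accompanying repository. This model is uncased: it does not distinguish between "english" and "English".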
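To make "uncased" and "masked language modeling" concrete, here is a minimal sketch of how an input might be prepared: the text is lowercased and one token is replaced by a mask token that the model is then asked to predict. Whitespace splitting is a simplifying assumption (the real model uses a subword tokenizer), and `normalize_uncased` and `mask_token` are hypothetical helper names.

```python
MASK_TOKEN = "[MASK]"  # mask token used by BERT-style tokenizers


def normalize_uncased(text):
    """Lowercase the text so "English" and "english" become identical."""
    return text.lower()


def mask_token(tokens, position):
    """Replace the token at `position` with the mask token.

    Returns the masked sequence and the hidden token, which is the
    prediction target under an MLM objective.
    """
    masked = list(tokens)
    target = masked[position]
    masked[position] = MASK_TOKEN
    return masked, target


# Assumption: whitespace tokenization stands in for subword tokenization.
tokens = normalize_uncased("English is widely spoken").split()
masked, target = mask_token(tokens, 0)
print(masked, target)
```

A cased model would skip the lowercasing step, so "English" and "english" would remain distinct tokens.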
electra-base-discriminator
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
paraphrase-multilingual-MiniLM-L12-v2
sentence-similarity: sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
ms-marco-MiniLM-L6-v2
text-ranking: This model was trained on the MS MARCO passage ranking task.
bge-small-en-v1.5
feature-extraction
all-mpnet-base-v2
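sentence-similarity: This is a sentence-transformers model: it maps sentences and paragraphs to a 768-dimensional dense vector space and can be used for tasks such as clustering or semantic search.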
clip-vit-large-patch14
zero-shot-image-classification: Disclaimer: The model card is taken and modified from the official CLIP repository.
bge-m3
sentence-similarity: For more details, please refer to our GitHub repository: https://github.com/FlagOpen/FlagEmbedding
clip-vit-base-patch32
zero-shot-image-classification: Disclaimer: The model card is taken and modified from the official CLIP repository.
clap-htsat-fused
audio-classification: Model card for CLAP: Contrastive Language-Audio Pretraining
roberta-large
fill-mask: RoBERTa large model. A model pretrained on English text using a masked language modeling (MLM) objective. It was introduced in the original paper and first released in the accompanying repository. This model is case-sensitive.
xlm-roberta-base
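fill-mask: The XLM-RoBERTa model was pretrained on 2.5TB of filtered CommonCrawl data covering 100 languages. It was introduced by Conneau et al. in the paper "Unsupervised Cross-lingual Representation Learning at Scale" and first released in the accompanying repository.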
mobilenetv3_small_100.lamb_in1k
image-classification: A MobileNet-v3 image classification model, trained on ImageNet-1k in `timm` using a recipe template described in its model card.
Qwen3-0.6B
text-generation: Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun
roberta-base
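fill-mask: A model pretrained on English text using a masked language modeling (MLM) objective. It was introduced in the original paper and first released in the accompanying repository. This model is case-sensitive: it distinguishes between "english" and "English".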
chronos-2
time-series-forecasting: Update Dec 30, 2025: Deploy Chronos-2 on Amazon SageMaker. New guide covers real-time GPU and CPU inference, serverless endpoints (run on demand, no idle costs), and batch transform for large-s
distilbert-base-uncased
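fill-mask: This model is a distilled version of the BERT base model. It was introduced in the original paper, and the code for the distillation process is publicly available. This model is uncased: it does not distinguish between "english" and "English".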
gpt2
text-generation: Test the full generation capabilities here: https://transformer.huggingface.co/doc/gpt2-large