AI 模型库
nomic-embed-text-v1.5
sentence-similaritynomic-embed-text-v1.5:基于套娃表示学习的可伸缩生产级嵌入模型
adetailer
- coco2017(仅人物) - AniSeg - skytnt/anime-segmentation
nsfw_image_detection
image-classificationModel Card: Fine-Tuned Vision Transformer (ViT) for NSFW Image Classification
colbertv2.0
ColBERT is a _fast_ and _accurate_ retrieval model, enabling scalable BERT-based search over large text collections in tens of milliseconds.
bge-large-en-v1.5
feature-extractionModel List | FAQ | Usage | Evaluation | Train | Contact | Citation | License
Qwen2.5-7B-Instruct
text-generationQwen2.5 is the latest series of Qwen large language models. For Qwen2.5, we release a number of base language models and instruction-tuned language models ranging from 0.5 to 72 billion parameters. Qw
Qwen2.5-1.5B-Instruct
text-generationQwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了一系列基础语言模型和指令微调语言模型,参数量从0.5亿到720亿不等。相较于Qwen2,Qwen2.5带来了以下改进:
Qwen3-8B
text-generationQwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Built upon extensive training, Qwen3 delivers groun
DeepSeek-V3.2
text-generation<!-- markdownlint-disable first-line-h1 --> <!-- markdownlint-disable html --> <!-- markdownlint-disable no-duplicate-header -->
whisperkit-coreml
automatic-speech-recognition--- pretty_name: "WhisperKit" viewer: false library_name: whisperkit tags: - whisper - whisperkit - coreml - asr - quantized - automatic-speech-recognition --- WhisperKit
wespeaker-voxceleb-resnet34-LM
在生产环境中使用这个开源模型? 考虑切换到 pyannoteAI 以获得更优、更快的选择。
Qwen3-4B-Instruct-2507
text-generationWe introduce the updated version of the **Qwen3-4B non-thinking mode**, named **Qwen3-4B-Instruct-2507**, featuring the following key enhancements:
bge-reranker-v2-m3
text-classification更多详情请参考我们的Github:FlagEmbedding。
speaker-diarization-3.1
automatic-speech-recognition自动语音识别
segmentation-3.0
voice-activity-detection语音活动检测
clip-vit-large-patch14-336
zero-shot-image-classification<!-- This model card has been generated automatically according to the information Keras had access to. You should probably proofread and complete it, then remove this comment. -->
Kokoro-82M
text-to-speech**Kokoro** 是一个拥有8200万参数的开源权重文本转语音(TTS)模型。尽管其架构轻量,但能提供与更大模型相媲美的质量,同时速度显著更快、成本效益更高。凭借Apache许可证授权的权重,Kokoro可部署于从生产环境到各类应用场景。
Llama-3.1-8B-Instruct
text-generationtext-generation
gemma-4-31B-it
image-text-to-textHugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind
Qwen2.5-3B-Instruct
text-generationQwen2.5是Qwen大语言模型的最新系列。针对Qwen2.5,我们发布了从0.5到720亿参数规模的多个基础语言模型和指令微调语言模型。相较于Qwen2,Qwen2.5带来了以下改进: