AI 资源中心

共 109 项资源

排序方式

下载量收藏数最新

厂商

全部 01.AI Alibaba Alibaba-NLP Andycurrent Anthropic BAAI Bingsu Cohere Comfy-Org DeepSeek E-MIMIC EleutherAI FacebookAI Falconsai FinLang Google Kijai MahmoudAshraf Marqo Meta Midjourney Mistral AI NeoQuasar OpenAI Perplexity AI ProsusAI Qwen RedHatAI ResembleAI Salesforce Stability AI TinyLlama TostAI Xenova amazon apple argmaxinc autogluon cardiffnlp colbert-ir coqui cross-encoder cyankiwi daekeun-ml deepseek-ai depth-anything dima806 distilbert docling-project dphn emilyalsentzer facebook google google-bert google-t5 hexgrad hmellor intfloat jinaai jonatasgrosman k2-fsa laion llava-hf lpiccinelli meta-llama microsoft mistralai mixedbread-ai neuralmind nomic-ai nvidia openai openai-community patrickjohncyh prajjwal1 pyannote rhasspy sentence-transformers speechbrain stabilityai timm trl-internal-testing unsloth usyd-community vikhyatk xAI zai-org 商汤科技字节跳动智谱AI 月之暗面百川智能百度科大讯飞稀宇科技腾讯阶跃星辰

electra-base-discriminator

google

ELECTRA：以判别器而非生成器方式预训练文本编码器

389

gemma-4-31B-it

image-text-to-text

google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

2838

gemma-4-26B-A4B-it

image-text-to-text

google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

925

gemma-4-E4B-it

any-to-any

google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

971

vit-base-patch16-224

image-classification

google

Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224, and fine-tuned on ImageNet 2012 (1 million images, 1,000 classes) at resolution 22

958

gemma-4-E2B-it

any-to-any

google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

597

mobilebert-uncased

google

MobileBERT：面向资源受限设备的紧凑型任务无关BERT

gemma-3-12b-it

image-text-to-text

google

image-text-to-text

714

vit-base-patch16-224-in21k

image-feature-extraction

google

Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224. It was introduced in the paper An Image is Worth 16x16 Words: Transformers for Ima

406

gemma-3-4b-it

image-text-to-text

google

image-text-to-text

1322

flan-t5-base

google

0. TL;DR 1. Model Details 2. Usage 3. Uses 4. Bias, Risks, and Limitations 5. Training Details 6. Evaluation 7. Environmental Impact 8. Citation 9. Model Card Authors

1073

siglip-so400m-patch14-384

zero-shot-image-classification

google

SigLIP model pre-trained on WebLi at resolution 384x384. It was introduced in the paper Sigmoid Loss for Language Image Pre-Training by Zhai et al. and first released in this repository.

674