AI 资源中心

共 109 项资源

electra-base-discriminator

google

ELECTRA:以判别器而非生成器方式预训练文本编码器

389

gemma-4-31B-it

image-text-to-text
google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

2838

gemma-4-26B-A4B-it

image-text-to-text
google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

925

gemma-4-E4B-it

any-to-any
google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

971

vit-base-patch16-224

image-classification
google

Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224, and fine-tuned on ImageNet 2012 (1 million images, 1,000 classes) at resolution 22

958

gemma-4-E2B-it

any-to-any
google

Hugging Face | GitHub | Launch Blog | Documentation License: Apache 2.0 | Authors: Google DeepMind

597

mobilebert-uncased

google

MobileBERT:面向资源受限设备的紧凑型任务无关BERT

71

gemma-3-12b-it

image-text-to-text
google

image-text-to-text

714

vit-base-patch16-224-in21k

image-feature-extraction
google

Vision Transformer (ViT) model pre-trained on ImageNet-21k (14 million images, 21,843 classes) at resolution 224x224. It was introduced in the paper An Image is Worth 16x16 Words: Transformers for Ima

406

gemma-3-4b-it

image-text-to-text
google

image-text-to-text

1322

flan-t5-base

google

0. TL;DR 1. Model Details 2. Usage 3. Uses 4. Bias, Risks, and Limitations 5. Training Details 6. Evaluation 7. Environmental Impact 8. Citation 9. Model Card Authors

1073

siglip-so400m-patch14-384

zero-shot-image-classification
google

SigLIP model pre-trained on WebLi at resolution 384x384. It was introduced in the paper Sigmoid Loss for Language Image Pre-Training by Zhai et al. and first released in this repository.

674