
Qwen/Qwen3-4B-Base · Hugging Face
Qwen3-4B-Base. Qwen3 Highlights: Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
Qwen/Qwen3-30B-A3B-Base · Hugging Face
Qwen3-30B-A3B-Base. Qwen3 Highlights: Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) …
black-forest-labs/FLUX.2-klein-base-9B · Hugging Face
FLUX.2 [klein] 9B Base is a 9 billion parameter rectified flow transformer that generates images from text descriptions and supports multi-reference editing. It's a full …
Qwen/Qwen3-8B · Hugging Face
# Use the endpoint provided by Alibaba Model Studio: # 'model_type': 'qwen_dashscope', # 'api_key': os.getenv('DASHSCOPE_API_KEY'), # Use a custom endpoint compatible with …
answerdotai/ModernBERT-base · Hugging Face
Dec 19, 2024 · On GLUE, ModernBERT-base surpasses other similarly-sized encoder models, and ModernBERT-large is second only to DeBERTa-v3-large. For general retrieval tasks, …
unsloth/FLUX.2-klein-base-9B-GGUF · Hugging Face
FLUX.2 [klein] 9B Base is a 9 billion parameter rectified flow transformer that generates images from text descriptions and supports multi-reference editing. It's a full …
google-t5/t5-base · Hugging Face
T5-Base is the checkpoint with 220 million parameters. Developed by: Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, …