Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity Mar 18 • 2
microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224 Zero-Shot Image Classification • Updated Jan 14 • 30.1k • 156
wkcn/TinyCLIP-ViT-8M-16-Text-3M-YFCC15M Zero-Shot Image Classification • Updated 25 days ago • 6.13k • 4
sentence-transformers/paraphrase-multilingual-mpnet-base-v2 Sentence Similarity • Updated Mar 27 • 874k • 255