Qwen3 ASR Demo
Convert audio to text with context and language options
Convert audio to text with context and language options
Generate high-quality images from text prompts
Generate images from text prompts
inpaint images using Qwen Image with inpainting Controlnet
UMO based on OmniGen2
Dedicated display for RTEB benchmark results
Flux Kontext extended with product placement capabilities
Generate 3D CAD models from images
Generate any application with DeepSeek
generate a video from an image with a text prompt
Generate a video by interpolating between two images with a prompt
Generate expressive speech from text with emotion control
Generate high-quality images from text prompts
Powerful Watermark Removal API
Convert audio to text with context and language options
Generate web application code from descriptions
High-fidelity 3D Geometry Generation from single view image
Mood Palette Generator
Edit images based on user instructions
Try on clothes virtually by uploading images
Embedding Leaderboard
Image-to-3D Generation
Remove background from images
Generate images from text prompts
ChatGPT with real-time web search & URL reading capability
generate a video from an image with a text prompt
The ultimate guide to training LLM on large GPU Clusters
Nano Banana for Hugging Face PRO users
Swap faces in images
Visualize embeddings in 3D space, powered by EmbeddingGemma
Chatterbox TTS supporting 23 languages
Generate Gradio app code from user requests