Diception Demo
A Generalist Diffusion Model for Vision Perception
A Generalist Diffusion Model for Vision Perception
Large Language Diffusion Models
Generate customized images using text and an ID image
OpenAI's Deep Research, but open
"One-minute creation by AI Coding Autonomous Agent MOUSE-I"
SD3.5 in 8-steps with TensorArt TurboX
Transcribe audio from microphone, file, or YouTube link
Generate virtual try-on results for clothing
Display and filter the UGI leaderboard data
Generate images with virtual try-on or pose transfer
Generate creative text content using predefined scripts
Convert PDFs to Markdown with open-source parsers
Enhance and restore old photos with faces
Create audio from videos or text prompts
Generate thoughts based on hand gestures
Break the language barrier
Mixture of Diffusers and ControlNet Tile Upscaler for SDXL
Submit and evaluate models for a leaderboard
Generate stunning high quality illusion artwork
Video deep fake (uncensored)
Clone voice to say text
Generate detailed images from a prompt and an image
Magma-8B model for UI Agents