Explore

Explore the latest available applications at Dione.

Open WebUI

Dione Team

Open WebUI is an extensible, feature-rich, and user-friendly self-hosted AI platform designed to operate entirely offline. It supports various LLM runners like Ollama and OpenAI-compatible APIs, with a built-in inference engine for RAG, making it a powerful AI deployment solution.

1/31/2025
46

Browser use

Dione Team

This project builds upon the foundation of browser-use, which is designed to make websites accessible for AI agents.

v1.0.0 5/5/2025
19

MatAnyone

Dione Team

Stable Video Matting with Consistent Memory Propagation

v1.0.0 6/30/2025
74

WanGP by DeepBeepMeep : The best Open Source Video Generative Models Accessible to the GPU Poor

v1.0.0 6/25/2025
209

Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, speed up inference, and study experimental features.

v1.0.0 5/13/2025
48

Text Generation WebUI

Dione Team

NVIDIA Only - LLM UI with advanced features, easy setup, and multiple backend support.

v1.0.0 7/10/2025
24

Prompt, run, edit, and deploy full-stack web applications using any LLM you want!

v1.0.0 1/28/2025
64

Invoke AI

Dione Team

Invoke is a leading creative engine built to empower professionals and enthusiasts alike. Generate and create stunning visual media using the latest AI-driven technologies. Invoke offers an industry leading web-based UI, and serves as the foundation for multiple commercial products.

v1.0.0 5/13/2025
13

A Universal Customization Method for Both Single and Multi-Subject Conditioning

v1.0.0 6/25/2025
9

Melo TTS

Dione Team

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese, and Korean.

v1.0.0 5/11/2025
94

MMAudio

Dione Team

MMAudio generates synchronized audio given video and/or text inputs. Our key innovation is multimodal joint training which allows training on a wide range of audio-visual and audio-text datasets. Moreover, a synchronization module aligns the generated audio with the video frames.

v1.0.0 5/11/2025
53

Facefusion

Dione Team

Industry-leading face manipulation platform.

v1.0.0 6/24/2025
145

Zonos is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.

v1.0.0 7/13/2025
98

ACE-Step

Dione Team

A Step Towards Music Generation Foundation Model

v1.0.0 6/24/2025
78

Ultimate-TTS-Studio

Dione Team

NVIDIA ONLY – All-in-One TTS App with Kokoro, Chatterbox, Fish-Speech, F5 & index-tts. Supports Conversation Mode & eBook-to-Audiobook. All features work across all engines in a unified interface.

v1.0.0 7/1/2025
185

Whisper

Dione Team

A Gradio-based browser interface for Whisper. You can use it as an Easy Subtitle Generator!

v1.0.0 7/30/2025
58

Kokoro TTS

Dione Team

Welcome to Kokoro, a high-quality text-to-speech synthesis program powered by deep learning. This tool converts any text into high-fidelity speech in just a few seconds. Simply input text, select a voice, adjust the speed, and enjoy the generated audio.

v1.0.0 7/17/2025
115

n8n is a workflow automation platform that gives technical teams the flexibility of code with the speed of no-code. With 400+ integrations, native AI capabilities, and a fair-code license, n8n lets you build powerful automations while maintaining full control over your data and deployments.

v1.0.0 7/22/2025
80

KittenTTS

pierrunoyt

Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis.

v1.0.0 8/8/2025
109

Stable Audio Open

Dione Team

Stable Audio Open allows anyone to generate up to 47 seconds of high-quality audio data from a simple text prompt. Its specialised training makes it ideal for creating drum beats, instrument riffs, ambient sounds, foley recordings and other audio samples for music production and sound design.

v1.0.0 7/13/2025
42

ComfyUI

Dione Team

ComfyUI lets you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. Available on Windows, Linux, and macOS.

v1.0.0 8/4/2025
182

E2-F5 TTS

Dione Team

A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

v1.0.0 7/14/2025
70

Stable Diffusion WebUI

Dione Team

A web interface for Stable Diffusion, implemented using the Gradio library.

v1.0.0 7/13/2025
42

StreamSnap

Dione Team

A powerful, AI-powered Gradio application for downloading, transcribing, and analyzing YouTube videos and audio.

v1.0.0 7/13/2025
68

RuinedFooocus

Dione Team

Forget everything you thought you knew about AI art generation - RuinedFooocus is here to completely reinvent the game!

v1.0.0 7/4/2025
43

FLUX.1 Kontext

Dione Team

Transform your images with AI-powered editing magic! Upload an image and describe your desired changes, then watch the magic happen! All under 20 GB of VRAM.

v1.0.0 7/1/2025
79

This project provides a user-friendly Gradio-based Graphical User Interface (GUI) for Kohya's Stable Diffusion training scripts. Stable Diffusion training empowers users to customize image generation models by fine-tuning existing models, creating unique artistic styles, and training specialized models like LoRA (Low-Rank Adaptation).

v1.0.0 7/13/2025
10

Applio is a powerful, AI-driven voice conversion tool that enables you to create personalized voices or make use of a variety of pre-existing voices. Whether you prefer local installation or cloud-based usage through Google Colab, Applio is designed to be efficient and user-friendly.

v1.0.0 1/10/2025
3298

Chatterbox-vllm

Dione Team

This is a port of https://github.com/resemble-ai/chatterbox to vLLM.

v1.0.0 8/5/2025
59

ZipVoice

Dione Team

ZipVoice is a series of fast and high-quality zero-shot TTS models based on flow matching.

v1.0.0 8/12/2025
84

OmniGen

Dione Team

A unified image generation model that you can use to perform various tasks, including but not limited to text-to-image generation, subject-driven generation, identity-preserving generation, and image-conditioned generation.

v1.0.0 5/12/2025
14

OrpheusTTS

pierrunoyt

A Dione configuration for OrpheusTTS: an automated installer that downloads the TTS application files and installs PyTorch with platform-specific GPU support (NVIDIA/AMD/CPU).

v1.0.0 8/15/2025
62

Dia directly generates highly realistic dialogue from a transcript. You can condition the output on audio, enabling emotion and tone control. The model can also produce nonverbal communications like laughter, coughing, clearing throat, etc.

v1.0.0 8/27/2025
58

SoniTranslate

Dione Team

SoniTranslate is a powerful and user-friendly web application that allows you to easily translate videos into different languages. This repository hosts the code for the SoniTranslate web UI, which is built with the Gradio library to provide a seamless and interactive user experience.

v1.0.0 7/31/2025
93

VibeVoice-Windows

sup3rmass1ve

Frontier Open-Source Text-to-Speech

v1.0.0 9/1/2025
71

LBM-Relighting

Dione Team

We introduce Latent Bridge Matching (LBM), a new, versatile, and scalable method that relies on Bridge Matching in a latent space to achieve fast image-to-image translation. We show that the method can reach state-of-the-art results for various image-to-image tasks using only a single inference step. In addition to its efficiency, we also demonstrate the versatility of the method across different image translation tasks such as object removal, normal and depth estimation, and object relighting. We also derive a conditional framework of LBM and demonstrate its effectiveness by tackling the tasks of controllable image relighting and shadow generation.

v1.0.0 8/20/2025
24

Want to see more? Check out the home page for more details.