AI-Powered Creative Platform
Version 0.4 | March 17, 2026 | Feature-Complete Reference
Self-contained desktop application for AI-driven creative filmmaking. Zero cloud dependencies.

Download Visione for free: https://huggingface.co/atMrMattV/Visione

#	Section	Description
1	Platform Overview	What Visione is, the 11 modes, design philosophy
2	System Architecture	Hardware, software stack, model inventory, VRAM budget, file structure
3	Imagine — Image & Video Generation	Image generation (3 models), video generation (LTX-2.3, 7 workflows), prompt enhancement
4	Retouch — Image Editor	Image editor: AI editing, drawing tools, face swap, smart select, adjustments, LUTs
5	Retexture — Video Stylization	Video stylization: LoRA presets + Contextual Edit (Kiwi-Edit)
6	Enhance — Post-Processing	Image enhance, video enhancement (SeedVR2 + RTX SR), upscaling, frame interpolation
7	Storyboard — AI Filmmaking Pipeline	12-stage AI + manual filmmaking pipeline, 5 AI agents, character references
8	Sound — Audio Generation	Music (ACE-Step), voice (Qwen3-TTS), foley (HunyuanVideo-Foley)
9	Editor — Non-Linear Video Editor	NLE: timeline, 37 transitions, 7 tools, 5 engines, LUT grading, FFmpeg export
10	Cross-Component Integration	Content flow, SendToChips routing, SSE, VRAM arbitration
11	Frontend Architecture	AppShell, NavRail, themes, design system, state management
12	API Surface	Complete 144-endpoint inventory across 13 route modules
13	Data Model	TypeScript interfaces, database strategy
14	Codebase Statistics	Source code metrics, dependencies, fonts
15	Technical Reference	Critical gotchas (GPU, models, ComfyUI, frontend, LLM)
16	Model Registry & Downloads	HuggingFace model registry, 33 models, starter/feature tiers
17	System Dependency Check	14 runtime dependency checks, component validation
18	Installer & Distribution	NSIS installer, Tauri 2 bundle, release configuration

1. Platform Overview

What is Visione?

Visione is a self-contained desktop application for AI-driven creative production. It runs entirely on local hardware with zero cloud dependencies: no external APIs, no subscriptions, and no internet connection required after the initial model download. The platform covers a complete creative pipeline, from a text idea to a finished short film with music, voiceover, sound effects, and editing, all within a single application.

Eleven Modes (3 Categories)

Category	Mode	Purpose	Status
Make	Imagine	Text-to-image generation (3 models, 85 style LoRAs)	Complete
Make	Animate	Image/text-to-video generation (LTX-2.3, 7 workflows)	Complete
Make	Retexture	Video stylization (35 LoRA presets + Contextual Edit)	Complete
Refine	Retouch	Image editor: AI editing, 16 tools, face swap, adjustments, LUTs	Complete
Refine	Sound	Music (ACE-Step), voice (Qwen3-TTS), foley (HunyuanVideo-Foley)	Complete
Refine	Enhance	Image enhance, video enhancement, upscaling, frame interpolation	Complete
Produce	Storyboard	12-stage AI + manual filmmaking pipeline (concept to ZIP)	Complete
Produce	Editor	Non-linear video editor with 37 transitions, 7 tools, export	Complete
Produce	Styles	CivitAI LoRA browser + download + custom style management	Complete
Produce	Characters	Character library with AI reference generation + validation	Complete
Produce	Gallery	Output gallery with multi-filter toolbar, detail view, actions	Complete

Design Philosophy

Self-contained: Every model runs locally. No internet required after initial model download.
Sequential GPU: One heavy model at a time. Models load and unload to fit within VRAM.
User control: AI handles execution complexity; humans make creative decisions.
No black boxes: Every generation step is visible, editable, and reversible.
Editorial aesthetic: Dark, typographic UI. No generic Material/Bootstrap styling.
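
The "Sequential GPU" principle above can be illustrated with a minimal sketch. This is not Visione's actual implementation: the class name SequentialModelSlot, the loader callables, and the events log are all hypothetical, and a real backend would also have to release GPU memory when a model is dropped. The sketch only shows the idea of keeping one heavy model resident at a time and swapping it when a different one is requested.

```python
import threading
from contextlib import contextmanager
from typing import Callable, Dict, Iterator, List, Optional


class SequentialModelSlot:
    """Keeps at most one heavy model resident at a time (hypothetical helper)."""

    def __init__(self, loaders: Dict[str, Callable[[], object]]) -> None:
        self._loaders = loaders            # model name -> function that builds it
        self._lock = threading.Lock()      # serializes all heavy GPU work
        self._name: Optional[str] = None   # name of the resident model, if any
        self._model: Optional[object] = None
        self.events: List[str] = []        # load/unload log, for inspection

    @contextmanager
    def use(self, name: str) -> Iterator[object]:
        """Yield the requested model, swapping out the resident one if needed."""
        loader = self._loaders[name]       # fail early on unknown names
        with self._lock:
            if self._name != name:
                self._unload()
                self._model = loader()
                self._name = name
                self.events.append(f"load:{name}")
            yield self._model

    def _unload(self) -> None:
        if self._name is not None:
            self.events.append(f"unload:{self._name}")
            # Assumption: dropping the reference stands in for freeing VRAM.
            self._model = None
            self._name = None


if __name__ == "__main__":
    # Placeholder loaders; real ones would load model weights onto the GPU.
    slot = SequentialModelSlot({
        "image": lambda: "image-model",
        "video": lambda: "video-model",
    })
    with slot.use("image"):
        pass
    with slot.use("image"):   # already resident, so no reload
        pass
    with slot.use("video"):   # evicts the image model first
        pass
    print(slot.events)        # ['load:image', 'unload:image', 'load:video']
```

Holding the lock for the whole duration of use() means a second request waits until the first finishes, which is one simple way to guarantee that two heavy models never compete for VRAM.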
