Tab Reference - Loki Studio

Home Tab

Purpose:Overview and quick actions for your Loki Studio workflow

Features

Config Status

Shows the currently loaded configuration file and its path

Quick Stats

Videos in upload folder
Transcripts completed
Metadata generated
Thumbnails created
Videos uploaded

Recent Activity

Last 10 actions taken, useful for tracking workflow progress

Quick Actions

Open root directory
Open upload folder
Open config file
Refresh all stats

Use When

Checking progress at a glance
Verifying config is loaded
Quick navigation to folders

Tips

Check stats regularly to monitor workflow progress
Recent Activity helps identify bottlenecks
Keep config file backed up

Transcription Tab (Muninn Engine)

Purpose: Convert video audio to text transcripts using the Muninn transcription engine (Whisper AI). 100% FREE and runs locally.

Features

Video List

Shows all videos in toupload/ folder

Columns: Name, Duration, Status, Transcript

Settings Panel

Model: Dropdown shows only installed models (large-v3-turbo recommended)
Device: CUDA (GPU) for 10-20x faster, or CPU
Compute Type: float16 (GPU), float32, int8 (CPU)
VAD Mode: Auto, Silero, Energy, or Off
Language: 99+ languages with auto-detect
Translation: Any-to-any translation between 30+ languages (NLLB-200)
Speaker Diarization: Identify up to 10 speakers
Word Timestamps: Essential for karaoke captions

Progress Display

Current video progress bar
Overall progress (X/Y videos)
ETA (estimated time remaining)
GPU utilization (if available)

Log Window

Detailed processing logs with timestamps
Error messages and warnings
Speed metrics and performance data

Controls

▶ Start

Begin transcription

⏸ Pause

Pause current job

⏹ Stop

Cancel processing

View Transcript

Open transcript file

Use When

First step in workflow - before metadata generation
Need to update transcripts after editing video
Testing different Whisper models for accuracy

Tips

Use large-v3-turbo - Best balance of speed and quality
Enable Word Timestamps if you want karaoke-style captions
Use GPU (CUDA) - 10-20x faster than CPU
Enable Silero VAD for intelligent silence removal
Performance: ~29x realtime on RTX 4090 with large-v3
Cost: $0.00 - completely free, runs locally

Metadata Tab

Purpose: Generate YouTube titles, descriptions, and tags using AI. FREE with Ollama!

Features

Video List

Shows videos with transcripts. Columns: Name, Status, Title, Tags

Preview Panel

Title: Full title text (under 100 characters)
Description: Full description with line breaks and keywords
Tags: Comma-separated tag list for SEO
Key Moments: Auto-generate YouTube chapters
Character Counts: Shows length vs. YouTube limits

AI Provider Options

Ollama (FREE): One-click model install from Settings
LM Studio (FREE): Local LLM with nice UI
OpenAI: GPT-4o-mini (~$0.01-0.05/video)
Anthropic: Claude Sonnet (~$0.02-0.10/video)

AI Personalities (37+)

Dad Joke, Space Marine, Product Reveal
Professional, Casual, Enthusiastic, Mysterious
Custom personalities available per profile

Templates

Description Header:Prepended to all descriptions
Description Footer:Appended to all descriptions
Custom Prompts:Override AI behavior (advanced)

Controls

▶ Generate

Start AI generation

Regenerate

Force new metadata

Edit

Manual editing

Export

Save to CSV/JSON

Use When

After transcription is complete
Need fresh metadata for re-uploads
Testing different AI providers
Bulk metadata generation for multiple videos

Tips

Specific config = better metadata:Include target_audience and genre in your config
Cloud providers usually superior:OpenAI typically better quality than local AI
Adjust temperature:0.9 for creative, 0.5 for conservative titles
Always review:AI isn't perfect - manually edit if needed
Save templates:Reuse good metadata for similar videos

Example Generated Metadata:

Title:

Skyrim VR - Epic Dragon Battle at College of Winterhold! (Mage Ep. 5)

Tags:

Skyrim vr, skyrim gameplay, elder scrolls, vr gaming, dragon battle, mage build, college of winterhold, destruction magic, vr rpg, skyrim playthrough

Thumbnail Tab

Purpose:Create custom YouTube thumbnails with logos, text, and visual effects

Features

Video List

Shows videos ready for thumbnail generation. Columns: Name, Status, Thumbnail preview

Preview Panel

Large preview (1920x1080 resolution)
Zoom controls for detailed inspection
Open in external viewer for full-screen display
Regenerate button to try different frame

Logo Settings

Logo Path:Path to logo file
Position:Top-left, top-right, top-center, bottom-left, etc.
Size:Max width/height (preserves aspect ratio)

Text Settings

Title Text:Game/content name (large, white by default)
Subtitle:Episode/description (smaller, colored)
Font Scale:Control text size
Colors:RGB customization for each text element
Outline:Add outline for visibility on any background

Frame Selection

Timestamp Range:Extract frame from specific seconds (e.g., 30-120s)
Brightness Threshold:Reject dark frames automatically
Max Retry:Number of attempts to find good frame

Controls

▶ Generate

Create thumbnails

Regenerate

Try different frame

Edit External

Open in editor

Set Template

Save settings

Flair Feature

Place PNG images in[root]/Flair/Folder
Randomly selected or specify in config
Examples: explosions, stars, character faces, badges
Positioned viaFlair_positionSetting

Use When

After metadata generated (optional but recommended)
Creating professional-looking thumbnails
Batch thumbnail creation for multiple videos
Testing different visual styles

Tips

Logo placement:PlaceLogo.pngIn root for auto-detection
Use bright, contrasting colorsFor text visibility
Timestamp range 30-120s:Usually has better frames than opening scenes
Add text outlineFor visibility on any background
Test logo positions:Top-right common for gaming content
Flair adds personality:Use decorative overlays to stand out

Thumbnail Layout Example:

┌─────────────────────────────────────────┐
│ [Logo] │ ← Logo (top-right)
│ │
│ SKYRIM VR │ ← Title (large, white)
│ Epic Dragon Battle! │ ← Subtitle (blue)
│ │
│ [Gameplay Frame] │ ← Video screenshot
│ │
└─────────────────────────────────────────┘

Upload Tab

Purpose: Batch upload processed videos to YouTube with scheduling and playlist management

Features

Video Queue Reordering (NEW!)

Up/Down Arrows: Control upload priority
Queue Persistence: Order saved across tab switches
Drag-free: Reliable arrow button reordering

YouTube Channel Section

Channel Dropdown: Select active channel
Add Channel: Authorize new channel
Validate Channel: Check credentials validity

Video List

Shows videos ready for upload (marked with checkmark)

Columns: Name, Size, Ready, Publish Date, Status, Reorder buttons

Resumable Uploads

Handles network interruptions gracefully
Resume where it left off after reconnection

Upload Settings

Schedule:Days between uploads, release time (HH: MM), timezone
Privacy:Public, unlisted, or private
Playlist:Name (auto-created if doesn't exist)
Category:Gaming, People & Blogs, Science & Tech, etc.
Made for Kids:Yes/No (COPPA compliance)

Progress Display

Per-video progress: percentage, speed (MB/s), ETA
Overall progress: X/Y videos uploaded
Quota usage: YouTube API quota consumed

State Management

Uploaded Videos List:Track previously uploaded videos
State File: .yt_uploader_state.jsonTracks history
Skip Uploaded:Won't re-upload same video
Reset State:Clear upload history (advanced)

Controls

▶ Start

Begin batch upload

⏸ Pause

Pause current upload

⏹ Stop

Cancel remaining uploads

Dry Run

Simulate without uploading

Use When

Final step in workflow - all processing complete
Batch uploading multiple videos
Scheduling future publishes
Switching between multiple YouTube channels

YouTube Quota Costs

Upload video: 1,600 units
Add to playlist: 50 units
Upload thumbnail: 50 units
Per video total:~1,700 units
Daily limit:~6 videos per day (10,000 / 1,700)

Tips

Always test with Dry Run first!Verify files before uploading
Upload during off-peak hoursFor faster speeds
Schedule publishes for optimal times(evenings, weekends)
Use playlistsTo organize content
Test with "unlisted"Before publishing publicly
Monitor quota:10,000 units/day max
Videos moved toDone/FolderAfter upload
Can close Loki StudioAfter upload starts (background processing)

Timeline Editor (Mimir Engine)

Purpose: Video editing, audio mixing, and live caption burning with GStreamer-powered playback

Features

Live Caption System (NEW!)

Caption Styles: One Word Pop (karaoke), Full Segment, Sliding Window
Live Preview: See captions during playback
Caption Burning: Burn captions directly into video
Color Customization: Highlight, Pending, Spoken word colors
Position Control: Bottom, Top, Center, corners

GStreamer Playback

Hardware-accelerated video playback
Multi-track audio support
Real-time waveform display
Frame-accurate seeking

Audio Track Management

Auto-detect all audio tracks (Game, Mic 1, Mic 2)
Per-track volume controls and mute
Voice Dominant auto-balance for gaming
LUFS analysis and normalization

Segment Marking

Mark start and end points for segments
Toggle between Keep/Remove modes
Delete segment markers

Segment List

🟢Keep:Include in final video
🔴Remove:Exclude from final video
Manual time editing
Drag & drop reordering

Voice-Triggered Trimming

Auto-Detect Markers:Find "start cut" / "end cut" in transcript
Marker Settings:Customizable start/end phrases
Audio Channel:Select channel for multi-channel videos
Review & Adjust:Preview detected segments
Apply:Create segments from markers

Export Options

Output Path:Where to save final video
Video Codec:Copy (fast) or re-encode (smaller files)
Audio Channel:Select audio track for VR/multi-channel
Export Final:Process and save final video
Progress:Shows FFmpeg processing status

Use When

Removing boring parts (loading screens, deaths, menus)
Creating highlight reels from long recordings
Merging multi-episode recordings into single video
Voice-triggered editing (say "start cut" during recording)
Creating shorter videos from long source material

Workflow Example

Record 2-hour gaming session
Say "start cut" when boring part starts, "end cut" when it ends
Load video in Timeline tab
Click "Auto-Detect Markers" button
Review auto-marked segments
Manually add/adjust segments if needed
Export final video (e.g., 2 hours → 45 minutes of action)

Tips

Use voice markers during recordingFor hands-free editing
Keep segments shortFor tighter pacing
Export with "copy" codecFor fastest processing (no quality loss)
Test with short videos firstTo learn workflow
Export multiple versionsWith different segment selections

Application Settings (Yggdrasil)

Purpose: Global application settings organized into subtabs

Subtabs

1. Directories

Base Directory: Root folder for all video operations
Create Missing: Auto-create folder structure (toupload/, Done/, Failed/)

2. LLM Providers

Ollama (FREE): One-click model install!
LM Studio (FREE): Local LLM option
OpenAI: GPT-4o-mini ($0.01-0.05/video)
Anthropic: Claude Sonnet ($0.02-0.10/video)
Install Models: Click Install button for Ollama models
Refresh: Detect newly installed models

3. Transcription

Model: Only shows installed Whisper models
Device: CUDA (GPU) or CPU
VAD: Auto, Silero, Energy, Off
Language: 99+ languages with auto-detect

4. Captions

Style: One Word Pop, Full Segment, Sliding Window
Colors: Highlight, Pending, Spoken word colors
Position: Bottom, Top, Center, corners
Font Size: 12-72 pixels
Background: Toggle semi-transparent backdrop

5. Advanced

Threading: Processing thread configuration
Debug: Verbose logging options
Cache: Clear cached data

Logging

Log Level:Debug, Info, Warning, Error
Log to File:Save logs to disk
Log Path:Where to save log files
Clear Logs:Delete old log files

Shortcuts

Keyboard Shortcuts:Customize key bindings
Quick Actions:Assign keys to common tasks

Advanced

Developer Mode:Show debug information and API calls
API Rate Limits:Throttle API calls to avoid hitting quotas
Experimental Features:Enable beta features and early access

Tips

Set default configTo auto-load your primary project on startup
Adjust GPU memory limitIf you have limited VRAM
Enable loggingFor troubleshooting issues
Customize keyboard shortcutsFor your workflow
Monitor cache sizeAnd clear periodically to free disk space

Tab Reference Guide

Table of Contents

Home Tab

Features

Use When

Tips

Transcription Tab (Muninn Engine)

Features

Controls

Use When

Tips

Metadata Tab

Features

Controls

Use When

Tips

Thumbnail Tab

Features

Controls

Flair Feature

Use When

Tips

Upload Tab

Features

Controls

Use When

YouTube Quota Costs

Tips

Timeline Editor (Mimir Engine)

Features

Use When

Workflow Example

Tips

Application Settings (Yggdrasil)

Subtabs

Tips