sebastjan
  • Joined on 2026-04-20
sebastjan created repository sebastjan/folxplay-clone 2026-04-29 11:17:28 +02:00
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 11:03:55 +02:00
e350352883 Fix: Gemini 3.1 Pro thinking model needs 32k maxOutputTokens (was 4096 → MAX_TOKENS truncation)
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:52:20 +02:00
534d710e8a Auto-resume jobs interrupted by container restart
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:43:34 +02:00
32baf9cd45 Auto-resume: cleanup stuck jobs on container startup + GEMINI_API_KEY env
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:26:31 +02:00
ec71c54570 Upgrade to Sonnet 4.6 + add Gemini 3.1 Pro support
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:22:14 +02:00
9faa224885 Upgrade Claude model: Haiku 4.5 → Sonnet 4.6 for better Slavic language transcript correction
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:20:21 +02:00
69fb2f5ce8 Upgrade default Whisper model: small/medium → large-v3 for much better Slovenian/Slavic transcription accuracy
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:13:37 +02:00
4bc5ac6756 Major: Claude post-processing of Whisper transcript
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 10:03:26 +02:00
4e123bdabc UI: hide lang/model dropdowns — both are fully automatic now (3-sample lang detection + medium default model)
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 09:59:23 +02:00
af3c933c78 Robust language detection + anti-hallucination
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 09:35:03 +02:00
c870d80726 Fix: extend clip if ends mid-vocal (no chorus cut-off), DejaVu Sans font (supports SLO/HR/BS chars), auto-upgrade to medium Whisper model for Slavic languages
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 09:07:33 +02:00
5d5e169f9d Disable Whisper VAD filter — was dropping vocal segments in songs creating gaps in subtitles
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 08:55:44 +02:00
a04811bdc9 Add Claude LLM analysis: sends full transcript to Claude API for true song structure understanding (refrain detection across all repetitions, not just local heuristic)
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 08:33:56 +02:00
e072eec362 Fix: handle Whisper transcribe failure for instrumental-only audio (fallback to empty transcript)
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 08:23:45 +02:00
33a138af9e Fix: force native Python bool/float for JSON serialization (numpy types)
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 08:21:38 +02:00
8512076b91 Major: smart selection pipeline (analyze.py) + audio fade + multi-lang auto-detect
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 08:09:30 +02:00
81edd24ca3 Subtitles: smaller font 56px (was 84), higher position MarginV=400, side margins 80px for safe zone
sebastjan pushed to main at sebastjan/reels-app 2026-04-29 07:59:39 +02:00
ba787744a6 Subtitles: cap chunk duration at 2.5s, split long lines into multiple time slices for faster reels pacing
sebastjan pushed to main at sebastjan/reels-app 2026-04-28 20:09:57 +02:00
e001387a89 Subtitles: convert SRT to ASS directly with PlayResY=1920 for predictable scaling instead of unreliable force_style
sebastjan pushed to main at sebastjan/reels-app 2026-04-28 19:40:51 +02:00
28d933c916 Subtitles: UPPERCASE + position lower (MarginV=320 for 1080x1920) + bigger font