reels-app

Author	SHA1	Message	Date
Sebastjan Artič	446769b68f	Trim bar: visible styling (was too transparent on dark theme) Bug from screenshot: trim bar was invisible due to: - Background rgba(255,255,255,0.05) too transparent - Handles 18px width with low contrast - Removed video controls Fixed: - Trim bar background #1a1a1a + 2px #444 border (visible) - Handles 24px width, full red #ff6b6b with strong glow - Region 35-20% opacity (brighter) - Playhead 3px white with shadow (visible) - Restored video controls - Added hint text below trim bar	2026-04-30 11:17:57 +00:00
Sebastjan Artič	9dba2a1185	iPhone-style trim bar: drag handles instead of sliders User feedback: 'tako kot imajo na iphonu - potegnem iz leve in iz desne za na konec... reel pa more biti že v stanju postavljen' Replaced 2 separate range sliders with iPhone-style trim bar: - Single horizontal bar showing full video duration - 2 draggable handles (left = start, right = end) - Selected region highlighted in accent color - Live playhead during playback - Mouse + touch support - Click anywhere on bar = seek to that position - Initial state: handles positioned at auto-selected clip range (just fine-tune left/right, no need to set from scratch) formatTime helper for nice m:ss.c display.	2026-04-30 10:34:34 +00:00
Sebastjan Artič	7cb4302dcd	Edit feature: slider + napis edit + recut endpoint User insight: 'treba je narediti da ko se reels naredijo da jih lahko popravljamo... delamo na avtomatiko ampak lahk pa tudi popravljam' Avto pipeline ostane (Soniox + Claude + render). Po render-u uporabnik lahko klikne ✏️ Edit gumb in: 1. Slider za clip start/end: - Vidi 16:9 original video - Drag start/end slider z živim preview-om - Dolžina prikazana real-time - Min 5s, max 60s 2. Edit napisov (collapsed, opcijsko): - Klik na vrstico → input za popravek besedila - Original timestamp ostane, samo besedilo se posodobi - Uporabno za 'doline IZBOR' → 'doline IZPOD' tip popravkov 3. Re-render: - Backend POST /api/jobs/{id}/recut z {start, end, custom_segments} - Worker preskoči Soniox + Claude (custom_clip flag) - Re-uporabi cached transcript + analysis - Re-render samo: clip → reframe → subtitle → output - ~30s namesto 3-5 min New endpoints: - GET /api/source-video/{id} — 16:9 original za editor preview - GET /api/transcript/{id} — segmenti + clip range za editor - POST /api/jobs/{id}/recut — re-render z user timestampi Worker change: če job ima custom_clip=True, preskoči auto_chorus analizo in samo re-uporabi obstoječi clip_range iz analysis.json (updated by recut endpoint).	2026-04-30 10:26:25 +00:00
Sebastjan Artič	64b0ed1edc	Default 'no subtitles' mode: cleaner reels without text issues User feedback: subtitles have been causing problems (wrong text from STT, chorus selection issues). Better to default to clean reels without burned-in subtitles - just video + audio at the chorus moment. Changes: - 'Brez podnapisov' checkbox now CHECKED by default - Removed 'Stil podnapisov' dropdown from UI (kept hidden for compat) - Updated step label: 'Reframe v 9:16 + podnapisi' → 'Reframe v 9:16' - Backend already supports --no-subs flag, no logic changes needed Result: reels are simpler and more reliable. Just clean 9:16 with audio. User can enable subtitles per-job by unchecking the box if needed.	2026-04-29 19:38:35 +00:00
Sebastjan Artič	108f6b1ad3	Fix: live preview blocks right-side preview buttons Bug: when user clicked Preview in the left 'Analiza pesmi' panel, it loaded an inline <video> below it. The video element grew to 400px and, combined with the sticky positioning of the left card and grid layout, caused the right column (jobs list) preview/download buttons to become unclickable — the video element was effectively layering over them. Fix: replace inline preview with the same modal that the right-side Preview buttons use. Removes the layout conflict entirely: - Live panel Preview button now opens the centered modal - Removed inline #live-video element - Removed liveVideo references from JS (resetLive) - Job cards now have data-id and data-title attributes so the modal can pull title for display Both left-side (live) and right-side (jobs list) preview now use the same clean modal experience.	2026-04-29 15:37:51 +00:00
Sebastjan Artič	9df58212b2	Two fixes: clip end overflow into instrumental + UI cleanup 1. CLIP END EXTENSION TOO AGGRESSIVE (Avsenika problem): Previous logic extended clip end to any segment within 1s pause. This caused clip to spill into instrumental break or next chorus. New rules (multi-language): - Hard cap: original_clip_end + 3s max (prevents long instrumental tails) - Pause threshold tightened: 0.7s (was 1.0s) - Length check: skip segments longer than 2.5s (likely new verse/chorus) - Outro filler regex: only extend if next segment matches (la\|na\|oh\|ah\|eh\|ej\|aj\|ja\|hey\|yeah\|yo\|ho\|wo\|hu\|mm\|nn\|uu\|oo\|aa\|ee\|ii) - Universal across languages (works for SLO 'aj ja ja', EN 'yeah', ES 'ay ay ay', RO 'hei hei', JP 'la la la') 2. UI CLEANUP: - Removed dead pendingFile/pendingArtist/pendingTitle references (multi-upload migration left some single-file resets behind) - Job watch handler no longer tries to clear single-file state	2026-04-29 15:14:42 +00:00
Sebastjan Artič	91cc03658d	Multi-upload batch queue + Telegram notifications Changes: 1. Frontend multi-upload: - File input now has 'multiple' attribute, drag-drop accepts multiple - File queue list with per-file artist/title preview + remove button - 'Pošlji vse' uploads sequentially (one at a time to avoid network saturation) - Each file gets same batch_id for Telegram batch summary - After upload, queue clears, jobs appear in right sidebar 2. Backend queue worker: - New _queue_worker() background thread processes 'queued' jobs sequentially - Only 1 job at a time to keep openclaw stable (avoid CPU/RAM thrash) - FIFO order by created_at - Auto-starts on app startup after job resume 3. Job submission flow change: - /api/process and /api/youtube no longer call background.add_task directly - Just mark status='queued', queue worker picks up - This means upload completes fast, processing happens in background - User can close browser, jobs continue 4. Telegram notifications (FOLX Alerts bot): - Per-job: 'Reel pripravljen: Lady Gaga - Abracadabra (29s, 30 MB)' - Per-job failed: 'Reel ni uspel: <name> + error message' - Batch summary: 'Batch končan: 10/10 reels pripravljeni' (only if >1 in batch) - Uses existing TELEGRAM_TOKEN + TELEGRAM_CHAT_ID env vars - app/telegram.py module with notify_job_done(), notify_job_failed(), notify_batch_complete() 5. batch_id field: - Added to Job model + StartJobIn pydantic - Saved during upload + process - Used to count batch progress and trigger summary notification User experience: - Drag 20 videos at once - Click 'Pošlji' - Close browser, go grab coffee - Telegram sends 'Reel pripravljen' for each - After all done: 'Batch končan: 20/20 reels pripravljeni' summary - Open app to download all	2026-04-29 15:12:38 +00:00
Sebastjan Artič	157e6b781e	Fix 'Žena' word still cut: word-level start extension instead of segment-level Previous fix used segment boundaries — required segments <3s for type 1 or <4s for type 2. But Žena was in a 4.3s segment ('saj še doma mi več noč'jo verjet'. Žena me'), so the condition wasn't met and clip start stayed at 77.7s, exactly at end of word 'Žena' (76.88-77.70s). New approach: scan word-level timestamps directly: 1. If clip start falls MID-WORD → extend back to word start - 0.15s 2. If a word ends 0-0.5s BEFORE clip start AND next word is at clip start → that word is suspect (may be first word of chorus that Scribe put in previous segment), extend back to its start - 0.15s Word-level timestamps are always available from Scribe (timestamps_granularity=word). Falls back to segment-level for local Whisper without word timing. This handles arbitrary segment lengths and is universal — works for any language where the chorus starts on a word that the STT placed in the previous segment.	2026-04-29 15:04:18 +00:00
Sebastjan Artič	1cc8e8be35	MXF/MPG broadcast format support: handle multichannel audio properly Problem: MXF and MPG files (TV broadcast formats) often contain: - Multiple audio streams (4-8 streams for different language tracks) - Multichannel layouts (5.1, 7.1) instead of stereo - Default ffmpeg behavior was -c:a aac without channel limit, which meant multichannel got transcoded as multichannel AAC, overwriting what should have been clean stereo Solution: 1. get_audio_streams() helper probes all audio streams with ffprobe - Returns codec, channels, sample_rate, language, layout for each 2. build_audio_args() picks best stream + downmix: - Prefers first 2-channel stereo stream (usually main mix) - Falls back to first stream if none are 2-ch - Always: -ac 2 (force stereo downmix), -ar 48000, -c:a aac, -b:a 192k - Bitrate raised from 128k to 192k for music quality 3. Smart trim path now detects broadcast formats: - .mxf, .mpg, .mpeg, .ts, .m2ts, .mts → transcode (not stream copy) - Standard MP4/MOV → stream copy (faster, lossless) 4. Pre-conversion step for broadcast files without trim: - Even without --start/--duration, MXF/MPG get converted to MP4 - Same audio handling as trim path 5. Main render adds explicit -map 0✌️0 -map 0🅰️0? -ac 2 to ensure only first video and first audio stream get encoded, with stereo 6. ACR recognize also gets -map 0🅰️0 -ac 2 for MXF compatibility 7. UI accepts: video/*,.mxf,.mpg,.mpeg,.ts,.m2ts,.mts 8. Upload limit raised: 2GB → 10GB (MXF files are large) This means a TV broadcast MXF with [SLO/EN/DE language tracks] now correctly outputs stereo MP4 with the main language track preserved.	2026-04-29 14:38:48 +00:00
Sebastjan Artič	b543057cee	ACRCloud auto-recognition: never block uploads, fall back to fingerprinting Changes: 1. UI: removed blocking prompt() that asked for artist+title on filename that didn't match 'Artist - Title' pattern. Upload always proceeds. Instead shows yellow warning saying 'server will try to recognize'. 2. Backend: added scripts/acr_recognize.py — extracts 20s audio sample from video (at 15s and 60s offsets for robustness), computes ACRCloud fingerprint via native binary (3KB payload), sends to identify API. 3. Pipeline: process_job() now runs ACR recognition step before analysis IF parsed_artist or parsed_title is missing. Result is saved to job metadata and used for download filename + Scribe/Claude filename hint. 4. Credentials: ACR_HOST + ACR_ACCESS_KEY + ACR_SECRET_KEY env vars added to Coolify (using existing keys from openclaw fb-agent metka). 5. requirements.txt: added pyacrcloud==1.0.11 for native fingerprinting. This unblocks future automation/cron upload pipelines — files don't need to be perfectly named, ACRCloud will identify them automatically. Fallback chain: 1. Filename parsing (Artist - Title.mp4) 2. ACRCloud audio fingerprint (works even for '12345.mp4', 'IMG_001.mp4') 3. If both fail: download filename uses 'reel_<id>.mp4' (still works)	2026-04-29 14:24:53 +00:00
Sebastjan Artič	3877b822ff	Smart download filenames: 'Artist - Title - REEL.mp4' + validation Two improvements: 1. DOWNLOAD FILENAME: instead of 'reel_<job-id>.mp4' (e.g. reel_25e076af7600.mp4), downloads now have descriptive names like: - 'Lady Gaga - Abracadabra - REEL.mp4' - 'Modrijani - S teboj - REEL.mp4' - 'Sarah Connor - FICKA - REEL.mp4' 2. PRE-UPLOAD VALIDATION: when filename doesn't follow 'Artist - Title' format, browser prompts user for both fields. Without them, upload is blocked. This prevents files with names like '12345.mp4' or 'video_final.mp4' from being processed without identifying info. Implementation: - parse_artist_title() helper handles common formats: - 'Artist - Title.mp4' / 'Artist – Title' (em-dash) - 'Artist \| Title' / 'Artist : Title' - Strips noise: '(Official Music Video)', '(Audio)', '(HD)', '[Lyric Video]' - Client-side parser mirrors backend (validation before upload) - Backend accepts artist + title form fields (override parsed) - Job stored with parsed_artist + parsed_title + has_clean_name fields - YouTube jobs auto-fetch title via yt-dlp --info-only and parse it - Filename hint to Scribe/Claude uses parsed values (cleaner than raw filename) - Download endpoint uses build_download_filename() for content-disposition - Jobs list shows 'Artist — Title' instead of raw filename Result: downloaded reels are auto-named correctly for Facebook/Instagram upload, no more renaming files manually.	2026-04-29 14:15:18 +00:00
Sebastjan Artič	671b512917	Fix Preview button in jobs sidebar not opening modal Root cause: inline onclick with JSON.stringify(title) broke when title contained quotes, special chars, or was empty. The HTML attribute parser got confused by mismatched quotes, so click handler never fired. Fix: - Replaced inline onclick handlers with data-action attributes - Added single delegated click listener at document level - Title stored in element dataset (no HTML quoting issues) - Added escapeHtml() helper for safe rendering of titles/errors Now clicking Preview in the right sidebar opens the fullscreen modal correctly, regardless of filename characters.	2026-04-29 13:39:04 +00:00
Sebastjan Artič	389c26d012	Modal preview: click Preview opens fullscreen video player Previously: clicking Preview in jobs list showed a small inline video within the job card row. Now: clicking Preview opens a centered fullscreen modal with: - Large video player (up to 95vw × 85vh) — same experience as bottom live-preview but accessible from jobs list - Auto-play, controls, native HTML5 video player - Title shown below video for context - Download button + Close button - Click outside or ESC key to close - Backdrop blur for focus Removes the obsolete inline <video> element that was rendered hidden in each job card. Body scroll locked while modal open.	2026-04-29 13:21:36 +00:00
Sebastjan Artič	05fb0081c6	Fix preview cutoff + sticky left panel 1. Preview endpoint now supports HTTP Range requests (HTTP 206 Partial) - HTML5 video player needs Range support to seek/buffer properly - Without it, video would cut off after a few seconds - Returns chunks of 64KB on demand 2. Left panel (upload form) is now sticky (position: sticky) - Stays in view while right panel (jobs list) scrolls - On mobile (<800px) reverts to normal flow	2026-04-29 10:24:32 +00:00
Sebastjan Artič	ec71c54570	Upgrade to Sonnet 4.6 + add Gemini 3.1 Pro support - Refactored analyze_with_claude into shared _build_analysis_prompt + _parse_llm_response helpers - New analyze_with_gemini() using Gemini 3.1 Pro ($2/M in, MMMLU 92.6% — best multilingual) - Unified analyze_with_llm(provider) dispatcher with auto-fallback (Claude → Gemini) - API endpoint accepts llm_provider in StartJobIn (claude/gemini/auto) - Frontend dropdown to pick LLM - Default model is now Sonnet 4.6 (was Haiku 4.5) — 3x quality at 3x price (~3 cents/video) - Gemini support is opt-in: needs GEMINI_API_KEY env var to activate	2026-04-29 08:26:27 +00:00
Sebastjan Artič	69fb2f5ce8	Upgrade default Whisper model: small/medium → large-v3 for much better Slovenian/Slavic transcription accuracy	2026-04-29 08:20:18 +00:00
Sebastjan Artič	4e123bdabc	UI: hide lang/model dropdowns — both are fully automatic now (3-sample lang detection + medium default model)	2026-04-29 08:03:22 +00:00
Sebastjan Artič	af3c933c78	Robust language detection + anti-hallucination - 3-sample voting for auto-detect (start/middle/end of song) prevents lang switching mid-song - Lock detected language for full transcription - Anti-hallucination: condition_on_previous_text=False, temperature=0.0 - compression_ratio_threshold=2.4 (rejects repetitive hallucinations) - log_prob_threshold=-1.0 (rejects low-confidence segments) - no_speech_threshold=0.6 (more aggressive silence detection) - Default Whisper model changed: small → medium (better for all langs incl. Slavic)	2026-04-29 07:59:20 +00:00
Sebastjan Artič	c870d80726	Fix: extend clip if ends mid-vocal (no chorus cut-off), DejaVu Sans font (supports SLO/HR/BS chars), auto-upgrade to medium Whisper model for Slavic languages	2026-04-29 07:35:00 +00:00
Sebastjan Artič	8512076b91	Major: smart selection pipeline (analyze.py) + audio fade + multi-lang auto-detect - New analyze.py: full transcript + energy + structural analysis - Smart clip range: includes pre-chorus, can exceed 30s up to max_duration (default 45s) - Audio fade in/out: auto-detected from vocal boundaries - Instrumental detection: auto-disables subs if vocals < 10% of duration - Multi-language: auto-detect via Whisper or explicit (DE/SL/HR/BS/SR/EN/IT/ES/FR) - Frontend: cleaner UX, added bs language, smart selection description - reframe.py: --fade-in --fade-out args - clip.py: propagates fade params - app/main.py: replaces find_chorus.py call with analyze.py	2026-04-29 06:21:35 +00:00
Sebastjan Artič	bf7ced5c7b	Reset upload form also after failed jobs (so next upload works)	2026-04-28 16:29:39 +00:00
Sebastjan Artič	c34e4aa376	UX: Live progress panel below upload form, stable progress bar, inline preview/download	2026-04-28 16:19:40 +00:00
Sebastjan Artič	30b969e4b8	Initial: reels clipper app - FastAPI backend (auth, jobs, SSE, download) - Frontend: drag&drop + YouTube URL + jobs panel - Pipeline: yt_download → find_chorus → reframe → subtitle - Modes: track (face follow), center, blur - Whisper for SI/DE/EN subtitles - Auto-chorus detection via Whisper + RMS energy - Docker + Coolify ready	2026-04-28 15:28:22 +00:00

23 Commits