reels-app/scripts
Sebastjan Artič d8c2aae9c1 Prompt: chorus must start on FIRST WORD of FIRST LINE
User feedback after re-processing 14 reels:
- 8 perfect (BRAJDE 100%, FICKA, Abracadabra, Žena, Stisn, PA PA,
  Gojzar tanc, GADI Pijan)
- 4 problematic patterns identified:
  1. CVETELE: clip extends 22s into instrumental on 'nekoč oba' hold
  2. GORENJSKA LJUBLJENA: clip starts mid-line at 'obrnem nazaj'
     instead of 'V Ljubljani se obrnem nazaj'
  3. Fantje: clip starts mid-chorus at 'vabijo me' (2nd line)
     instead of first line
  4. PODEŽELSKI: extends into 'o o o' outro filler

Common cause: Soniox can merge the end of a verse and the start of the
chorus into one segment (e.g., '[43.6-47.6] doma. V Ljubljani se').
Claude then picks segment.start (43.6) or the next segment's start
(48.2) instead of the start of the word 'V' inside the segment.
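
Illustration only (not part of this change, which is prompt-only): a
minimal sketch of what "find the word boundary inside the segment" means.
The Word type, the find_phrase_start helper, and every word-level
timestamp below are hypothetical; only the segment bounds 43.6-47.6 and
48.2 come from the example above, and the real Soniox output format may
differ.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Word:
    text: str
    start: float  # seconds
    end: float    # seconds


def normalize(token: str) -> str:
    """Lowercase and strip surrounding punctuation so 'doma.' matches 'doma'."""
    return token.strip(".,!?;:'\"").lower()


def find_phrase_start(words: List[Word], first_line: str) -> Optional[float]:
    """Return the start time of the first word of `first_line`, or None."""
    target = [normalize(t) for t in first_line.split()]
    if not target:
        return None
    tokens = [normalize(w.text) for w in words]
    n = len(target)
    for i in range(len(tokens) - n + 1):
        if tokens[i:i + n] == target:
            return words[i].start
    return None


# Word timestamps are invented for illustration.
segment_words = [
    Word("doma.", 43.6, 45.9),      # end of verse
    Word("V", 46.4, 46.5),          # first word of the chorus
    Word("Ljubljani", 46.5, 47.2),
    Word("se", 47.2, 47.6),
    Word("obrnem", 48.2, 48.8),     # next segment starts here
    Word("nazaj", 48.8, 49.5),
]

chorus_start = find_phrase_start(segment_words, "V Ljubljani se obrnem nazaj")
```

With these sample values the lookup lands on the start of 'V' rather than
on either segment boundary (43.6 or 48.2).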

Prompt fix:
1. NEW critical rule: 'clip start = TOČNO prva beseda PRVE vrstice'
   (clip start = EXACTLY the first word of the FIRST line)
2. Warning about Soniox merging end-of-verse + start-of-chorus
3. Use word-level timestamps to find chorus start word
4. Warning about long held tones in Soniox segments (15-20s on
   'oba', 'doma', 'srca' due to fade-out instrumental)
5. Cut 1-2s after the last sung word; don't wait 20s for the tone to die
6. Outro filler: include short outros (yeah/aj-aj), but cut before
   long repeating outros (5+s of 'o o o') as those are fade-out

Added concrete examples in PRIMERI (Slovenian for 'EXAMPLES'):
- BRAJDE: 28s (already perfect)
- GORENJSKA: explicit warning about 'V Ljubljani se' boundary
- CVETELE: explicit warning about 15-20s held tone segments

This is a prompt-only change. No code logic modified.
LLM still has full autonomy on duration.
2026-04-30 05:19:35 +00:00
acr_recognize.py MXF/MPG broadcast format support: handle multichannel audio properly 2026-04-29 14:38:48 +00:00
analyze.py Prompt: chorus must start on FIRST WORD of FIRST LINE 2026-04-30 05:19:35 +00:00
clip.py Upgrade default Whisper model: small/medium → large-v3 for much better Slovenian/Slavic transcription accuracy 2026-04-29 08:20:18 +00:00
find_chorus.py Find chorus: weight repetitive short phrases (like 'Ohne dich x5') as strong chorus signal 2026-04-28 16:57:45 +00:00
reframe.py MXF/MPG broadcast format support: handle multichannel audio properly 2026-04-29 14:38:48 +00:00
subtitle.py Upgrade default Whisper model: small/medium → large-v3 for much better Slovenian/Slavic transcription accuracy 2026-04-29 08:20:18 +00:00
yt_download.py Add cookies support to yt_download.py for YouTube bot detection bypass 2026-04-28 15:47:59 +00:00