reels-app/scripts
Sebastjan Artič 90cdad516b Universal chorus selection: chorus mandatory, pre-chorus only natural extension
User feedback: 'REFREN je obvezen, pre-chorus opcijsko' + 'sistem mora biti
stabilen za vse jezike, tudi španščino in romunščino'.

Two changes:

1. Web search is now MANDATORY first step (was: optional fallback):
   - Even if Claude thinks it knows the song, must search lyrics first
   - Universal lyrics sources by language:
     SLO: besedila.com, lyricstranslate.com
     DE: songtexte.com
     HR/SR/BS: tekstovi.net
     ES: letras.com, musica.com
     RO: versuri.ro
     IT: angolotesti.it
     FR: paroles.net
     EN: genius.com, azlyrics.com
     Universal: lyricstranslate.com (any language)
   - Search strategy: artist+title first, then transcript snippet fallback
   - Without lyrics, Claude cannot reliably identify chorus boundaries

2. Simplified selection rules - chorus is THE priority:
   - Chorus (full first occurrence) = MANDATORY
   - Pre-chorus = ONLY if 1-2 verse lines tightly connected to chorus
   - In doubt: just take chorus alone (12-25s)
   - Outro fillers explicitly multi-language:
     SLO 'aj ja ja' / 'ej ej ej'
     EN 'yeah' / 'oh oh'
     ES 'ay ay ay'
     RO 'hei hei'
     JA 'la la la'
   - 12-35s total range (was 15-35s, now allows shorter chorus-only clips)

This makes the system language-agnostic: works the same way for Slovenian
narodno-zabavna, Spanish reggaeton, Romanian manele, German Schlager, etc.
The lyrics lookup is what makes it stable across languages.
2026-04-29 13:36:34 +00:00
..
analyze.py Universal chorus selection: chorus mandatory, pre-chorus only natural extension 2026-04-29 13:36:34 +00:00
clip.py Upgrade default Whisper model: small/medium → large-v3 for much better Slovenian/Slavic transcription accuracy 2026-04-29 08:20:18 +00:00
find_chorus.py Find chorus: weight repetitive short phrases (like 'Ohne dich x5') as strong chorus signal 2026-04-28 16:57:45 +00:00
reframe.py Major: smart selection pipeline (analyze.py) + audio fade + multi-lang auto-detect 2026-04-29 06:21:35 +00:00
subtitle.py Upgrade default Whisper model: small/medium → large-v3 for much better Slovenian/Slavic transcription accuracy 2026-04-29 08:20:18 +00:00
yt_download.py Add cookies support to yt_download.py for YouTube bot detection bypass 2026-04-28 15:47:59 +00:00