- 3-sample voting for auto-detect (start/middle/end of song) prevents lang switching mid-song - Lock detected language for full transcription - Anti-hallucination: condition_on_previous_text=False, temperature=0.0 - compression_ratio_threshold=2.4 (rejects repetitive hallucinations) - log_prob_threshold=-1.0 (rejects low-confidence segments) - no_speech_threshold=0.6 (more aggressive silence detection) - Default Whisper model changed: small → medium (better for all langs incl. Slavic) |
||
|---|---|---|
| .. | ||
| analyze.py | ||
| clip.py | ||
| find_chorus.py | ||
| reframe.py | ||
| subtitle.py | ||
| yt_download.py | ||