Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] "Error occured while doing alignment" #77

Open
Benko111 opened this issue Apr 1, 2024 · 0 comments
Open

[BUG] "Error occured while doing alignment" #77

Benko111 opened this issue Apr 1, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@Benko111
Copy link

Benko111 commented Apr 1, 2024

Describe the bug
Not sure if this is a bug. Trying do use alignment or refinement feature it processes the task up to 100%, but then instead of saving result files it aborts and displays an error message: "Error occured while doing alignment. can only concatenate str (not "int") to str".
When I uncheck "Remove repetitions" in Result Modification settings it will finish normally, but then files will be filled with hallucination repetitions. What can I do about it?

To Reproduce
Steps to reproduce the behavior:

  1. Go to 'Tools' > 'Alignment'.
  2. Start nalignment process with txt file and audio file.
  3. Wait until processing finishes.
  4. At 100%, when trying to save results error message shows up.

Expected behavior
Clean file without repetitions by using postprocessing feature "remove repetitions" as it worked before.

Log
2024-04-01 23:23:19.852 | DEBUG | main.py:1964 [MainThread] - all_english: False
2024-04-01 23:23:19.860 | INFO | helper.py:37 [MainThread] - Checking model name
2024-04-01 23:23:19.860 | DEBUG | helper.py:38 [MainThread] - modelKey: 🐌 Large V3 [10GB VRAM] (Most Accurate), src_english: False
2024-04-01 23:23:19.860 | DEBUG | helper.py:43 [MainThread] - modelName: large-v3
2024-04-01 23:23:19.860 | DEBUG | main.py:1539 [MainThread] - Running disabler...
2024-04-01 23:23:22.095 | DEBUG | main.py:1617 [MainThread] - Running enabler...
2024-04-01 23:23:22.260 | INFO | file.py:981 [Thread-105 (mod_result)] - Start Process (MOD FILE)
2024-04-01 23:23:22.261 | DEBUG | load.py:366 [Thread-105 (mod_result)] - Mode load args get: {'device': 'cuda', 'cpu_preload': True, 'success': True}
2024-04-01 23:23:32.560 | DEBUG | load.py:439 [Thread-105 (mod_result)] - Pass kwarg:
2024-04-01 23:23:32.560 | DEBUG | load.py:440 [Thread-105 (mod_result)] - {'temperature': (0.0, 0.2, 0.4, 0.6, 0.8, 1.0), 'best_of': 3, 'beam_size': 3, 'patience': 1.0, 'compression_ratio_threshold': 2.4, 'logprob_threshold': -1.0, 'no_speech_threshold': 0.72, 'suppress_tokens': '', 'suppress_blank': True, 'initial_prompt': None, 'prefix': None, 'condition_on_previous_text': True, 'max_initial_timestamp': 1.0, 'fp16': True}
2024-04-01 23:23:32.560 | DEBUG | load.py:366 [Thread-105 (mod_result)] - Mode align args get: {'regroup': True, 'suppress_silence': True, 'suppress_word_ts': True, 'min_word_dur': 0.1, 'q_levels': 20, 'k_size': 5, 'vad': False, 'vad_threshold': 0.35, 'vad_onnx': False, 'demucs': False, 'demucs_output': None, 'demucs_options': None, 'only_voice_freq': False, 'prepend_punctuations': '"'“¿([{-', 'append_punctuations': '"'.。,,!!??::”)]}、', 'threads': 0, 'success': True}
2024-04-01 23:23:32.560 | INFO | file.py:996 [Thread-105 (mod_result)] - Model Args: {'device': 'cuda', 'cpu_preload': True, 'download_root': 'C:\Users\Ben\.cache\whisper'}
2024-04-01 23:23:32.560 | INFO | file.py:997 [Thread-105 (mod_result)] - Process Args: {'regroup': True, 'suppress_silence': True, 'suppress_word_ts': True, 'min_word_dur': 0.1, 'q_levels': 20, 'k_size': 5, 'vad': False, 'vad_threshold': 0.35, 'vad_onnx': False, 'demucs': False, 'demucs_output': None, 'demucs_options': None, 'only_voice_freq': False, 'prepend_punctuations': '"'“¿([{-', 'append_punctuations': '"'.。,,!!??::”)]}、'}
2024-04-01 23:23:32.584 | DEBUG | file.py:1111 [Thread-105 (mod_result)] - PROCESSING: ['C:/Transkription/Assets/audio/MSE600.mp3', 'C:/Transkription/Textdaten/Rohtext/MSE600.txt', 'german']
2024-04-01 23:23:32.584 | DEBUG | file.py:1140 [Thread-105 (mod_result)] - Save_name: MSE600 01-04-2024-23-23-32
2024-04-01 23:23:32.584 | DEBUG | language.py:269 [Thread-105 (mod_result)] - GETTING WHISPER LANGUAGE FROM SIMILAR LANGUAGE NAME
2024-04-01 23:23:32.584 | DEBUG | language.py:274 [Thread-105 (mod_result)] - Found key german while searching for german
2024-04-01 23:23:32.585 | DEBUG | language.py:275 [Thread-105 (mod_result)] - FULL KEY GET ['german']
2024-04-01 23:23:33.636 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 0% | ## | 0/837.41 [00:00<?, ?sec/s]
2024-04-01 23:23:35.679 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 1% | ## | 7.46/837.41 [00:02<03:47, 3.65sec/s]
2024-04-01 23:23:36.266 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 2% | ## | 18.74/837.41 [00:02<01:40, 8.18sec/s]
2024-04-01 23:23:36.837 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 5% | ## | 45.38/837.41 [00:03<00:41, 19.26sec/s]
2024-04-01 23:23:37.401 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 9% | ## | 73.16/837.41 [00:03<00:27, 27.93sec/s]
2024-04-01 23:23:37.987 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 12% | ### | 99.6/837.41 [00:04<00:22, 33.08sec/s]
2024-04-01 23:23:38.558 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 15% | ### | 126.04/837.41 [00:04<00:19, 36.98sec/s]
2024-04-01 23:23:39.129 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 18% | ### | 150.42/837.41 [00:05<00:17, 38.68sec/s]
2024-04-01 23:23:39.709 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 21% | ### | 179.42/837.41 [00:06<00:15, 42.09sec/s]
2024-04-01 23:23:40.280 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 24% | ### | 203.82/837.41 [00:06<00:14, 42.28sec/s]
2024-04-01 23:23:40.851 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 27% | ### | 230.02/837.41 [00:07<00:14, 43.36sec/s]
2024-04-01 23:23:41.415 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 30% | ### | 254.34/837.41 [00:07<00:13, 43.28sec/s]
2024-04-01 23:23:41.986 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 33% | ### | 277.44/837.41 [00:08<00:13, 42.43sec/s]
2024-04-01 23:23:42.573 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 36% | ### | 298.42/837.41 [00:08<00:13, 40.40sec/s]
2024-04-01 23:23:43.143 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 39% | ### | 324.7/837.41 [00:09<00:12, 42.08sec/s]
2024-04-01 23:23:43.723 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 41% | ### | 346.26/837.41 [00:10<00:12, 40.60sec/s]
2024-04-01 23:23:44.295 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 44% | ### | 371.22/837.41 [00:10<00:11, 41.52sec/s]
2024-04-01 23:23:44.865 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 47% | ### | 394.98/837.41 [00:11<00:10, 41.56sec/s]
2024-04-01 23:23:45.429 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 50% | ### | 417.96/837.41 [00:11<00:10, 41.31sec/s]
2024-04-01 23:23:46.000 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 53% | ### | 445.98/837.41 [00:12<00:08, 43.65sec/s]
2024-04-01 23:23:46.587 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 56% | ### | 468.24/837.41 [00:12<00:08, 41.91sec/s]
2024-04-01 23:23:47.157 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 59% | ### | 491.76/837.41 [00:13<00:08, 41.70sec/s]
2024-04-01 23:23:47.737 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 62% | ### | 520.48/837.41 [00:14<00:07, 44.06sec/s]
2024-04-01 23:23:48.302 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 65% | ### | 544.0/837.41 [00:14<00:06, 43.34sec/s]
2024-04-01 23:23:48.879 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 67% | ### | 564.68/837.41 [00:15<00:06, 41.09sec/s]
2024-04-01 23:23:49.465 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 70% | ### | 586.9/837.41 [00:15<00:06, 40.11sec/s]
2024-04-01 23:23:50.030 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 73% | ### | 611.96/837.41 [00:16<00:05, 41.38sec/s]
2024-04-01 23:23:50.616 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 76% | ### | 634.5/837.41 [00:16<00:05, 40.48sec/s]
2024-04-01 23:23:51.187 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 79% | ### | 663.3/837.41 [00:17<00:04, 43.45sec/s]
2024-04-01 23:23:51.767 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 82% | ### | 687.08/837.41 [00:18<00:03, 42.71sec/s]
2024-04-01 23:23:52.342 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 85% | ### | 710.84/837.41 [00:18<00:02, 42.30sec/s]
2024-04-01 23:23:52.909 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 88% | ### | 734.68/837.41 [00:19<00:02, 42.23sec/s]
2024-04-01 23:23:53.495 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 91% | ### | 761.2/837.41 [00:19<00:01, 43.14sec/s]
2024-04-01 23:23:54.060 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 94% | ### | 784.32/837.41 [00:20<00:01, 42.50sec/s]
2024-04-01 23:23:54.646 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 96% | ### | 807.92/837.41 [00:21<00:00, 41.81sec/s]
2024-04-01 23:23:55.186 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 97% | ### | 815.98/837.41 [00:21<00:00, 34.12sec/s]
2024-04-01 23:23:55.186 | INFO | _logging.py:55 [Thread-108 (run_mod)] - Align: 100% | #### | 837.41/837.41 [00:21<00:00, 38.86sec/s]
2024-04-01 23:23:57.959 | ERROR | file.py:1279 [Thread-105 (mod_result)] - Error occured while doing alignment
2024-04-01 23:23:57.959 | ERROR | file.py:1280 [Thread-105 (mod_result)] - can only concatenate str (not "int") to str
Traceback (most recent call last):

File "D:\Codes_Projects\Python\Speech-Translate\speech_translate\utils\audio\file.py", line 1229, in mod_result

File "D:\Codes_Projects\Python\Speech-Translate.venv\Lib\site-packages\stable_whisper\result.py", line 1632, in remove_repetition

TypeError: can only concatenate str (not "int") to str
2024-04-01 23:24:26.216 | INFO | main.py:2008 [Thread-105 (mod_result)] - Stopping alignment...
2024-04-01 23:24:26.249 | INFO | main.py:2023 [Thread-105 (mod_result)] - Stopped

Desktop (please complete the following information):

  • OS: Windows 10
  • App Installation version: 1.3.10 GPU-Version
  • App / Python version: ...

Additional context
It did work like a charm before. After playingf with repetition settings it stopped working and the behaviour I described appeared. Even after setting bacl all default settings, it won't work any more. Thanks for any help.

@Benko111 Benko111 added the bug Something isn't working label Apr 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant