whisperX to create transcripts #28

JvSdv · 2023-01-03T23:00:35Z

PLEASE ONLY SUBMIT PULL REQUESTS FOR BUG FIXES OR LEGITIMATELY USEFUL ADDITIONAL FUNCTIONALITY

I know many of you are enthusiastic to help but I don't want this project to become bloated

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

Proposed Changes

Add transcription of videos in english using whisperx an upgrade of the original whisper, with well defined timestamsps

Additional Info

Check out how to use whisperx with other languages and how to use it at (https://github.com/m-bain/whisperX)

Checklist:

I have commented my code, particularly in hard-to-understand areas
I have tested that the code works

Required:

By contributing to this project, you agree to the terms of the GPLv3 license, and agree to grant the project owner the right to also provide or sell this software, including your contribution, to anyone under any other license, with no compensation to you.

…d-Dubs

sofiadparamo

The WisperX implementation looks good to me! But I see some problems with the other files.

sofiadparamo · 2023-01-04T03:07:00Z

audio_builder.py

@@ -26,6 +27,12 @@
 cloudConfig = configparser.ConfigParser()
 cloudConfig.read('cloud_service_settings.ini')

+# Get the video file name and create the output folder based on the original video file name
+originalVideoFile = os.path.abspath(batchConfig['SETTINGS']['original_video_file_path'].strip("\""))


I see some calls to the settings file with original_video_file_path, however, the config file remains unchanged, I think that creating a commit with these new settings and a default value would avoid some unexpected behavior on users that do not look into the code

I think it's best to remove this for now

sofiadparamo · 2023-01-04T03:11:10Z

main.py

@@ -72,6 +72,11 @@
 originalVideoFile = os.path.abspath(batchConfig['SETTINGS']['original_video_file_path'].strip("\""))
 srtFile = os.path.abspath(batchConfig['SETTINGS']['srt_file_path'].strip("\""))

+# Create the output folder based on the original video file name
+fileName = os.path.basename(originalVideoFile).split(".")[0]


Isn't this the same as the code in audio_builder.py?

Also, there's already some code to handle the creation of output folders in Line 479 of the main file, in my opinion, moving this implementation there (or replacing the existing one) would be a better fit for this code.

ThioJoe · 2023-01-04T14:33:17Z

Instead of adding this to requirements.txt, I'll probably want to implement it optionally somehow. Maybe add a message to be displayed when the script is run saying how to install it.
Mostly because it would require downloading big files like for the model, and also not everyone would have a GPU that would even work with it.

JvSdv added 7 commits January 2, 2023 20:57

whisperx init

7286923

video location

8eed0b8

Merge branch 'main' of https://github.com/JvSdv/Auto-Synced-Translate…

4d8fc0d

…d-Dubs

separating audio from video and transcribing

b26881f

Output folder based on the original videofile name

c72b126

Optional Whisperx

9f055e8

Merge branch 'main' of https://github.com/JvSdv/Auto-Synced-Translate…

3862301

…d-Dubs

sofiadparamo reviewed Jan 4, 2023

View reviewed changes

JvSdv added 2 commits January 4, 2023 01:10

print output, and remove video folder output

06a6db5

remove re import

55db431

JvSdv and others added 5 commits January 4, 2023 14:47

verify if the transcription is done

b3c066b

Video anywhere and Variables

4f9e6cd

update readme and requirements

3004db5

Merge branch 'main' into main

e1e0a4c

Merge branch 'ThioJoe:main' into main

d395401

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

whisperX to create transcripts #28

whisperX to create transcripts #28

JvSdv commented Jan 3, 2023

sofiadparamo left a comment

sofiadparamo Jan 4, 2023

JvSdv Jan 4, 2023

sofiadparamo Jan 4, 2023

ThioJoe commented Jan 4, 2023

whisperX to create transcripts #28

Are you sure you want to change the base?

whisperX to create transcripts #28

Conversation

JvSdv commented Jan 3, 2023

PLEASE ONLY SUBMIT PULL REQUESTS FOR BUG FIXES OR LEGITIMATELY USEFUL ADDITIONAL FUNCTIONALITY

I know many of you are enthusiastic to help but I don't want this project to become bloated

Type of change

Proposed Changes

Additional Info

Checklist:

Required:

sofiadparamo left a comment

Choose a reason for hiding this comment

sofiadparamo Jan 4, 2023

Choose a reason for hiding this comment

JvSdv Jan 4, 2023

Choose a reason for hiding this comment

sofiadparamo Jan 4, 2023

Choose a reason for hiding this comment

ThioJoe commented Jan 4, 2023