You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for the recent development of long-form audio speaker diarization in NVIDIA/NeMo#7737. Recently I encounter a 4-hour-long audio and observe OOM on RAM (not VRAM).
It happens after screen prints the last iteration of "Extracting embeddings for Diarization" and the program consumes more than 64GB memory when I observe job getting killed. FYI,
[NeMo I 2023-11-19 20:54:29 clustering_diarizer:343] Extracting embeddings for Diarization
[NeMo I 2023-11-19 20:54:29 collections:445] Filtered duration for loading collection is 0.00 hours.
[NeMo I 2023-11-19 20:54:29 collections:446] Dataset loaded with 52949 items, total duration of 7.25 hours.
[NeMo I 2023-11-19 20:54:29 collections:448] # 52949 files loaded accounting to # 1 labels
Hi,
Thanks for the recent development of long-form audio speaker diarization in NVIDIA/NeMo#7737. Recently I encounter a 4-hour-long audio and observe OOM on RAM (not VRAM).
It happens after screen prints the last iteration of "Extracting embeddings for Diarization" and the program consumes more than 64GB memory when I observe job getting killed. FYI,
My telephonic config file:
The text was updated successfully, but these errors were encountered: