Replies: 2 comments 5 replies
-
Thanks @hbredin! I'm humbled as pyannote is an amazing project that I regard very highly. I'd be happy to support you in this process. For future plans for speaker diarization I'll mostly defer to @desh2608 as he has been contributing various recipes and features in lhotse to support that. That said, if you have any suggestions for features that would better support your use-case, I'm happy to help -- depending their complexity and on my own availability, I can either suggest a few helpful pointers or code them up. |
Beta Was this translation helpful? Give feedback.
-
@hbredin Yeah, I don't think we have many (or any?) diarization datasets or API at the moment. What we do have is a bunch of recipes for diarization related corpora such as DIHARD, AMI, etc. so I hope it would be straightforward to create Lhotse manifests from those. The wrapper you suggest could then easily convert the manifests to |
Beta Was this translation helpful? Give feedback.
-
Thanks for this great contribution to the community!
@FrenchKrab and I are considering a (progressive) switch from pyannote.database to
lhotse
for training pyannote.audio models and speaker diarization pipelines.We will most likely start by writing a wrapper around
lhotse
manifests to turn them intopyannote.database
protocols (more precisely pyannote.database.protocol.SpeakerDiarizationProtocol instances) and see how it goes.We would also be happy to contribute more datasets if the wrapper experiment goes well.
But, before jumping in, I wanted to ask you about your plans regarding
lhotse
and speaker diarization. As of today, it seems (but I'd be more than happy to be proven wrong here) that speaker diarization datasets and API are not the main priority of lhotse.Beta Was this translation helpful? Give feedback.
All reactions