Replies: 2 comments 5 replies
-
|
Thanks @hbredin! I'm humbled as pyannote is an amazing project that I regard very highly. I'd be happy to support you in this process. For future plans for speaker diarization I'll mostly defer to @desh2608 as he has been contributing various recipes and features in lhotse to support that. That said, if you have any suggestions for features that would better support your use-case, I'm happy to help -- depending their complexity and on my own availability, I can either suggest a few helpful pointers or code them up. |
Beta Was this translation helpful? Give feedback.
-
|
@hbredin Yeah, I don't think we have many (or any?) diarization datasets or API at the moment. What we do have is a bunch of recipes for diarization related corpora such as DIHARD, AMI, etc. so I hope it would be straightforward to create Lhotse manifests from those. The wrapper you suggest could then easily convert the manifests to |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Thanks for this great contribution to the community!
@FrenchKrab and I are considering a (progressive) switch from pyannote.database to
lhotsefor training pyannote.audio models and speaker diarization pipelines.We will most likely start by writing a wrapper around
lhotsemanifests to turn them intopyannote.databaseprotocols (more precisely pyannote.database.protocol.SpeakerDiarizationProtocol instances) and see how it goes.We would also be happy to contribute more datasets if the wrapper experiment goes well.
But, before jumping in, I wanted to ask you about your plans regarding
lhotseand speaker diarization. As of today, it seems (but I'd be more than happy to be proven wrong here) that speaker diarization datasets and API are not the main priority of lhotse.Beta Was this translation helpful? Give feedback.
All reactions