- Can I apply pretrained pipelines on audio already loaded in memory?
- Can I use gated models (and pipelines) offline?
- Does pyannote support streaming speaker diarization?
- How can I improve performance?
- How does one spell and pronounce pyannote.audio?
Yes: read this tutorial until the end.
Short answer: yes, see this tutorial for models and that one for pipelines.
Long answer: gating models and pipelines allows me to learn a bit more about the pyannote.audio user base, and eventually helps me write grant proposals to make pyannote.audio even better. So, please fill in the gating forms as precisely as possible.
For instance, before gating pyannote/speaker-diarization, I had no idea that so many people were relying on it in production. Hint: sponsors are more than welcome! Maintaining open source libraries is time consuming.
That being said, this whole authentication process does not prevent you from using official pyannote.audio models offline (i.e. without going through the authentication process in every `docker run ...` or whatever you are using in production): see this tutorial for models and that one for pipelines.
Short answer: not out of the box, no.
Long answer: I am looking for sponsors to add this feature. In the meantime, diart is the closest you can get to a streaming pyannote.audio. You might also be interested in this blog post about streaming voice activity detection based on pyannote.audio.
Long answer:
- Manually annotate dozens of conversations as precisely as possible.
- Separate them into train (80%), development (10%) and test (10%) subsets.
- Set up the data for use with pyannote.database.
- Follow this recipe.
- Enjoy.
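As a hedged sketch of the pyannote.database step above, a minimal `database.yml` might look like this; the database name, paths, and protocol name are hypothetical, and the exact schema may differ across versions:

```yaml
# Hypothetical layout: one database, one speaker diarization protocol
# with the train/development/test split described above.
Databases:
  MyDatabase: /path/to/audio/{uri}.wav

Protocols:
  MyDatabase:
    SpeakerDiarization:
      MyProtocol:
        train:
          uri: lists/train.lst          # one file identifier per line
          annotation: rttm/train.rttm   # manual speaker annotations
        development:
          uri: lists/development.lst
          annotation: rttm/development.rttm
        test:
          uri: lists/test.lst
          annotation: rttm/test.rttm
```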
Also: I am available for contracting to help you with that.
📝 Written in lower case: pyannote.audio (or pyannote if you are lazy). Not PyAnnote nor PyAnnotate (sic).
📢 Pronounced like the French verb pianoter. pi like in piano, not py like in python.
🎹 pianoter means to play the piano (hence the logo 🤯).
Generated by FAQtory