Skip to content

Itz-Sidra/Speaker-Label

Repository files navigation

Speaker Separation & Transcription Web App

This is a Streamlit-based web application for performing speaker separation and transcription on audio files using AssemblyAI's API. The app processes audio files, identifies multiple speakers, and transcribes the content with timestamps and speaker labels.

Features

  • Speaker Separation: Diarizes audio to identify individual speakers.
  • Transcription: Provides a text transcription of the audio with timestamps.
  • Audio Segment Extraction: Allows playback of specific audio segments for individual speakers.
  • Speaker Statistics: Displays word count and speaking duration for each speaker.

Requirements

  • Python 3.8+
  • AssemblyAI API Key

File Structure

  • app.py: Main application script.
  • requirements.txt: Python dependencies.
  • .streamlit/secrets.toml: Configuration file for secrets (e.g., API keys).

Key Libraries Used

  • Streamlit: For building the interactive UI.
  • Pydub: For audio processing and segment extraction.
  • Requests: For API communication with AssemblyAI.

API Integration

The app integrates with the AssemblyAI API, leveraging the following endpoints:

  • /v2/upload: Uploads audio files for processing.
  • /v2/transcript: Requests transcription with speaker labels.
  • /v2/transcript/{id}: Polls the transcription status and retrieves the results.

Deployment:

Demo

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages