Speech Recognition Application

This is a Streamlit-based application that allows users to transcribe speech using various speech recognition APIs, including Google Speech Recognition, Wit.ai, Bing Speech, Houndify, and IBM Speech to Text.

Features

Speech Transcription: The application provides a button to start recording audio from the user's microphone. The recorded audio is then transcribed using the selected speech recognition API, and the transcription is displayed in the application.
API Selection: Users can select the speech recognition API they want to use from a dropdown menu. The available options are Google Speech Recognition, Wit.ai, Bing Speech, Houndify, and IBM Speech to Text.
Language Selection: Users can select the language they are speaking from a dropdown menu. The supported languages are English (US), French (France), Spanish (Spain), German (Germany), and Italian (Italy).
Pause and Resume: Users can pause the speech recognition process and resume it later using a checkbox in the application.
Save Transcription to File: Users can save the transcribed text to a file named transcription.txt in their home directory.

Installation

To run this application, you'll need to have the following dependencies installed:

Python 3.7 or later
Streamlit
SpeechRecognition
PyAudio

You can install these dependencies using pip:

pip install streamlit SpeechRecognition PyAudio

Usage

Clone the repository to your local machine:

git clone https://github.com/your-username/speech-recognition-app.git

Navigate to the project directory:
```
cd speech-recognition-app
```
Run the Streamlit application:
```
streamlit run app.py
```
This will launch the application in your default web browser.
In the application, select the speech recognition API, language, and whether you want to pause and resume the process.
Click the "Start Recording" button to begin transcribing your speech.
Once the transcription is complete, you can click the "Save to File" button to save the text to a file named transcription.txt in your home directory.

Configuration

To use the non-Google speech recognition APIs (Wit.ai, Bing Speech, Houndify, and IBM Speech to Text), you'll need to provide your own API credentials in the transcribe_speech() function in the speech_recognition.py file.

Replace the placeholders with your actual API credentials:

def transcribe_speech(recognition_api='google', language='en-US', pause_resume=False):
    # ...
    if recognition_api == 'wit':
        text = r.recognize_wit(audio_text, key="YOUR_WIT_AI_API_KEY")
    elif recognition_api == 'bing':
        text = r.recognize_bing(audio_text, key="YOUR_BING_SPEECH_API_KEY")
    elif recognition_api == 'houndify':
        text = r.recognize_houndify(audio_text, client_id="YOUR_HOUNDIFY_CLIENT_ID", client_key="YOUR_HOUNDIFY_CLIENT_KEY")
    elif recognition_api == 'ibm':
        text = r.recognize_ibm(audio_text, username="YOUR_IBM_SPEECH_TO_TEXT_USERNAME", password="YOUR_IBM_SPEECH_TO_TEXT_PASSWORD")
    # ...

Contributing

If you'd like to contribute to this project, please feel free to submit a pull request or open an issue on the GitHub repository.

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
__pycache__		__pycache__
README.md		README.md
app.py		app.py
new.py		new.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech Recognition Application

Features

Installation

Usage

Configuration

Contributing

License

About

Releases

Packages

Languages

segunumoru1/Speech_Recognition

Folders and files

Latest commit

History

Repository files navigation

Speech Recognition Application

Features

Installation

Usage

Configuration

Contributing

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages