End to end audio transcription and information extraction from audio input.
- Ensure you have the Required Dependencies listed below
- Open your terminal, navigate to our main project directory (Audio-Analyzer-and-Transcription-Tool) and type: sh scripts/run.sh scripts/run
To run our code, the user must:
- Download our repo
- Install the latest versions of the following:
- Node.js
- python3
- pip3 (if not included in your python3 download)
- ffmpeg
- Install additional dependencies listed below. To let our scripts do this for you automatically, open your terminal, navigate to our main project directory (Audio-Analyzer-and-Transcription-Tool) and type: sh scripts/setup.sh scripts/setup
- Node.js installs: all packages within react-ui/package-lock.json ("npm install")
- Python3 installs: numpy, flask, flask-cors, python-dotenv, torch, torchaudio, openai-whisper, soundfile, librosa, and https://github.com/huggingface/transformers
(node packages go into react-ui/node_modules/; python packages go into virtual environment created in flask-backend/src/.venv/; nothing outside of our project folder is touched)
Easily select a file or folder with our file selection dialog, then click Transcribe! Your outputs will appear shortly, though longer inputs may take a few minutes to process.
By default, the selected files are sent for batched transcription, which is the fastest option available if you have GPU access. However, we provide several optional features which can help significantly in certain circumstances:
Select the Save Outputs checkbox when in GPU mode to automatically save each transcript in a designated output folder; CPU support coming soon. Output folder path can be a fully qualified path to a new location, or simply a folder name, which will be placed in the project directory folder.
coming soon
coming soon