Clone this repo, and setup and activate a virtualenv:
python3 -m pip install virtualenv
python3 -m virtualenv venv
source venv/bin/activate
Then, install the dependencies:
pip install -r requirements.txt
Make a OpenAI, and ElevenLabs account and set your tokens:
export OPENAI_API_KEY=<token>
export ELEVENLABS_API_KEY=<eleven-token>
Make a new voice in Eleven and get the voice id of that voice using their get voices API, or by clicking the flask icon next to the voice in the VoiceLab tab.
export ELEVENLABS_VOICE_ID=<voice-id>
In on terminal, run the webcam capture:
python capture.py
In another terminal, run the narrator:
python narrator.py