An interactive web application that enables real-time speech recognition and AI-powered responses, creating a seamless voice-based interaction between users and AI.
- Real-time speech recognition using whisper.cpp
- Natural language processing with AI model integration
- Voice synthesis for AI responses (Kokoro/ F5 TTS)
- Responsive web design for all devices
- Low-latency communication
- Simple and intuitive user interface
- Clone the repository:
git clone https://github.com/erturkkadir/hanutchu.git
cd hanutchu
- Frontend: vanilla js
- Speech Recognition: whisper.cpp
- Voice Synthesis: vioce activity detection
- AI Integration: ollama with any model
- Styling: Bootstrap CSS
- Development: Python. Websokcet/WebRTC,
- Modern web browser with microphone and.or webcam
- Active internet connection
- Speech Recognition: The application listens to user speech input using basic browser functions
- Text Processing: Converts speech to text and processes it for AI model consumption
- AI Processing: Sends processed text to AI model and receives response
- Voice Synthesis: Converts AI response to speech using S2T model (kokoro, F5)
- User Interface: Updates UI with transcription and response in real-time
We welcome contributions! Please follow these steps:
- Fork the repository
- Create a feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
- Follow the existing code style and conventions
- Write clear commit messages
- Add tests for new features
- Update documentation as needed
This project is licensed under the Apacahe 2.0 License - see the LICENSE file for details.
- Add support for multiple languages
- Implement conversation history
- Add voice customization options
- Create offline mode capabilities
- Improve error handling and recovery
- Add unit and integration tests
If you encounter any issues or have questions:
- Check the Issues page
- Create a new issue if your problem isn't already listed
- Join our Discord community
- llama.cpp / whisper.cpp GGerganov for their amazing work
- ollama team for their excellent API
- All our contributors and supporters
Made with ❤️ by [KadirErturk]