This project leverages Groq's API and LLAMA 3.2 to provide lightning-fast, cost-effective AI solutions for image analysis and audio transcription.
AI Image Analysis

- Upload any image and receive instant AI-powered insights
- Enhance your workflow with cutting-edge image analysis
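Under the hood, image analysis goes through Groq's OpenAI-compatible API. The snippet below is a minimal sketch of such a call rather than this repository's actual code; it assumes the `openai` npm package, the environment variables documented further down (`GROQ_API_KEY`, `GROQ_BASE_URL`, `MODEL_VISION`), and a hypothetical `describeImage` helper.

```typescript
// Minimal sketch of an image-analysis call to Groq's OpenAI-compatible API.
// Not this project's actual code; the helper name and prompt are illustrative.
import OpenAI from "openai";
import { readFile } from "node:fs/promises";

const client = new OpenAI({
  apiKey: process.env.GROQ_API_KEY,
  baseURL: process.env.GROQ_BASE_URL ?? "https://api.groq.com/openai/v1",
});

export async function describeImage(path: string): Promise<string> {
  // Send the image inline as a base64 data URL alongside a text prompt
  // (JPEG mime type assumed here for simplicity).
  const base64 = (await readFile(path)).toString("base64");

  const response = await client.chat.completions.create({
    model: process.env.MODEL_VISION ?? "llama-3.2-11b-vision-preview",
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: "Describe this image and point out anything notable." },
          { type: "image_url", image_url: { url: `data:image/jpeg;base64,${base64}` } },
        ],
      },
    ],
  });

  return response.choices[0].message.content ?? "";
}
```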
Audio Transcription

- Upload audio files for quick and accurate transcription
- Powered by advanced language models for high-quality results
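The README does not spell out the transcription pipeline, so the following is an illustration only: Groq also exposes an OpenAI-compatible `/audio/transcriptions` endpoint, and a call to it might look like this. The `whisper-large-v3` model name and the `transcribeAudio` helper are assumptions, not values configured by this project.

```typescript
// Illustrative sketch of audio transcription against Groq's OpenAI-compatible API.
// Not this project's actual code; the model name below is an assumption.
import OpenAI from "openai";
import fs from "node:fs";

const client = new OpenAI({
  apiKey: process.env.GROQ_API_KEY,
  baseURL: process.env.GROQ_BASE_URL ?? "https://api.groq.com/openai/v1",
});

export async function transcribeAudio(path: string): Promise<string> {
  // Stream the audio file to the transcription endpoint and return the text.
  const transcription = await client.audio.transcriptions.create({
    file: fs.createReadStream(path),
    model: "whisper-large-v3",
  });
  return transcription.text;
}
```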
- AI Backend: Groq API
- Language Model: LLAMA 3.2
- Frontend: Next.js with React (see the sketch below for how these pieces fit together)
- Super Fast: Utilizes Groq's high-performance API for rapid results
- Cost-Effective: LLAMA 3.2 provides state-of-the-art performance at a fraction of the cost
- Versatile: Handles both image analysis and audio transcription in one application
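To show how the stack above hangs together, here is a hypothetical Next.js route handler that accepts an uploaded image and forwards it to Groq. The route path, form field name, and implementation are assumptions for illustration, not code from this repository.

```typescript
// Hypothetical route handler, e.g. app/api/analyze/route.ts (path assumed, not from this repo).
import { NextResponse } from "next/server";
import OpenAI from "openai";

const client = new OpenAI({
  apiKey: process.env.GROQ_API_KEY,
  baseURL: process.env.GROQ_BASE_URL ?? "https://api.groq.com/openai/v1",
});

export async function POST(request: Request) {
  // Expect the upload in multipart form data under an "image" field (assumed name).
  const form = await request.formData();
  const image = form.get("image");
  if (!(image instanceof File)) {
    return NextResponse.json({ error: "No image provided" }, { status: 400 });
  }

  // Inline the upload as a base64 data URL for the vision model.
  const base64 = Buffer.from(await image.arrayBuffer()).toString("base64");

  const completion = await client.chat.completions.create({
    model: process.env.MODEL_VISION ?? "llama-3.2-11b-vision-preview",
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: "Analyze this image." },
          { type: "image_url", image_url: { url: `data:${image.type};base64,${base64}` } },
        ],
      },
    ],
  });

  return NextResponse.json({ result: completion.choices[0].message.content });
}
```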
To run the project locally you will need:

- Node.js (v14 or later)
- pnpm
- A Groq API key
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/ai-image-audio-analyzer.git
  cd ai-image-audio-analyzer
  ```

- Install dependencies:

  ```bash
  pnpm install
  ```

- Copy the `.env.example` file to `.env.local` and fill in your API keys:

  ```bash
  cp .env.example .env.local
  ```

- Edit `.env.local` and add your Groq API key and other necessary credentials.

- Start the development server:

  ```bash
  pnpm dev
  ```

- Open http://localhost:3000 in your browser.
These environment variables are configured in `.env.local`:

- `GROQ_API_KEY`: Your Groq API key
- `GROQ_BASE_URL`: Groq API base URL (default: https://api.groq.com/openai/v1)
- `MODEL_VISION`: The vision model to use (default: "llama-3.2-11b-vision-preview")
- `NEXT_PUBLIC_VOICE_FAST_MODEL`: The fast voice model (default: "llama-3.2-3b-preview")
- `NEXT_PUBLIC_SERVER_URL`: Your server URL (default: "http://localhost:3000")
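For reference, a filled-in `.env.local` built from the defaults above could look like this; the API key value is a placeholder, so use your own.

```bash
# Example .env.local (values mirror the documented defaults; the key is a placeholder)
GROQ_API_KEY=gsk_your_key_here
GROQ_BASE_URL=https://api.groq.com/openai/v1
MODEL_VISION=llama-3.2-11b-vision-preview
NEXT_PUBLIC_VOICE_FAST_MODEL=llama-3.2-3b-preview
NEXT_PUBLIC_SERVER_URL=http://localhost:3000
```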
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.