Fusion Vision

A real-time object detection system with voice narration capabilities. The aim is to create a Social Media Platform for the visually impaired where they can upload their photos and get the description of the image in voice. Also they can share their photos with their friends and family and checkout the photos from others and know the updates of their surroundings.

Features

Real-time object detection using TensorFlow and YOLOv8
Voice narration of detected objects
Web-based interface for easy access
Unique Social Media Platform for the visually impaired
Unique profiles and verified users

Getting Started

Follow these instructions to set up the project on your local machine.

Prerequisites

Ensure you have the following installed:

Python (>=3.7)
Node.js (for web dependencies)

Installation

Clone the repository:

git clone https://github.com/yourusername/fusion-vision.git
cd fusion-vision

Install Python dependencies:

Create a virtual environment and activate it:

python -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`

Install the required Python packages:

pip install -r requirements.txt

Install Node.js dependencies:

Navigate to the web directory and install dependencies:
```
cd web
npm install
```

Dependencies

Core Requirements

TensorFlow (>=2.0.0)
- Deep learning framework for object detection
- Used for running the COCO-SSD model
OpenCV (>=4.8.0)
- Computer vision library
- Handles camera input and image processing
YOLOv8
- Advanced object detection model
- Provides enhanced accuracy

Web Technologies

HTML5
- Camera API
- Canvas for drawing
- Speech synthesis
JavaScript
- TensorFlow.js
- WebRTC for camera access
- Speech synthesis API

Browser Requirements

Modern web browser with:
- WebRTC support
- JavaScript enabled
- Web Speech API support

Accessibility Permissions

To ensure the application functions correctly, you need to grant the following permissions in your web browser:

Camera Access: Required for capturing images and performing real-time object detection.
Microphone Access: Needed if the application includes voice input features.
Speech Synthesis: Ensure your browser allows speech synthesis for voice narration of detected objects.

Make sure to check your browser settings and grant these permissions when prompted.

Running the Application

Start the backend server:
```
python app.py
```
Launch the web interface:

Open index.html in a modern web browser.

Screenshots

Here are some screenshots of the application:

Contact Page
Homepage
Permission Request

Open Source Programs featuring Fusion Vision

Social Winter Of Code 2025

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
config		config
.DS_Store		.DS_Store
.env		.env
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
FusionVision_BG.jpg		FusionVision_BG.jpg
FusionVision_BG_blurred.jpg		FusionVision_BG_blurred.jpg
Logo.png		Logo.png
README.md		README.md
Terms.html		Terms.html
User.js		User.js
UserClass.js		UserClass.js
about.html		about.html
auth.js		auth.js
camera.html		camera.html
camera.js		camera.js
capture.js		capture.js
contact.html		contact.html
contact.jpg		contact.jpg
data.sql		data.sql
db.js		db.js
features.html		features.html
homepage.jpg		homepage.jpg
index.html		index.html
models.js		models.js
object-detection.js		object-detection.js
package-lock.json		package-lock.json
package.json		package.json
permission.jpg		permission.jpg
privacyp.html		privacyp.html
requirements.txt		requirements.txt
reviews.html		reviews.html
script.js		script.js
server.js		server.js
styles.css		styles.css
swoc.jpg		swoc.jpg
whoweare.html		whoweare.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fusion Vision

Features

Getting Started

Prerequisites

Installation

Dependencies

Core Requirements

Web Technologies

Browser Requirements

Accessibility Permissions

Running the Application

Screenshots

Open Source Programs featuring Fusion Vision

Contributing

About

Releases

Packages

Languages

shivenyadavs/FusionVision

Folders and files

Latest commit

History

Repository files navigation

Fusion Vision

Features

Getting Started

Prerequisites

Installation

Dependencies

Core Requirements

Web Technologies

Browser Requirements

Accessibility Permissions

Running the Application

Screenshots

Open Source Programs featuring Fusion Vision

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages