diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Dataset/Dataset- Explanations.txt b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Dataset/Dataset- Explanations.txt new file mode 100644 index 000000000..93df705a2 --- /dev/null +++ b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Dataset/Dataset- Explanations.txt @@ -0,0 +1,53 @@ +Detailed Description of the Dataset for Drone Navigation Project +1. Environment State Representation +The environment for the drone navigation task is modeled as a 2D grid (10x10) where each cell is free space, an obstacle, or the target. The key components are: + +Free Space: Cells where the drone can move freely because they contain no obstacle. + +Obstacles: These are fixed points on the grid that the drone must avoid to prevent collisions. In this project, obstacles are defined as specific coordinates: + +Example: +Obstacle 1: (6, 6) +Obstacle 2: (7, 7) +Target: This is the destination that the drone aims to reach. An episode is considered successful when the drone's position equals the target position. + +Example: +Target Position: (8, 8) +2. State Space +The state of the drone is represented using a 1D NumPy array with two elements, denoting the drone's current position on the grid: + +state[0]: Represents the x-coordinate (horizontal position) of the drone. +state[1]: Represents the y-coordinate (vertical position) of the drone. +The observation space is bounded from 0 to 10 on each axis. Positions are clipped to this range after every move, so the drone cannot leave the grid. + +3. Action Space +The available actions for the drone are discrete movements within the grid. 
Each action corresponds to a direction the drone can move: + +0: Up (increases y-coordinate) +1: Down (decreases y-coordinate) +2: Left (decreases x-coordinate) +3: Right (increases x-coordinate) +4: Up-Right (increases both x and y coordinates) +5: Up-Left (decreases x and increases y coordinates) +6: Down-Right (increases x and decreases y coordinates) +7: Down-Left (decreases both x and y coordinates) +This action space allows for basic directional movements, enabling the drone to navigate towards its target while avoiding obstacles. + +4. Sample Data +While the environment is not reliant on external datasets, the positions of obstacles and the target can be treated as parameters that define the specific scenario of the navigation task. Below are examples of the parameters used in the project: + +Initial State: The drone starts at position (5, 5). +Obstacles: [(6, 6), (7, 7)] +Target Position: (8, 8) +This setup allows for a controlled testing environment where various navigation strategies can be implemented and evaluated. + +5. Data Generation +The grid layout, positions of obstacles, and the target location are configurable parameters that can be adjusted to create different scenarios for testing the drone's navigation algorithm. The drone can be tested in various configurations to analyze its performance in navigating towards the target while avoiding collisions. + +6. Future Dataset Enhancements +In future iterations of this project, there are several potential enhancements that can be made to the dataset: + +Dynamic Obstacles: Introducing moving obstacles that change positions over time, simulating more realistic navigation challenges. +Variable Target Locations: Allowing the target position to change during the task to test the drone's adaptability and decision-making. +Real-World Data: Integrating real-world datasets (such as GPS coordinates or aerial maps) to enhance the environment's complexity and realism. 
+Multiple Drones: Expanding the project to include multiple drones navigating the same environment, which could lead to more complex scenarios and interactions. diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Drone_Navigation_Detection.ipynb b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Drone_Navigation_Detection.ipynb new file mode 100644 index 000000000..227f408d6 --- /dev/null +++ b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Drone_Navigation_Detection.ipynb @@ -0,0 +1,935 @@ +{ + "nbformat": 4, + "nbformat_minor": 0, + "metadata": { + "colab": { + "provenance": [] + }, + "kernelspec": { + "name": "python3", + "display_name": "Python 3" + }, + "language_info": { + "name": "python" + } + }, + "cells": [ + { + "cell_type": "markdown", + "source": [ + "**Part 1: Custom Drone Navigation Environment**\n", + "\n", + "This code sets up a custom environment for a drone that can move in 8 possible directions. It uses the Gym framework to manage actions, states, rewards, and termination conditions." 
+ ], + "metadata": { + "id": "o4DN_tEg2jvr" + } + }, + { + "cell_type": "code", + "execution_count": 1, + "metadata": { + "id": "HEk1YPjS2alK" + }, + "outputs": [], + "source": [ + "import gym\n", + "import numpy as np\n", + "import random\n", + "\n", + "# Define a custom environment for drone navigation\n", + "class DroneEnv(gym.Env):\n", + " def __init__(self):\n", + " super(DroneEnv, self).__init__()\n", + " self.action_space = gym.spaces.Discrete(8) # 8 possible actions\n", + " self.observation_space = gym.spaces.Box(low=0, high=10, shape=(2,), dtype=np.float32)\n", + " self.state = np.array([5, 5]) # Start position\n", + " self.target = np.array([8, 8]) # Target position\n", + " self.obstacles = [np.array([6, 6]), np.array([7, 7])] # Obstacles\n", + " self.epsilon = 0.1 # Exploration rate\n", + "\n", + " def reset(self):\n", + " self.state = np.array([5, 5])\n", + " return self.state\n", + "\n", + " def step(self, action):\n", + " if random.random() < self.epsilon: # Exploration\n", + " action = self.action_space.sample()\n", + "\n", + " # Define 8-direction movement\n", + " movements = [(0, 1), (0, -1), (-1, 0), (1, 0), (1, 1), (-1, 1), (1, -1), (-1, -1)]\n", + " move = movements[action]\n", + " self.state = np.clip(self.state + move, 0, 10) # Ensure within bounds\n", + "\n", + " # Check for collisions and termination conditions\n", + " reward = -1\n", + " done = False\n", + " if any(np.array_equal(self.state, obs) for obs in self.obstacles):\n", + " reward = -10 # Penalty for hitting an obstacle\n", + " done = True\n", + " elif np.array_equal(self.state, self.target):\n", + " reward = 10 # Reward for reaching the target\n", + " done = True\n", + "\n", + " return self.state, reward, done, {}\n", + "\n", + " def render(self):\n", + " print(f\"Drone Position: {self.state}, Target: {self.target}, Obstacles: {self.obstacles}\")\n" + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Part 2: A* Pathfinding Algorithm**\n", + "The A* pathfinding algorithm 
helps the drone find an optimal path to the target while avoiding obstacles. This function takes a start position, a goal, obstacles, and grid dimensions as inputs." + ], + "metadata": { + "id": "Grvzyodk2vqP" + } + }, + { + "cell_type": "code", + "source": [ + "import heapq\n", + "\n", + "# Heuristic function for A* (Euclidean distance)\n", + "def heuristic(a, b):\n", + " return np.linalg.norm(np.array(a) - np.array(b))\n", + "\n", + "# A* pathfinding function\n", + "def a_star(start, goal, obstacles, grid_width, grid_height):\n", + " open_set = []\n", + " heapq.heappush(open_set, (0, start))\n", + " came_from = {}\n", + " g_score = {start: 0}\n", + " f_score = {start: heuristic(start, goal)}\n", + "\n", + " while open_set:\n", + " current = heapq.heappop(open_set)[1]\n", + "\n", + " if current == goal:\n", + " # Reconstruct path from goal to start\n", + " path = []\n", + " while current in came_from:\n", + " path.append(current)\n", + " current = came_from[current]\n", + " return path[::-1]\n", + "\n", + " for direction in [(1, 0), (0, 1), (-1, 0), (0, -1)]:\n", + " neighbor = (current[0] + direction[0], current[1] + direction[1])\n", + " if neighbor in obstacles or not (0 <= neighbor[0] < grid_width and 0 <= neighbor[1] < grid_height):\n", + " continue\n", + "\n", + " tentative_g_score = g_score[current] + 1\n", + " if neighbor not in g_score or tentative_g_score < g_score[neighbor]:\n", + " came_from[neighbor] = current\n", + " g_score[neighbor] = tentative_g_score\n", + " f_score[neighbor] = tentative_g_score + heuristic(neighbor, goal)\n", + " if neighbor not in [i[1] for i in open_set]:\n", + " heapq.heappush(open_set, (f_score[neighbor], neighbor))\n", + "\n", + " return [] # No path found if the open_set is exhausted\n" + ], + "metadata": { + "id": "gShrXNbi2rzU" + }, + "execution_count": 2, + "outputs": [] + }, + { + "cell_type": "markdown", + "source": [ + "**Part 3: Visualization of the Navigation Graph**\n", + "This block uses NetworkX and Matplotlib 
to visualize waypoints and paths in the navigation graph. Each edge represents a possible route between waypoints, and weights can represent distance or cost." + ], + "metadata": { + "id": "YUdnEH9f27PA" + } + }, + { + "cell_type": "code", + "source": [ + "import networkx as nx\n", + "import matplotlib.pyplot as plt\n", + "\n", + "# Visualization function for navigation graph\n", + "def visualize_graph():\n", + " G = nx.DiGraph()\n", + " G.add_nodes_from([1, 2, 3, 4, 5])\n", + " G.add_edge(1, 2, weight=1.5)\n", + " G.add_edge(1, 3, weight=2.0)\n", + " G.add_edge(2, 4, weight=1.2)\n", + " G.add_edge(3, 4, weight=0.9)\n", + " G.add_edge(4, 5, weight=1.8)\n", + " G.add_edge(3, 5, weight=1.5)\n", + "\n", + " # Define layout\n", + " pos = nx.spring_layout(G)\n", + " plt.figure(figsize=(8, 6))\n", + " nx.draw(G, pos, with_labels=True, node_size=700, node_color=\"lightblue\", edge_color=\"gray\", arrows=True)\n", + "\n", + " # Annotate with edge weights\n", + " edge_labels = nx.get_edge_attributes(G, 'weight')\n", + " nx.draw_networkx_edge_labels(G, pos, edge_labels=edge_labels, font_size=10)\n", + " plt.title(\"Drone Navigation Graph\")\n", + " plt.show()\n" + ], + "metadata": { + "id": "ZdfU_8XT24wE" + }, + "execution_count": 3, + "outputs": [] + }, + { + "cell_type": "markdown", + "source": [ + "**Part 4: Reinforcement Learning with PPO**\n", + "\n", + "We use Stable-Baselines3 to train a PPO model on the custom DroneEnv. The model learns how to navigate the environment and reach the target." 
+ ], + "metadata": { + "id": "QgKGZZGj3FsG" + } + }, + { + "cell_type": "code", + "source": [ + "!pip install stable-baselines3[extra]\n", + "from stable_baselines3 import PPO\n", + "\n", + "# Initialize environment and PPO model\n", + "env = DroneEnv()\n", + "model = PPO(\"MlpPolicy\", env, verbose=1)\n", + "model.learn(total_timesteps=10000) # Train the model\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "X8DFgdHe3CaC", + "outputId": "d64581c2-1172-40d4-be11-93030890bfbf" + }, + "execution_count": 6, + "outputs": [ + { + "output_type": "stream", + "name": "stderr", + "text": [ + "/usr/local/lib/python3.10/dist-packages/ipykernel/ipkernel.py:283: DeprecationWarning: `should_run_async` will not call `transform_cell` automatically in the future. Please pass the result to `transformed_cell` argument and any exception that happen during thetransform in `preprocessing_exc_tuple` in IPython 7.17 and above.\n", + " and should_run_async(code)\n" + ] + }, + { + "output_type": "stream", + "name": "stdout", + "text": [ + "Collecting stable-baselines3[extra]\n", + " Downloading stable_baselines3-2.3.2-py3-none-any.whl.metadata (5.1 kB)\n", + "Collecting gymnasium<0.30,>=0.28.1 (from stable-baselines3[extra])\n", + " Downloading gymnasium-0.29.1-py3-none-any.whl.metadata (10 kB)\n", + "Requirement already satisfied: numpy>=1.20 in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (1.26.4)\n", + "Requirement already satisfied: torch>=1.13 in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (2.5.0+cu121)\n", + "Requirement already satisfied: cloudpickle in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (3.1.0)\n", + "Requirement already satisfied: pandas in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (2.2.2)\n", + "Requirement already satisfied: matplotlib in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) 
(3.7.1)\n", + "Requirement already satisfied: opencv-python in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (4.10.0.84)\n", + "Requirement already satisfied: pygame in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (2.6.1)\n", + "Requirement already satisfied: tensorboard>=2.9.1 in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (2.17.0)\n", + "Requirement already satisfied: psutil in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (5.9.5)\n", + "Requirement already satisfied: tqdm in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (4.66.5)\n", + "Requirement already satisfied: rich in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (13.9.3)\n", + "Collecting shimmy~=1.3.0 (from shimmy[atari]~=1.3.0; extra == \"extra\"->stable-baselines3[extra])\n", + " Downloading Shimmy-1.3.0-py3-none-any.whl.metadata (3.7 kB)\n", + "Requirement already satisfied: pillow in /usr/local/lib/python3.10/dist-packages (from stable-baselines3[extra]) (10.4.0)\n", + "Collecting autorom~=0.6.1 (from autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra])\n", + " Downloading AutoROM-0.6.1-py3-none-any.whl.metadata (2.4 kB)\n", + "Requirement already satisfied: click in /usr/local/lib/python3.10/dist-packages (from autorom~=0.6.1->autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra]) (8.1.7)\n", + "Requirement already satisfied: requests in /usr/local/lib/python3.10/dist-packages (from autorom~=0.6.1->autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra]) (2.32.3)\n", + "Collecting AutoROM.accept-rom-license (from autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra])\n", + " Downloading AutoROM.accept-rom-license-0.6.1.tar.gz (434 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m434.7/434.7 
kB\u001b[0m \u001b[31m8.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25h Installing build dependencies ... \u001b[?25l\u001b[?25hdone\n", + " Getting requirements to build wheel ... \u001b[?25l\u001b[?25hdone\n", + " Preparing metadata (pyproject.toml) ... \u001b[?25l\u001b[?25hdone\n", + "Requirement already satisfied: typing-extensions>=4.3.0 in /usr/local/lib/python3.10/dist-packages (from gymnasium<0.30,>=0.28.1->stable-baselines3[extra]) (4.12.2)\n", + "Collecting farama-notifications>=0.0.1 (from gymnasium<0.30,>=0.28.1->stable-baselines3[extra])\n", + " Using cached Farama_Notifications-0.0.4-py3-none-any.whl.metadata (558 bytes)\n", + "Collecting ale-py~=0.8.1 (from shimmy[atari]~=1.3.0; extra == \"extra\"->stable-baselines3[extra])\n", + " Downloading ale_py-0.8.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (8.1 kB)\n", + "Requirement already satisfied: absl-py>=0.4 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (1.4.0)\n", + "Requirement already satisfied: grpcio>=1.48.2 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (1.64.1)\n", + "Requirement already satisfied: markdown>=2.6.8 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (3.7)\n", + "Requirement already satisfied: protobuf!=4.24.0,<5.0.0,>=3.19.6 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (3.20.3)\n", + "Requirement already satisfied: setuptools>=41.0.0 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (75.1.0)\n", + "Requirement already satisfied: six>1.9 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (1.16.0)\n", + "Requirement already satisfied: tensorboard-data-server<0.8.0,>=0.7.0 in /usr/local/lib/python3.10/dist-packages (from 
tensorboard>=2.9.1->stable-baselines3[extra]) (0.7.2)\n", + "Requirement already satisfied: werkzeug>=1.0.1 in /usr/local/lib/python3.10/dist-packages (from tensorboard>=2.9.1->stable-baselines3[extra]) (3.0.4)\n", + "Requirement already satisfied: filelock in /usr/local/lib/python3.10/dist-packages (from torch>=1.13->stable-baselines3[extra]) (3.16.1)\n", + "Requirement already satisfied: networkx in /usr/local/lib/python3.10/dist-packages (from torch>=1.13->stable-baselines3[extra]) (3.4.2)\n", + "Requirement already satisfied: jinja2 in /usr/local/lib/python3.10/dist-packages (from torch>=1.13->stable-baselines3[extra]) (3.1.4)\n", + "Requirement already satisfied: fsspec in /usr/local/lib/python3.10/dist-packages (from torch>=1.13->stable-baselines3[extra]) (2024.6.1)\n", + "Requirement already satisfied: sympy==1.13.1 in /usr/local/lib/python3.10/dist-packages (from torch>=1.13->stable-baselines3[extra]) (1.13.1)\n", + "Requirement already satisfied: mpmath<1.4,>=1.1.0 in /usr/local/lib/python3.10/dist-packages (from sympy==1.13.1->torch>=1.13->stable-baselines3[extra]) (1.3.0)\n", + "Requirement already satisfied: contourpy>=1.0.1 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) (1.3.0)\n", + "Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) (0.12.1)\n", + "Requirement already satisfied: fonttools>=4.22.0 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) (4.54.1)\n", + "Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) (1.4.7)\n", + "Requirement already satisfied: packaging>=20.0 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) (24.1)\n", + "Requirement already satisfied: pyparsing>=2.3.1 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) 
(3.2.0)\n", + "Requirement already satisfied: python-dateutil>=2.7 in /usr/local/lib/python3.10/dist-packages (from matplotlib->stable-baselines3[extra]) (2.8.2)\n", + "Requirement already satisfied: pytz>=2020.1 in /usr/local/lib/python3.10/dist-packages (from pandas->stable-baselines3[extra]) (2024.2)\n", + "Requirement already satisfied: tzdata>=2022.7 in /usr/local/lib/python3.10/dist-packages (from pandas->stable-baselines3[extra]) (2024.2)\n", + "Requirement already satisfied: markdown-it-py>=2.2.0 in /usr/local/lib/python3.10/dist-packages (from rich->stable-baselines3[extra]) (3.0.0)\n", + "Requirement already satisfied: pygments<3.0.0,>=2.13.0 in /usr/local/lib/python3.10/dist-packages (from rich->stable-baselines3[extra]) (2.18.0)\n", + "Requirement already satisfied: importlib-resources in /usr/local/lib/python3.10/dist-packages (from ale-py~=0.8.1->shimmy[atari]~=1.3.0; extra == \"extra\"->stable-baselines3[extra]) (6.4.5)\n", + "Requirement already satisfied: mdurl~=0.1 in /usr/local/lib/python3.10/dist-packages (from markdown-it-py>=2.2.0->rich->stable-baselines3[extra]) (0.1.2)\n", + "Requirement already satisfied: MarkupSafe>=2.1.1 in /usr/local/lib/python3.10/dist-packages (from werkzeug>=1.0.1->tensorboard>=2.9.1->stable-baselines3[extra]) (3.0.2)\n", + "Requirement already satisfied: charset-normalizer<4,>=2 in /usr/local/lib/python3.10/dist-packages (from requests->autorom~=0.6.1->autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra]) (3.4.0)\n", + "Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.10/dist-packages (from requests->autorom~=0.6.1->autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra]) (3.10)\n", + "Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.10/dist-packages (from requests->autorom~=0.6.1->autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra]) (2.2.3)\n", + "Requirement already satisfied: 
certifi>=2017.4.17 in /usr/local/lib/python3.10/dist-packages (from requests->autorom~=0.6.1->autorom[accept-rom-license]~=0.6.1; extra == \"extra\"->stable-baselines3[extra]) (2024.8.30)\n", + "Downloading AutoROM-0.6.1-py3-none-any.whl (9.4 kB)\n", + "Downloading gymnasium-0.29.1-py3-none-any.whl (953 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m953.9/953.9 kB\u001b[0m \u001b[31m28.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hDownloading Shimmy-1.3.0-py3-none-any.whl (37 kB)\n", + "Downloading stable_baselines3-2.3.2-py3-none-any.whl (182 kB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m182.3/182.3 kB\u001b[0m \u001b[31m13.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hDownloading ale_py-0.8.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.7 MB)\n", + "\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m1.7/1.7 MB\u001b[0m \u001b[31m50.7 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n", + "\u001b[?25hUsing cached Farama_Notifications-0.0.4-py3-none-any.whl (2.5 kB)\n", + "Building wheels for collected packages: AutoROM.accept-rom-license\n", + " Building wheel for AutoROM.accept-rom-license (pyproject.toml) ... 
\u001b[?25l\u001b[?25hdone\n", + " Created wheel for AutoROM.accept-rom-license: filename=AutoROM.accept_rom_license-0.6.1-py3-none-any.whl size=446660 sha256=663d6689e3906f5403b7df59abdc9bf6d2ad16ad590ad4f03726d9fc8d3c7593\n", + " Stored in directory: /root/.cache/pip/wheels/6b/1b/ef/a43ff1a2f1736d5711faa1ba4c1f61be1131b8899e6a057811\n", + "Successfully built AutoROM.accept-rom-license\n", + "Installing collected packages: farama-notifications, gymnasium, ale-py, shimmy, AutoROM.accept-rom-license, autorom, stable-baselines3\n", + "Successfully installed AutoROM.accept-rom-license-0.6.1 ale-py-0.8.1 autorom-0.6.1 farama-notifications-0.0.4 gymnasium-0.29.1 shimmy-1.3.0 stable-baselines3-2.3.2\n" + ] + }, + { + "output_type": "stream", + "name": "stderr", + "text": [ + "/usr/local/lib/python3.10/dist-packages/tensorflow/lite/python/util.py:55: DeprecationWarning: jax.xla_computation is deprecated. Please use the AOT APIs; see https://jax.readthedocs.io/en/latest/aot.html. For example, replace xla_computation(f)(*xs) with jit(f).lower(*xs).compiler_ir('hlo'). See CHANGELOG.md for 0.4.30 for more examples.\n", + " from jax import xla_computation as _xla_computation\n" + ] + }, + { + "output_type": "stream", + "name": "stdout", + "text": [ + "Using cpu device\n", + "Wrapping the env with a `Monitor` wrapper\n", + "Wrapping the env in a DummyVecEnv.\n" + ] + }, + { + "output_type": "stream", + "name": "stderr", + "text": [ + "/usr/local/lib/python3.10/dist-packages/stable_baselines3/common/vec_env/patch_gym.py:49: UserWarning: You provided an OpenAI Gym environment. We strongly recommend transitioning to Gymnasium environments. 
Stable-Baselines3 is automatically wrapping your environments in a compatibility layer, which could potentially cause issues.\n", + " warnings.warn(\n" + ] + }, + { + "output_type": "stream", + "name": "stdout", + "text": [ + "---------------------------------\n", + "| rollout/ | |\n", + "| ep_len_mean | 101 |\n", + "| ep_rew_mean | -107 |\n", + "| time/ | |\n", + "| fps | 756 |\n", + "| iterations | 1 |\n", + "| time_elapsed | 2 |\n", + "| total_timesteps | 2048 |\n", + "---------------------------------\n", + "----------------------------------------\n", + "| rollout/ | |\n", + "| ep_len_mean | 58.6 |\n", + "| ep_rew_mean | -63.8 |\n", + "| time/ | |\n", + "| fps | 728 |\n", + "| iterations | 2 |\n", + "| time_elapsed | 5 |\n", + "| total_timesteps | 4096 |\n", + "| train/ | |\n", + "| approx_kl | 0.01260215 |\n", + "| clip_fraction | 0.109 |\n", + "| clip_range | 0.2 |\n", + "| entropy_loss | -2.07 |\n", + "| explained_variance | -0.0333 |\n", + "| learning_rate | 0.0003 |\n", + "| loss | 13.1 |\n", + "| n_updates | 10 |\n", + "| policy_gradient_loss | -0.0109 |\n", + "| value_loss | 95.8 |\n", + "----------------------------------------\n", + "-----------------------------------------\n", + "| rollout/ | |\n", + "| ep_len_mean | 28 |\n", + "| ep_rew_mean | -32.4 |\n", + "| time/ | |\n", + "| fps | 655 |\n", + "| iterations | 3 |\n", + "| time_elapsed | 9 |\n", + "| total_timesteps | 6144 |\n", + "| train/ | |\n", + "| approx_kl | 0.011939146 |\n", + "| clip_fraction | 0.157 |\n", + "| clip_range | 0.2 |\n", + "| entropy_loss | -2.04 |\n", + "| explained_variance | -0.0245 |\n", + "| learning_rate | 0.0003 |\n", + "| loss | 25.8 |\n", + "| n_updates | 20 |\n", + "| policy_gradient_loss | -0.0172 |\n", + "| value_loss | 90.1 |\n", + "-----------------------------------------\n", + "------------------------------------------\n", + "| rollout/ | |\n", + "| ep_len_mean | 14.9 |\n", + "| ep_rew_mean | -20.3 |\n", + "| time/ | |\n", + "| fps | 620 |\n", + "| 
iterations | 4 |\n", + "| time_elapsed | 13 |\n", + "| total_timesteps | 8192 |\n", + "| train/ | |\n", + "| approx_kl | 0.0099224895 |\n", + "| clip_fraction | 0.203 |\n", + "| clip_range | 0.2 |\n", + "| entropy_loss | -2 |\n", + "| explained_variance | -0.003 |\n", + "| learning_rate | 0.0003 |\n", + "| loss | 40.7 |\n", + "| n_updates | 30 |\n", + "| policy_gradient_loss | -0.0139 |\n", + "| value_loss | 103 |\n", + "------------------------------------------\n", + "-----------------------------------------\n", + "| rollout/ | |\n", + "| ep_len_mean | 10.3 |\n", + "| ep_rew_mean | -16.5 |\n", + "| time/ | |\n", + "| fps | 601 |\n", + "| iterations | 5 |\n", + "| time_elapsed | 17 |\n", + "| total_timesteps | 10240 |\n", + "| train/ | |\n", + "| approx_kl | 0.014140065 |\n", + "| clip_fraction | 0.183 |\n", + "| clip_range | 0.2 |\n", + "| entropy_loss | -1.96 |\n", + "| explained_variance | -0.000966 |\n", + "| learning_rate | 0.0003 |\n", + "| loss | 73.6 |\n", + "| n_updates | 40 |\n", + "| policy_gradient_loss | -0.019 |\n", + "| value_loss | 114 |\n", + "-----------------------------------------\n" + ] + }, + { + "output_type": "execute_result", + "data": { + "text/plain": [ + "" + ] + }, + "metadata": {}, + "execution_count": 6 + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Part 5: Using A* Pathfinding and Testing the Model**\n", + "\n", + "In this part, we apply the A* algorithm to find an optimal path from the start to the target. 
We also test the trained model by running it for up to 20 steps and observing its behavior in the environment." + ], + "metadata": { + "id": "fSLp-6PM3cdE" + } + }, + { + "cell_type": "code", + "source": [ + "# Test A* pathfinding from start to goal\n", + "obstacles = [(6, 6), (7, 7)]\n", + "start, goal = (5, 5), (8, 8)\n", + "path = a_star(start, goal, obstacles, grid_width=10, grid_height=10)\n", + "print(\"Optimal path found by A*:\", path)\n", + "\n", + "# Test the trained agent\n", + "obs = env.reset()\n", + "for _ in range(20):\n", + " action, _states = model.predict(obs)\n", + " obs, rewards, done, info = env.step(action)\n", + " env.render()\n", + " if done:\n", + " break\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" + }, + "id": "AAWY4Ejn3Yjw", + "outputId": "4feb2cd3-152a-42a0-88f1-b5ebefaf68b9" + }, + "execution_count": 7, + "outputs": [ + { + "output_type": "stream", + "name": "stdout", + "text": [ + "Optimal path found by A*: [(5, 6), (5, 7), (6, 7), (6, 8), (7, 8), (8, 8)]\n", + "Drone Position: [6 6], Target: [8 8], Obstacles: [array([6, 6]), array([7, 7])]\n" + ] + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Visualization 1: Environment Setup with Obstacles and Target**\n", + "\n", + "This code sets up a grid environment, places the drone, target, and obstacles on it, and displays the current state of the environment." 
+ ], + "metadata": { + "id": "NZmG9rDZ4THp" + } + }, + { + "cell_type": "code", + "source": [ + "import matplotlib.pyplot as plt\n", + "\n", + "def visualize_environment(drone_pos, target_pos, obstacles):\n", + " # Define the grid size\n", + " grid_size = (10, 10)\n", + "\n", + " # Create a blank grid\n", + " env_grid = np.zeros(grid_size)\n", + "\n", + " # Mark obstacles\n", + " for obs in obstacles:\n", + " env_grid[obs[0], obs[1]] = -1 # Obstacles marked as -1\n", + "\n", + " # Mark target position\n", + " env_grid[target_pos[0], target_pos[1]] = 2 # Target marked as 2\n", + "\n", + " # Mark drone position\n", + " env_grid[drone_pos[0], drone_pos[1]] = 1 # Drone marked as 1\n", + "\n", + " # Plot the grid\n", + " plt.imshow(env_grid, cmap=\"coolwarm\", origin=\"upper\")\n", + " plt.colorbar(label=\"Environment Elements\")\n", + " plt.scatter(drone_pos[1], drone_pos[0], color='blue', label=\"Drone\")\n", + " plt.scatter(target_pos[1], target_pos[0], color='green', label=\"Target\")\n", + " for obs in obstacles:\n", + " plt.scatter(obs[1], obs[0], color='red', label=\"Obstacle\" if obs == obstacles[0] else \"\")\n", + "\n", + " plt.legend(loc=\"upper right\")\n", + " plt.title(\"Drone Environment with Obstacles and Target\")\n", + " plt.show()\n", + "\n", + "# Test Visualization\n", + "drone_pos = (5, 5)\n", + "target_pos = (8, 8)\n", + "obstacles = [(6, 6), (7, 7)]\n", + "visualize_environment(drone_pos, target_pos, obstacles)\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 452 + }, + "id": "SFZkp-QU3pcR", + "outputId": "5db87dd3-1af4-42cc-febf-2380fae14b6e" + }, + "execution_count": 8, + "outputs": [ + { + "output_type": "display_data", + "data": { + "text/plain": [ + "
" + ], + "image/png": "\n" + }, + "metadata": {} + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Visualization 2: A* Pathfinding Visualization**\n", + "\n", + "Once the A* algorithm generates a path, we can overlay this path on the environment setup. The path points are highlighted to show the optimal path from the drone’s start position to the target." + ], + "metadata": { + "id": "ql7ZSfxR4aIV" + } + }, + { + "cell_type": "code", + "source": [ + "def visualize_astar_path(drone_pos, target_pos, obstacles, path):\n", + " # Create a grid for the environment\n", + " grid_size = (10, 10)\n", + " env_grid = np.zeros(grid_size)\n", + "\n", + " # Mark obstacles\n", + " for obs in obstacles:\n", + " env_grid[obs[0], obs[1]] = -1 # Obstacles marked as -1\n", + "\n", + " # Mark target position\n", + " env_grid[target_pos[0], target_pos[1]] = 2 # Target marked as 2\n", + "\n", + " # Mark drone position\n", + " env_grid[drone_pos[0], drone_pos[1]] = 1 # Drone marked as 1\n", + "\n", + " # Plot the grid with A* path\n", + " plt.imshow(env_grid, cmap=\"coolwarm\", origin=\"upper\")\n", + " plt.colorbar(label=\"Environment Elements\")\n", + " plt.scatter(drone_pos[1], drone_pos[0], color='blue', label=\"Drone Start\")\n", + " plt.scatter(target_pos[1], target_pos[0], color='green', label=\"Target\")\n", + " for obs in obstacles:\n", + " plt.scatter(obs[1], obs[0], color='red', label=\"Obstacle\" if obs == obstacles[0] else \"\")\n", + "\n", + " # Draw the A* path\n", + " if path:\n", + " path_x, path_y = zip(*path)\n", + " plt.plot(path_y, path_x, color=\"yellow\", linewidth=2, marker=\"o\", label=\"A* Path\")\n", + "\n", + " plt.legend(loc=\"upper right\")\n", + " plt.title(\"A* Pathfinding for Drone Navigation\")\n", + " plt.show()\n", + "\n", + "# Test A* Visualization\n", + "path = a_star(start=drone_pos, goal=target_pos, obstacles=obstacles, grid_width=10, grid_height=10)\n", + "visualize_astar_path(drone_pos, target_pos, obstacles, path)\n" + ], + 
"metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 452 + }, + "id": "hr1bJdal30ZW", + "outputId": "cb6746c5-beed-450a-945b-bf4cf3343b1f" + }, + "execution_count": 9, + "outputs": [ + { + "output_type": "display_data", + "data": { + "text/plain": [ + "
" + ], + "image/png": "\n" + }, + "metadata": {} + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Visualization 3. Dynamic Environment Visualization with Drone Movement\n", + "If you want to visualize the drone moving step-by-step toward the target, this code will update the grid in real-time with each step taken. This approach is particularly helpful in seeing the drone's decision-making process as it progresses.**" + ], + "metadata": { + "id": "MGYh34-34Hyy" + } + }, + { + "cell_type": "code", + "source": [ + "import time\n", + "from IPython.display import clear_output\n", + "\n", + "def visualize_dynamic_path(drone_pos, target_pos, obstacles, path):\n", + " for step in path:\n", + " clear_output(wait=True)\n", + "\n", + " # Create environment grid\n", + " grid_size = (10, 10)\n", + " env_grid = np.zeros(grid_size)\n", + "\n", + " # Mark obstacles\n", + " for obs in obstacles:\n", + " env_grid[obs[0], obs[1]] = -1 # Obstacles marked as -1\n", + "\n", + " # Mark target\n", + " env_grid[target_pos[0], target_pos[1]] = 2 # Target marked as 2\n", + "\n", + " # Mark drone position\n", + " env_grid[step[0], step[1]] = 1 # Drone marked as 1\n", + "\n", + " # Plot grid\n", + " plt.imshow(env_grid, cmap=\"coolwarm\", origin=\"upper\")\n", + " plt.colorbar(label=\"Environment Elements\")\n", + " plt.scatter(step[1], step[0], color='blue', label=\"Drone\")\n", + " plt.scatter(target_pos[1], target_pos[0], color='green', label=\"Target\")\n", + " for obs in obstacles:\n", + " plt.scatter(obs[1], obs[0], color='red', label=\"Obstacle\" if obs == obstacles[0] else \"\")\n", + "\n", + " plt.legend(loc=\"upper right\")\n", + " plt.title(\"Drone Moving Toward Target\")\n", + " plt.show()\n", + "\n", + " time.sleep(0.5) # Adjust time as needed\n", + "\n", + "# Execute dynamic visualization\n", + "drone_pos = (5, 5)\n", + "target_pos = (8, 8)\n", + "obstacles = [(6, 6), (7, 7)]\n", + "path = a_star(start=drone_pos, goal=target_pos, obstacles=obstacles, grid_width=10, 
grid_height=10)\n", + "visualize_dynamic_path(drone_pos, target_pos, obstacles, path)\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 452 + }, + "id": "-wnrFh6u4C7N", + "outputId": "7ff0ab6d-edaa-46c6-e7c9-2804df2c540f" + }, + "execution_count": 11, + "outputs": [ + { + "output_type": "display_data", + "data": { + "text/plain": [ + "
" + ], + "image/png": "\n" + }, + "metadata": {} + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Visualization 4. Heatmap of Pathfinding Costs (A* Algorithm)**\n", + "\n", + "The following code displays the pathfinding costs generated by the A* algorithm as a heatmap. This can illustrate which areas are more costly to traverse, showing the effectiveness of the chosen path." + ], + "metadata": { + "id": "33jk_6OR5Pbm" + } + }, + { + "cell_type": "code", + "source": [ + "def visualize_cost_heatmap(start, goal, obstacles, grid_width, grid_height):\n", + " # Initialize cost grid with high values\n", + " cost_grid = np.full((grid_width, grid_height), np.inf)\n", + "\n", + " # Use A* to calculate cost to each cell from start\n", + " open_set = [(0, start)]\n", + " g_score = {start: 0}\n", + "\n", + " while open_set:\n", + " _, current = heapq.heappop(open_set)\n", + " cost_grid[current[0], current[1]] = g_score[current]\n", + "\n", + " # Explore neighbors\n", + " for direction in [(1, 0), (0, 1), (-1, 0), (0, -1)]:\n", + " neighbor = (current[0] + direction[0], current[1] + direction[1])\n", + " if neighbor in obstacles or not (0 <= neighbor[0] < grid_width and 0 <= neighbor[1] < grid_height):\n", + " continue\n", + "\n", + " tentative_g_score = g_score[current] + 1\n", + " if tentative_g_score < g_score.get(neighbor, np.inf):\n", + " g_score[neighbor] = tentative_g_score\n", + " heapq.heappush(open_set, (tentative_g_score, neighbor))\n", + "\n", + " # Plot cost heatmap\n", + " plt.imshow(cost_grid, cmap=\"hot\", origin=\"upper\")\n", + " plt.colorbar(label=\"Traversal Cost\")\n", + " plt.scatter(start[1], start[0], color='blue', label=\"Start\")\n", + " plt.scatter(goal[1], goal[0], color='green', label=\"Goal\")\n", + " for obs in obstacles:\n", + " plt.scatter(obs[1], obs[0], color='red', label=\"Obstacle\" if obs == obstacles[0] else \"\")\n", + "\n", + " plt.legend(loc=\"upper right\")\n", + " plt.title(\"Pathfinding Cost Heatmap\")\n", + " 
plt.show()\n", + "\n", + "# Visualize cost heatmap\n", + "start = (5, 5)\n", + "goal = (8, 8)\n", + "obstacles = [(6, 6), (7, 7)]\n", + "visualize_cost_heatmap(start, goal, obstacles, grid_width=10, grid_height=10)\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 452 + }, + "id": "tYzcqAOY4EuZ", + "outputId": "2e5ac3d0-5a8f-4360-a66c-cdea1ab2c6d7" + }, + "execution_count": 13, + "outputs": [ + { + "output_type": "display_data", + "data": { + "text/plain": [ + "
" + ], + "image/png": "\n" + }, + "metadata": {} + } + ] + }, + { + "cell_type": "markdown", + "source": [ + "**Visualization 5. 3D Surface Plot of Pathfinding Costs**\n", + "\n", + "For a different perspective, a 3D surface plot of traversal costs shows how costs vary across the grid, highlighting areas where pathfinding is more challenging due to obstacles or distance." + ], + "metadata": { + "id": "IHPLqUme5eAJ" + } + }, + { + "cell_type": "code", + "source": [ + "from mpl_toolkits.mplot3d import Axes3D\n", + "\n", + "def visualize_cost_surface(start, goal, obstacles, grid_width, grid_height):\n", + " # Generate cost grid with A* similar to previous heatmap code\n", + " cost_grid = np.full((grid_width, grid_height), np.inf)\n", + " open_set = [(0, start)]\n", + " g_score = {start: 0}\n", + "\n", + " while open_set:\n", + " _, current = heapq.heappop(open_set)\n", + " cost_grid[current[0], current[1]] = g_score[current]\n", + "\n", + " for direction in [(1, 0), (0, 1), (-1, 0), (0, -1)]:\n", + " neighbor = (current[0] + direction[0], current[1] + direction[1])\n", + " if neighbor in obstacles or not (0 <= neighbor[0] < grid_width and 0 <= neighbor[1] < grid_height):\n", + " continue\n", + " tentative_g_score = g_score[current] + 1\n", + " if tentative_g_score < g_score.get(neighbor, np.inf):\n", + " g_score[neighbor] = tentative_g_score\n", + " heapq.heappush(open_set, (tentative_g_score, neighbor))\n", + "\n", + " # Create 3D surface plot\n", + " x, y = np.meshgrid(range(grid_width), range(grid_height))\n", + " fig = plt.figure(figsize=(10, 7))\n", + " ax = fig.add_subplot(111, projection='3d')\n", + " ax.plot_surface(x, y, cost_grid.T, cmap=\"viridis\", edgecolor=\"none\")\n", + "\n", + " # Add start, goal, and obstacles\n", + " ax.scatter(start[1], start[0], g_score[start], color='blue', s=50, label=\"Start\")\n", + " ax.scatter(goal[1], goal[0], g_score.get(goal, np.inf), color='green', s=50, label=\"Goal\")\n", + " for obs in obstacles:\n", + " 
ax.scatter(obs[1], obs[0], 0, color='red', s=50, label=\"Obstacle\" if obs == obstacles[0] else \"\")\n", + "\n", + " ax.set_xlabel('X Coordinate')\n", + " ax.set_ylabel('Y Coordinate')\n", + " ax.set_zlabel('Pathfinding Cost')\n", + " ax.set_title(\"3D Pathfinding Cost Surface\")\n", + " plt.legend()\n", + " plt.show()\n", + "\n", + "# Test 3D cost surface visualization\n", + "start = (5, 5)\n", + "goal = (8, 8)\n", + "obstacles = [(6, 6), (7, 7)]\n", + "visualize_cost_surface(start, goal, obstacles, grid_width=10, grid_height=10)\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 654 + }, + "id": "kTl56T9u5VBY", + "outputId": "b6978a02-fdd4-488a-c420-7cf0ec27899f" + }, + "execution_count": 14, + "outputs": [ + { + "output_type": "stream", + "name": "stderr", + "text": [ + "/usr/local/lib/python3.10/dist-packages/mpl_toolkits/mplot3d/proj3d.py:180: RuntimeWarning: invalid value encountered in divide\n", + " txs, tys, tzs = vecw[0]/w, vecw[1]/w, vecw[2]/w\n" + ] + }, + { + "output_type": "display_data", + "data": { + "text/plain": [ + "
" + ], + "image/png": "\n" + }, + "metadata": {} + } + ] + }, + { + "cell_type": "code", + "source": [ + "import networkx as nx\n", + "\n", + "def visualize_navigation_graph():\n", + " G = nx.DiGraph()\n", + " G.add_nodes_from([1, 2, 3, 4, 5])\n", + " G.add_edge(1, 2, weight=1.5)\n", + " G.add_edge(1, 3, weight=2.0)\n", + " G.add_edge(2, 4, weight=1.2)\n", + " G.add_edge(3, 4, weight=0.9)\n", + " G.add_edge(4, 5, weight=1.8)\n", + " G.add_edge(3, 5, weight=1.5)\n", + "\n", + " # Define layout\n", + " pos = nx.spring_layout(G)\n", + " plt.figure(figsize=(8, 6))\n", + " nx.draw(G, pos, with_labels=True, node_size=700, node_color=\"lightblue\", edge_color=\"gray\", arrows=True)\n", + "\n", + " # Annotate with edge weights\n", + " edge_labels = nx.get_edge_attributes(G, 'weight')\n", + " nx.draw_networkx_edge_labels(G, pos, edge_labels=edge_labels, font_size=10)\n", + " plt.title(\"Drone Navigation Graph\")\n", + " plt.show()\n", + "\n", + "# Test the graph visualization\n", + "visualize_navigation_graph()\n" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/", + "height": 659 + }, + "id": "zto0OQMm34ft", + "outputId": "4b91410d-428c-429d-a4a7-7119459def9e" + }, + "execution_count": 10, + "outputs": [ + { + "output_type": "display_data", + "data": { + "text/plain": [ + "
" + ], + "image/png": "\n" + }, + "metadata": {} + } + ] + } + ] +} diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/3D Path Finding Cost Suraface schematic (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/3D Path Finding Cost Suraface schematic (1).png new file mode 100644 index 000000000..c96a19820 Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/3D Path Finding Cost Suraface schematic (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Drone Nav Graph (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Drone Nav Graph (1).png new file mode 100644 index 000000000..15d329a6c Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Drone Nav Graph (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Drone Navigation Graph (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Drone Navigation Graph (1).png new file mode 100644 index 000000000..bd9d32aa1 Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Drone Navigation Graph (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Navigation Graph (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Navigation Graph (1).png new file mode 100644 index 000000000..4ddd495ac Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Navigation Graph (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Navigation Graph (2).png b/Drone Navigation Detection using 
advanced Reinforcement Learning techniques/Images/Navigation Graph (2).png new file mode 100644 index 000000000..4ddd495ac Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Navigation Graph (2).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Output img b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Output img new file mode 100644 index 000000000..8b1378917 --- /dev/null +++ b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/Output img @@ -0,0 +1 @@ + diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/a star graph (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/a star graph (1).png new file mode 100644 index 000000000..e72027c79 Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/a star graph (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/env graph (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/env graph (1).png new file mode 100644 index 000000000..523361e5a Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/env graph (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/pathfinding_heat-map (1).png b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/pathfinding_heat-map (1).png new file mode 100644 index 000000000..60dfd0baa Binary files /dev/null and b/Drone Navigation Detection using advanced Reinforcement Learning techniques/Images/pathfinding_heat-map (1).png differ diff --git a/Drone Navigation Detection using advanced Reinforcement Learning techniques/README.md 
b/Drone Navigation Detection using advanced Reinforcement Learning techniques/README.md
new file mode 100644
index 000000000..8f6898fd0
--- /dev/null
+++ b/Drone Navigation Detection using advanced Reinforcement Learning techniques/README.md
@@ -0,0 +1,111 @@
+# Drone Pathfinding and Navigation with Reinforcement Learning
+This project visualizes a drone's pathfinding journey in a grid environment, using both classical A* search and Reinforcement Learning (RL) techniques to achieve optimal navigation. The drone aims to reach a target location while avoiding obstacles and optimizing path cost. This file provides a comprehensive overview of the project’s structure, setup instructions, and available visualizations.
+
+# Table of Contents
+-> Features
+-> Project Structure
+-> Setup Instructions
+-> Usage
+-> Visualizations
+
+    1. Basic Environment Setup
+    2. Static Path Visualization
+    3. Heatmap of Pathfinding Costs
+    4. Dynamic Movement Visualization
+    5. 3D Surface Plot of Pathfinding Costs
+-> Reinforcement Learning (RL) Model
+-> Contributing
+-> License
+
+# Features
+Pathfinding with the A* Algorithm: Finds a shortest path from the starting position to the target using A* search. Reinforcement Learning Navigation: A reinforcement learning model trains to achieve the navigation goal while avoiding obstacles, rewarding efficient paths. Configurable Obstacles: Specify obstacle positions to simulate real-world barriers and observe how the path adapts. Comprehensive Visualizations: Includes static, dynamic, and 3D visualizations of the environment, path costs, and drone’s decision-making process. Real-time Animation: Watch the drone’s actions in a step-by-step movement toward the target.
+
+# Project Structure
+pathfinding block: Contains the A* algorithm and helper functions for calculating paths.
+reinforcement_learning block: Implements the reinforcement learning environment using OpenAI Gym, where the drone learns an optimal policy for navigation.
+visualizations block: Defines visualization functions, including static, dynamic, and heatmap visualizations.
+
+# Setup Instructions
+Clone the repository:
+
+git clone https://github.com/Panchadip-128/Drone-Navigation_Detection_using_RL.git && cd Drone-Navigation_Detection_using_RL
+
+# Install required dependencies:
+    pip install -r requirements.txt
+
+Run the notebook: Drone-Navigation_Detection_using_RL.ipynb
+
+# Usage:
+Specify Start, Target, and Obstacle Positions: Set coordinates for the drone’s starting position, the target, and obstacles. Choose Navigation Algorithm: Run either the A* pathfinding method or the reinforcement learning model to observe different navigation approaches.
+
+Select Visualization Type: View different visualizations of the environment, path, costs, and drone movements.
+
+# Visualizations
+The project includes several visualizations to illustrate pathfinding and navigation strategies in the environment.
+
+- Basic Environment Setup: Sets up a grid environment, marking the drone’s starting position, the target, and obstacles.
+
+  def visualize_environment(drone_pos, target_pos, obstacles, grid_size=(10, 10))
+
+  ![env graph](https://github.com/user-attachments/assets/a6868ac3-d936-4b03-a72d-1d20801c6aac)
+
+
+- Static Path Visualization: Displays a static view of the calculated A* path from start to target.
+
+  def visualize_astar_path(drone_pos, target_pos, obstacles, path)
+
+  ![a star graph](https://github.com/user-attachments/assets/d70ec385-9cc2-40d6-adf6-5b22f12723d9)
+
+- Heatmap of Pathfinding Costs: Shows a heatmap of the traversal cost from the start to each grid cell, providing insight into pathfinding challenges.
+
+  def visualize_cost_heatmap(start, goal, obstacles, grid_width, grid_height)
+
+  ![pathfinding_heat-map](https://github.com/user-attachments/assets/320baa43-f83b-4567-8d99-131bfb4dd3b7)
+
+- Dynamic Movement Visualization: Animates the drone’s movement toward the target step by step along the computed path.
+
+  ![Navigation Graph](https://github.com/user-attachments/assets/acc92014-bbff-40de-b964-dc649d00a2d7)
+
+
+
+- 3D Surface Plot of Pathfinding Costs: Visualizes the cost distribution across the grid in 3D, highlighting areas with high or low pathfinding costs.
+
+  ![3D Path Finding Cost Surface schematic](https://github.com/user-attachments/assets/f243d58c-1948-462a-a50b-cfd763807bf9)
+
+- Navigation Graph:
+
+![Drone Navigation Graph](https://github.com/user-attachments/assets/bc69c957-acac-48ce-ad2d-cef3399f3c39)
+
+
+Reinforcement Learning (RL) Model Overview: In addition to the A* algorithm, this project includes a reinforcement learning approach that lets the drone learn navigation strategies through interaction with the environment. The RL agent is implemented using OpenAI Gym and trained with the Proximal Policy Optimization (PPO) algorithm from stable-baselines3.
+
+RL Environment: The RL environment for the drone is defined in DroneEnv, an OpenAI Gym environment that:
+
+Defines the drone’s possible actions: Up, Down, Left, Right, and diagonal moves. Contains a custom reward function: Positive Reward: Awarded for reaching the target. Penalty: Applied when the drone hits an obstacle or moves away from the target. Exploration vs. Exploitation: Introduces a small exploration rate (epsilon) to encourage the drone to explore initially before converging on optimal paths.
+
+# Training the RL Model
+    from stable_baselines3 import PPO
+
+    env = DroneEnv()
+    model = PPO("MlpPolicy", env, verbose=1)
+    model.learn(total_timesteps=10000)  # Train the model; adjust total_timesteps as needed
+
+# Evaluation
+After training, the RL model navigates the drone autonomously, choosing each move from its learned policy. This approach is intended to make the drone more flexible, so that it can adapt when obstacles or targets change.
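The README refers to `DroneEnv` without showing it. The sketch below is a minimal, dependency-free illustration of the reward scheme described above, assuming the eight actions, the start (5, 5), the target (8, 8), and the obstacles (6, 6) and (7, 7) given in the project's dataset notes. The class name `GridDroneSketch`, the reward magnitudes, and the choice to end the episode on a collision are assumptions made for illustration, not the project's actual implementation. It mirrors the classic Gym `reset`/`step` return shape but does not subclass `gym.Env`, so it cannot be passed to PPO as written.

```python
from typing import Dict, Tuple

# Action index -> (dx, dy), following the eight moves listed in the dataset notes.
ACTIONS: Dict[int, Tuple[int, int]] = {
    0: (0, 1),    # Up
    1: (0, -1),   # Down
    2: (-1, 0),   # Left
    3: (1, 0),    # Right
    4: (1, 1),    # Up-Right
    5: (-1, 1),   # Up-Left
    6: (1, -1),   # Down-Right
    7: (-1, -1),  # Down-Left
}


class GridDroneSketch:
    """Hypothetical stand-in for DroneEnv; reward values are assumptions."""

    def __init__(self, start=(5, 5), target=(8, 8),
                 obstacles=((6, 6), (7, 7)), size=10):
        self.start = start
        self.target = target
        self.obstacles = set(obstacles)
        self.size = size
        self.pos = start

    def reset(self):
        """Return the drone to its start cell and report that position."""
        self.pos = self.start
        return self.pos

    def _distance(self, pos):
        # Chebyshev distance: moves needed on an empty grid with diagonals.
        return max(abs(pos[0] - self.target[0]), abs(pos[1] - self.target[1]))

    def step(self, action):
        """Apply one move and return (position, reward, done, info)."""
        dx, dy = ACTIONS[action]
        # Clamp the move so the drone stays inside the grid.
        x = min(max(self.pos[0] + dx, 0), self.size - 1)
        y = min(max(self.pos[1] + dy, 0), self.size - 1)
        new_pos = (x, y)
        old_pos, self.pos = self.pos, new_pos

        if new_pos in self.obstacles:
            return new_pos, -10.0, True, {"event": "collision"}
        if new_pos == self.target:
            return new_pos, 10.0, True, {"event": "target"}

        # Larger penalty for moving away from the target, small step cost otherwise.
        moved_away = self._distance(new_pos) > self._distance(old_pos)
        return new_pos, (-1.0 if moved_away else -0.1), False, {}
```

With these defaults, the diagonal move from (5, 5) to (6, 6) ends the episode with the collision penalty, while the sequence Right, Right, Up, Right, Up, Up steps around both obstacles and reaches the target.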
+
+# Visualizing RL Navigation
+The RL model’s path can be dynamically visualized, showing how it navigates step-by-step toward the target:
+
+    obs = env.reset()
+    for _ in range(20):
+        action, _states = model.predict(obs)
+        obs, rewards, done, info = env.step(action)
+        env.render()
+        if done:
+            obs = env.reset()
+
+# Contributing
+Contributions are welcome! Please fork the repository and create a pull request with improvements or feature additions, or contact the maintainer on GitHub (@Panchadip-128) or by email at panchadip125@gmail.com.
+
+# License
+This project is licensed under the MIT License.