This system is designed to efficiently ingest, store, and query vast volumes of log data. It comprises a Log Ingestor responsible for accepting log data over HTTP and a Query Interface that enables users to perform full-text searches and apply filters on various log attributes.
- Programming Language: Python
- Database: MySQL
- Technologies: Kafka, Kafka REST Proxy, Kafka Schema Registry
- Frontend: HTML, CSS, JavaScript
- Backend: Flask
- Log Ingestor
- Ingests logs in the provided JSON format via HTTP on port 3000.
- Ensures scalability to handle high log volumes.
- Optimizes I/O operations and database write speeds.
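The ingestion endpoint can be sketched as a small Flask handler; this is a minimal illustration, and the route and response shape are assumptions rather than the repository's actual code:

```python
from flask import Flask, request, jsonify

app = Flask(__name__)

@app.route("/", methods=["POST"])
def ingest_log():
    # Accept the log entry as JSON; reject non-JSON bodies early.
    log = request.get_json(silent=True)
    if log is None:
        return jsonify({"error": "expected JSON body"}), 400
    # In the full system, the parsed log would be published to Kafka here.
    return jsonify({"status": "accepted"}), 200

# To serve on the ingestor's port: app.run(host="0.0.0.0", port=3000)
```

Returning `400` for malformed bodies keeps bad payloads out of the pipeline before they reach Kafka or the database.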
- Query Interface
- Offers a user-friendly interface (Web UI/CLI) for full-text search.
- Includes filters for:
- level
- message
- resourceId
- timestamp
- traceId
- spanId
- commit
- metadata.parentResourceId
- Implements efficient search algorithms for quick results.
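One way to turn the filter set above into a database query is to build a parameterized WHERE clause. The sketch below assumes the column names mirror the filter names (with `metadata.parentResourceId` stored in its own column); both assumptions are illustrative:

```python
ALLOWED = {"level", "message", "resourceId", "timestamp",
           "traceId", "spanId", "commit", "metadata.parentResourceId"}

# Assumed mapping from filter names to columns in a `logs` table.
COLUMNS = {name: name.replace("metadata.parentResourceId",
                              "parent_resource_id") for name in ALLOWED}

def build_query(filters):
    """Translate {filter_name: value} into parameterized SQL (MySQL style)."""
    clauses, params = [], []
    for name, value in filters.items():
        if name not in ALLOWED:
            raise ValueError(f"unsupported filter: {name}")
        if name == "message":
            # Substring match on the message; true full-text search is a
            # later step (e.g. via Elasticsearch).
            clauses.append("message LIKE %s")
            params.append(f"%{value}%")
        else:
            clauses.append(f"{COLUMNS[name]} = %s")
            params.append(value)
    where = " AND ".join(clauses) if clauses else "1=1"
    return f"SELECT * FROM logs WHERE {where}", params
```

Whitelisting filter names and passing values as parameters (never interpolating them into the SQL string) guards against SQL injection.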
- Advanced Features (To be Implemented...)
- Search within specific date ranges.
- Utilization of regular expressions for search.
- Combining multiple filters for precise queries.
- Real-time log ingestion and searching capabilities.
- Role-based access control to the query interface.
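The date-range and regex features could be combined roughly as follows; this is a sketch over in-memory log dicts (the function names are illustrative), whereas a production version would push these predicates down into the database:

```python
import re
from datetime import datetime, timezone

def in_range(log, start, end):
    """True if the log's ISO-8601 timestamp falls within [start, end]."""
    ts = datetime.fromisoformat(log["timestamp"].replace("Z", "+00:00"))
    return start <= ts <= end

def matches(log, pattern):
    """True if the regex pattern matches anywhere in the message."""
    return re.search(pattern, log["message"]) is not None

def advanced_search(logs, pattern, start, end):
    """Combine a regex filter on `message` with a timestamp-range filter."""
    return [log for log in logs
            if in_range(log, start, end) and matches(log, pattern)]
```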
- Utilizes an HTTP server to receive logs.
- Parses incoming JSON logs and publishes them to a Kafka topic.
- Subscribes to the Kafka topic and consumes logs from it.
- Stores consumed logs in the primary read database instance.
- Provides a user interface for search and filtering.
- Processes user queries and translates them into database queries.
- Utilizes optimized indexing for faster search results.
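The publish/consume flow above could be sketched with the `kafka-python` client. The topic name and broker address below are assumptions, and `store` stands in for whatever writes to the database:

```python
import json

TOPIC = "logs"               # assumed topic name
BROKER = "localhost:9092"    # assumed broker address

def serialize(log: dict) -> bytes:
    """Encode a parsed log entry for publishing to the topic."""
    return json.dumps(log).encode("utf-8")

def deserialize(raw: bytes) -> dict:
    """Decode a consumed record back into a log entry."""
    return json.loads(raw.decode("utf-8"))

def publish(log: dict) -> None:
    """Publish one parsed log to the Kafka topic."""
    from kafka import KafkaProducer  # pip install kafka-python
    producer = KafkaProducer(bootstrap_servers=BROKER,
                             value_serializer=serialize)
    producer.send(TOPIC, log)
    producer.flush()

def consume_and_store(store) -> None:
    """Subscribe to the topic and hand each log to `store` (a DB writer)."""
    from kafka import KafkaConsumer
    consumer = KafkaConsumer(TOPIC, bootstrap_servers=BROKER,
                             value_deserializer=deserialize)
    for record in consumer:
        store(record.value)
```

Decoupling ingestion from storage through the topic lets the HTTP server acknowledge logs quickly while the consumer batches database writes at its own pace.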
- MySQL - Relational Database: Stores structured log data, optimized for structured queries and joins.
- NoSQL Database (e.g., Elasticsearch): Facilitates full-text search and complex queries efficiently. (To be implemented...)
- Scalability: Implements database sharding for distributing load.
- Caching Mechanism: Utilizes caching strategies for frequently accessed data.
- Load Balancing: Distributes incoming requests across multiple servers for enhanced performance.
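The caching mechanism can be illustrated with a small in-process TTL cache keyed on the query; the class, the 30-second TTL, and the `run_query` callback are illustrative choices, not the repository's implementation:

```python
import time

class TTLCache:
    """Tiny time-based cache for frequently repeated query results."""

    def __init__(self, ttl_seconds=30.0):
        self.ttl = ttl_seconds
        self._store = {}  # key -> (expiry_time, value)

    def get(self, key):
        entry = self._store.get(key)
        if entry is None:
            return None
        expires, value = entry
        if time.monotonic() > expires:
            del self._store[key]  # stale entry: drop it and report a miss
            return None
        return value

    def put(self, key, value):
        self._store[key] = (time.monotonic() + self.ttl, value)

cache = TTLCache(ttl_seconds=30.0)

def cached_search(sql, params, run_query):
    """Serve repeated identical queries from the cache; else hit the DB."""
    key = (sql, tuple(params))
    hit = cache.get(key)
    if hit is not None:
        return hit
    result = run_query(sql, params)
    cache.put(key, result)
    return result
```

A short TTL keeps results fresh enough for log search while absorbing bursts of identical queries; a shared cache (e.g. Redis) would be the natural next step once there are multiple servers behind the load balancer.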
- Docker
- Clone the repository: `git clone https://github.com/MasterZesty/log-ingestor-with-query-interface.git`
- Navigate to the project directory: `cd log-ingestor-with-query-interface`
- Run `docker-compose up -d`
- Wait 1-2 minutes for Kafka and MySQL to finish creating their resources.
- To ingest logs via the web UI, open `http://localhost:3000/` in a browser.
- To start the consumer service, visit `http://localhost:3000/consumer`.
- To search logs, open `http://localhost:3000/search` in a browser.
- You can also send JSON data to the HTTP endpoint with a POST request:

      curl --location 'localhost:3000' \
        --header 'Content-Type: application/json' \
        --data '{
          "level": "error",
          "message": "Failed to connect to DB",
          "resourceId": "server-1234",
          "timestamp": "2023-09-15T08:00:00Z",
          "traceId": "abc-xyz-123",
          "spanId": "span-456",
          "commit": "5e5342f",
          "metadata": { "parentResourceId": "server-0987" }
        }'
- Real-time Capabilities: Enhance real-time log ingestion and search.
- Enhanced Security: Strengthen security measures, especially for user access and data integrity.
- Optimization: Continuously optimize database queries and indexing strategies for better performance.
- Volume: Handles massive log volumes efficiently.
- Speed: Provides quick search results.
- Scalability: Adaptable to increasing log volumes and queries.
- Usability: Offers an intuitive interface for users.
- Advanced Features: Implements bonus functionalities.
- Readability: Maintains a clean and structured codebase.
This system effectively manages log data ingestion and provides a seamless query interface for users to retrieve specific logs based on various attributes. Continuous improvements can enhance its performance and capabilities.