Multi Agent Anomaly Detection

We propose multi-agent anomaly detection (MAAD), a distributed architecture with lightweight machine learning models for real-time anomaly detection. MAAD deploys an agent on each service of microservices systems When a request triggers a sequence of services, each agent performs local anomaly detection based on its own logs, local context, and information extracted from its parent span service. For more details, please refer to our IPDPS 2024 paper.

Environment requirement

Linux. This system has been tested on Ubuntu 22.04, python3.7.1.

Python environment

Use the requirements.txt as follow.

conda create -n MAAD python=3.7
conda activate MAAD
pip install -r requirements.txt

Other Requirement

Drain3, refer to https://github.com/logpai/Drain3.

GloVe, refer to https://github.com/stanfordnlp/GloVe.

SIF, refer to https://github.com/PrincetonML/SIF.

Dataset

Please refer to https://github.com/FudanSELab/DeepTraLog to get the dataset.

or

Refer to http://docs.aiops.cloudwise.com/en/ to get another dataset.

or any other dataset and ensure that the dataset has at least trace data.

Quick Start

Firstly, prepate the data at first. It is NECESSARY to do the things below at the same time.

Split the whole dataset into saperated trace files, each filename format is traceid_faulttype.* (e.g. 7bf800940ab64c55a70add01ad6b847b.37.16284749994970479_71.csv) and put them into the same folder.
Generate faults list as id_fault.csv and services list as id_service.csv.
Generate drain3 model and use this model to construct corpus to train a GloVe model.

Use MAADWorkflow.py to train multi agents.

python MAADWorkflow.py --servicelist id_service.csv --faultlist id_fault.csv --batch 1 --trainset ./data/train/ --labelmode 0 --errortypes 72 --train True --gloveword ./GloVeModel/vectors.txt --glovevec ./GloVeModel/vocab.txt --drain trainticket --fileconfig "4,1,0,6,3,5"

When the parameter train is set as False, the program will become inference mod. And the program could generate an MAADout.txt which contains a series of multi-agent confidence lists and its corresponding labels.

Then, use Multi_decision_Merger.py as the Multi-Decision Merger.

python Multi_decision_Merger.py --trainset MAADout_test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi Agent Anomaly Detection

Environment requirement

Python environment

Other Requirement

Dataset

Quick Start

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
DrainModel		DrainModel
GloVeModel		GloVeModel
config		config
data/train		data/train
model		model
.gitignore		.gitignore
7bf800940ab64c55a70add01ad6b847b.37.16284749994970479_71.csv		7bf800940ab64c55a70add01ad6b847b.37.16284749994970479_71.csv
MAADWorkflow.py		MAADWorkflow.py
Multi_decision_Merger.py		Multi_decision_Merger.py
README.md		README.md
data_preparation.py		data_preparation.py
id_fault.csv		id_fault.csv
id_service.csv		id_service.csv
requirements.txt		requirements.txt

jzxycsjzy/multi_agent_anomaly_detection

Folders and files

Latest commit

History

Repository files navigation

Multi Agent Anomaly Detection

Environment requirement

Python environment

Other Requirement

Dataset

Quick Start

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages