FiniteMonkey

FiniteMonkey is an advanced vulnerability mining engine powered purely by GPT, requiring no prior knowledge base or fine-tuning. Its effectiveness significantly surpasses most current related research approaches.

🌟 Core Philosophy

Task-driven, not question-driven
Prompt-driven, not code-driven
Focus on prompt design, not model design
Leveraging "deception" and hallucination as key mechanics

🏆 Results

As of May 2024, this tool has helped identify vulnerabilities worth over $60,000 in bounties.

🚀 Recent Updates

2024.11.19: Version 1.0 released - Demonstrating feasibility of LLM-based auditing and productization

Earlier Updates:

2024.08.02: Project renamed to finite-monkey-engine
2024.08.01: Added support for func, tact
2024.07.23: Added support for cairo, move
2024.07.01: Updated license
2024.06.01: Added Python language support
2024.05.18: Improved false positive reduction (~20%)
2024.05.16: Added cross-contract vulnerability confirmation
2024.04.29: Added basic Rust language support

📋 Prerequisites

PostgreSQL database
OpenAI API access
Python environment

🛠️ Setup & Configuration

Place project under src/dataset/agent-v1-c4
Configure project in datasets.json:

{
    "StEverVault2": {
        "path": "StEverVault",
        "files": [],
        "functions": []
    }
}

Create database using src/db.sql
Configure .env:

# Database Connection
DATABASE_URL=postgresql://user:password@localhost:5432/dbname

# API Settings
OPENAI_API_BASE="api.example.com"
OPENAI_API_KEY=sk-your-api-key-here

# Model Settings
VUL_MODEL_ID=gpt-4-turbo
CLAUDE_MODEL=claude-3-5-sonnet-20240620

# Azure Configuration
AZURE_API_KEY="your-azure-api-key"
AZURE_API_BASE="https://your-resource.openai.azure.com/"
AZURE_API_VERSION="2024-02-15-preview"
AZURE_DEPLOYMENT_NAME="your-deployment"

# API Choice
AZURE_OR_OPENAI="OPENAI"  # Options: OPENAI, AZURE, CLAUDE

# Scan Parameters
BUSINESS_FLOW_COUNT=4
SWITCH_FUNCTION_CODE=False
SWITCH_BUSINESS_CODE=True

# Scan Focus Configuration
# SCAN_FOCUS=[
#     "Contract1",
#     "Contract2",
#     "Contract3"
# ]

🌈 Supported Languages

Solidity (.sol)
Rust (.rs)
Python (.py)
Move (.move)
Cairo (.cairo)
Tact (.tact)
Func (.fc)
Java (.java)
Fake Solidity (.fr) - For scanning Solidity pseudocode

📊 Scanning Results Guide

Scans can be resumed if interrupted due to network/API issues by rerunning main.py with same project_id
Strongly recommend using GPT-4-turbo - GPT-3.5 and GPT-4.0 have inferior reasoning capabilities
Results are marked with detailed annotations and Chinese explanations:
- Prioritize entries with "result":"yes" in result column
- Filter for "dont need In-project other contract" in category column
- Check business_flow_code column for specific code
- Reference name column for code locations

🎯 Important Notes

Best suited for logic vulnerability mining in real projects
Not recommended for academic vulnerability testing
GPT-4-turbo recommended for optimal results
Average scan time: 2-3 hours for medium projects
Cost estimate: $20-30 for medium projects with 10 iterations
Current false positive rate: 30-65% depending on project size

🔍 Technical Notes

GPT-4 provides better results, GPT-3 not thoroughly tested
The tricky prompt theory can be adapted for any language with minor modifications
ANTLR AST parsing support recommended for better code slicing results
Currently supports Solidity with plans for expansion

🗺️ Roadmap

Code structure optimization
Additional language support
Documentation and code analysis
Command line interface implementation

🛡️ Scanning Characteristics

Excellent at code comprehension and logic vulnerability detection
Less effective for control flow vulnerability detection
Designed for real-world projects rather than academic test cases

💡 Implementation Tips

Each scan preserves progress automatically
GPT-4-turbo provides optimal performance compared to other models
Medium projects with 10 iterations take approximately 2.5 hours
Results include detailed categorization and Chinese explanations

📝 License

Apache License 2.0

🤝 Contributing

Contributions welcome! Please feel free to submit pull requests.

Note: The name is inspired by Large Language Monkeys paper

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
.~output.xlsx		.~output.xlsx
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
start.sh		start.sh
test.java		test.java
test.java_analysis.json		test.java_analysis.json
test_java.py		test_java.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FiniteMonkey

🌟 Core Philosophy

🏆 Results

🚀 Recent Updates

📋 Prerequisites

🛠️ Setup & Configuration

🌈 Supported Languages

📊 Scanning Results Guide

🎯 Important Notes

🔍 Technical Notes

🗺️ Roadmap

🛡️ Scanning Characteristics

💡 Implementation Tips

📝 License

🤝 Contributing

About

Releases

Packages

Contributors 3

Languages

License

BradMoonUESTC/finite-monkey-engine

Folders and files

Latest commit

History

Repository files navigation

FiniteMonkey

🌟 Core Philosophy

🏆 Results

🚀 Recent Updates

📋 Prerequisites

🛠️ Setup & Configuration

🌈 Supported Languages

📊 Scanning Results Guide

🎯 Important Notes

🔍 Technical Notes

🗺️ Roadmap

🛡️ Scanning Characteristics

💡 Implementation Tips

📝 License

🤝 Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages