This project builds upon the foundation of the browser-use, which is designed to make websites accessible for AI agents. We have enhanced the original capabilities by providing:
-
A Brand New WebUI: We offer a comprehensive web interface that supports a wide range of
browser-use
functionalities. This UI is designed to be user-friendly and enables easy interaction with the browser agent. -
Expanded LLM Support: We've integrated support for various Large Language Models (LLMs), including: Gemini, OpenAI, Azure OpenAI, Anthropic, DeepSeek, Ollama etc. And we plan to add support for even more models in the future.
-
Custom Browser Support: You can use your own browser with our tool, eliminating the need to re-login to sites or deal with other authentication challenges. This feature also supports high-definition screen recording.
-
Customized Agent: We've implemented a custom agent that enhances
browser-use
with Optimized prompts.
0103.mp4
- Python Version: Ensure you have Python 3.11 or higher installed.
- Install
browser-use
:pip install browser-use
- Install Playwright:
playwright install
- Install Dependencies:
pip install -r requirements.txt
- Configure Environment Variables:
- Copy
.env.example
to.env
and set your environment variables, including API keys for the LLM. - If using your own browser:
- Set
CHROME_PATH
to the executable path of your browser (e.g.,C:\Program Files\Google\Chrome\Application\chrome.exe
on Windows). - Set
CHROME_USER_DATA
to the user data directory of your browser (e.g.,C:\Users\<YourUsername>\AppData\Local\Google\Chrome\User Data
).
- Set
- Copy
- Run the WebUI:
python webui.py --ip 127.0.0.1 --port 7788
- Access the WebUI: Open your web browser and navigate to
http://127.0.0.1:7788
. - Using Your Own Browser:
- Close all chrome windows
- Open the WebUI in a non-Chrome browser, such as Firefox or Edge. This is important because the persistent browser context will use the Chrome data when running the agent.
- Check the "Use Own Browser" option within the Browser Settings.