
🚀 OpenAdapter

Deploy & Integrate Screenshot Parsing and Action Models

OpenAdapter simplifies deploying advanced screenshot parsing and action models on local or managed systems, supporting secure automation. Initial deployment options include AWS, with future support for Anthropic and OpenAI APIs.

✨ Key Features

  • Automated Cloud Deployment: Deploy models on AWS EC2, with planned Anthropic/OpenAI API support and PII/PHI scrubbing.
  • Configurable with .env: Simplifies setup with environment variables.
  • Cost-Efficiency: Deploy high-performance instances on demand, with intelligent caching and resource pause/stop features to reduce costs.
  • Container & API Compatibility: Supports Dockerized models like OmniParser and Set-of-Mark, with future support for Anthropic and OpenAI APIs.
  • CI/CD with GitHub Actions: Automated integration and deployment ensure consistent updates.
  • Dataset Preparation and Fine-Tuning: Record user interaction data (screenshots and actions) directly from your application, then use OpenAdapter’s tools to prepare datasets and fine-tune models on that data.

Prerequisites

  • Python 3.10+
  • AWS Account (support for Hugging Face, Anthropic, and OpenAI APIs planned)
  • Environment Variables: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, GITHUB_TOKEN

⚙️ Setup

  1. Clone and Enter Repository:

    git clone https://github.com/OpenAdaptAI/OpenAdapter.git && cd OpenAdapter
  2. Configure Environment (create a .env file in the repository root):

    AWS_ACCESS_KEY_ID=<your aws access key>
    AWS_SECRET_ACCESS_KEY=<your aws secret access key>
    AWS_REGION=us-east-1
    GITHUB_OWNER=OpenAdaptAI
    GITHUB_REPO=<your repo>
    GITHUB_TOKEN=<your github token>
    
  3. Install Requirements Using uv:

    pip install uv
    uv venv  # Create a virtual environment (activate with: source .venv/bin/activate)
    uv pip sync  # Install dependencies
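
For reference, a minimal sketch of reading these variables in Python, assuming the python-dotenv package (the exact mechanism OpenAdapterConfig uses to load .env is not shown here):

import os
from dotenv import load_dotenv  # pip install python-dotenv

load_dotenv()  # read key=value pairs from .env into the process environment

# Fail fast if a required variable is missing
required = ["AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "GITHUB_TOKEN"]
missing = [name for name in required if not os.getenv(name)]
if missing:
    raise RuntimeError(f"Missing environment variables: {', '.join(missing)}")

print("AWS region:", os.getenv("AWS_REGION", "us-east-1"))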

💡 Usage

OpenAdapter provides commands to deploy and manage model instances and capture user interactions for fine-tuning. You can record, train, and tune models with your custom dataset, all managed within the OpenAdapter environment.

Recording User Interactions

To capture user interactions such as screenshots and actions, use OpenAdapter’s record command:

python -m openadapter.record "doing taxes"

This command will save the actions in a database file:

Actions saved to ~/openadapter/recording.db
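
The .db file can be inspected directly; a minimal sketch using Python's standard library, assuming the recording is stored as a SQLite database (table names depend on OpenAdapter's schema, which is not documented here):

import sqlite3
from pathlib import Path

db_path = Path.home() / "openadapter" / "recording.db"  # adjust to your recording location

with sqlite3.connect(db_path) as conn:
    # List the captured tables and their row counts
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type='table' ORDER BY name"
    ).fetchall()
    for (name,) in tables:
        count = conn.execute(f"SELECT COUNT(*) FROM {name}").fetchone()[0]
        print(f"{name}: {count} rows")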

Preparing and Fine-Tuning the Model

Use the recorded data to prepare a dataset and fine-tune your model:

  1. Prepare Dataset:

    python -m openadapter.train.prepare ~/openadapter/recording.db

    Example output:

    Preparing dataset from: ~/openadapter/recording.db
    Dataset prepared at ~/openadapter/prepared_data
    
  2. Fine-Tune Model: After preparing the dataset, specify the paths to the prepared data for fine-tuning:

    python -m openadapter.train.tune --caption_model_path=~/openadapter/prepared_data/caption_model --som_model_path=~/openadapter/prepared_data/som_model

This flow enables OpenAdapter to use custom datasets created with OpenAdapt for more accurate action detection and screenshot parsing. Adjust paths based on your local setup.
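
The same two steps can also be scripted; a minimal sketch that shells out to the documented commands (paths are illustrative and should match your setup):

import subprocess
from pathlib import Path

home = Path.home()
recording = home / "openadapter" / "recording.db"
prepared = home / "openadapter" / "prepared_data"

# 1. Prepare the dataset from the recording
subprocess.run(
    ["python", "-m", "openadapter.train.prepare", str(recording)], check=True
)

# 2. Fine-tune using the prepared caption and SoM model data
subprocess.run(
    [
        "python", "-m", "openadapter.train.tune",
        f"--caption_model_path={prepared / 'caption_model'}",
        f"--som_model_path={prepared / 'som_model'}",
    ],
    check=True,
)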

Deployment Example (OmniParser)

Deploy OmniParser using an AWS GPU instance with OpenAdapter:

from openadapter.server import OpenAdapterConfig, Deploy
import fire

class OmniParserDeploy(Deploy):
    def __init__(self):
        super().__init__(config=OpenAdapterConfig())

if __name__ == "__main__":
    fire.Fire(OmniParserDeploy)
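
Because fire.Fire exposes the class's methods as CLI subcommands, the same object can also be driven from Python; a minimal sketch assuming Deploy provides start(), status(), and stop() methods (implied by the commands below, not confirmed here):

# Hypothetical programmatic use; method names mirror the CLI commands below.
deploy = OmniParserDeploy()
deploy.start()           # provision the AWS GPU instance and launch the container
print(deploy.status())   # report server status and IP
deploy.stop()            # tear the instance down when finished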

Commands

Deploy OmniParser on an AWS GPU instance:

python -m oa.deploy start

Interact with the model server:

python -m oa.client "http://<server_ip>:7861"

Pause or stop the instance:

python -m oa.deploy pause
python -m oa.deploy stop

Get server status and IP:

python -m oa.deploy status

SSH Access and Logs

SSH into the server, list Docker containers, and monitor logs:

python -m oa.deploy ssh
% docker ps -a
...<container_id>...
% docker logs -f <container_id>

Cost

  • Estimate: Approx. $10/day on AWS g4dn.xlarge (100GB storage). Intelligent caching helps reduce costs by reusing data across runs.
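
For a rough breakdown of that estimate (a back-of-envelope sketch; the rates below are assumptions, since on-demand pricing varies by region and changes over time):

# Back-of-envelope daily cost; rates are assumptions, not current AWS pricing
instance_rate = 0.526        # USD/hour, g4dn.xlarge on-demand (us-east-1, approximate)
storage_rate = 0.08 / 30     # USD per GB per day, gp3 EBS (approximate)
hours_running = 18           # pausing/stopping when idle reduces this

daily_cost = instance_rate * hours_running + storage_rate * 100  # 100 GB volume
print(f"~${daily_cost:.2f}/day")  # ≈ $9.73/day under these assumptions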

Modular Cloud Backends

  • AWS EC2: Current backend, with flexible instance options and security.
  • Planned: Hugging Face, Anthropic, OpenAI; future support for GCP and Azure.

Integrations

OpenAdapter works seamlessly with OpenAdapt to build datasets and automate models. It can also function as a standalone solution for deploying and managing models in automated environments.

Requirements

Core Requirements

  • Python 3.10 or higher

Optional Components

Install specific dependencies based on the use case:

  1. Recording: Required for capturing user interactions.

    pip install openadapter[record]
  2. Training: Includes dependencies for preparing datasets and fine-tuning models.

    pip install openadapter[train]
  3. Deployment: Necessary for deploying models to production.

    pip install openadapter[deploy]
  4. Full Installation: Installs all dependencies for full-featured use.

    pip install openadapter[full]

Note: A GPU is recommended for training and fine-tuning tasks, especially when working with large models like YOLO and BLIP-2.

🛠️ Roadmap

  • AWS CDK Automation: Streamline Infrastructure as Code.
  • Container Optimization: Scale with ECS and ECR.
  • GPU & Intelligent Caching: Optimize cost and performance.
  • Secure Data Handling: Data encryption and VPC configurations.
  • Logging & Monitoring: Integrate with CloudWatch.
  • Serverless Options: AWS Lambda and Google Cloud Functions.
  • Cross-Cloud Flexibility: Extend to GCP, Azure, and additional model APIs.

License

MIT License.
