Add asyncio support for tritonclient (beta) #23

kimdwkimdw · 2024-11-20T08:58:14Z

Overview

This PR adds asyncio support for the Triton client library by integrating with tritonclient's asyncio modules (currently in beta status). According to the official documentation, Python asyncio support is currently in beta. While this implementation enables fully asynchronous inference requests which can improve performance for concurrent workloads, users should be aware of the beta status of the underlying functionality.

Key Changes

Added create_with_asyncio() factory method to create async-capable InferenceClient
Implemented async client initialization and model configuration fetching
Added new aio_infer() method for async inference requests
Added test coverage for async functionality
Added new sample model (sample_sleep_1sec) for testing concurrent requests
Updated Docker image to 24.05-pyt-python-py3

Test Results

Basic async inference tests passing
Concurrent request tests showing proper parallelization
Performance test with sample_sleep_1sec model shows ~10 concurrent requests completing in under 2 seconds

Usage Example

# Create async client
client = InferenceClient.create_with_asyncio(
    "model_name",
    "localhost:8001",
    protocol="grpc"
)

# Run async inference
result = await client.aio_infer(input_data)

Implementation Notes

Uses native asyncio support from tritonclient's aio modules (beta feature)
Maintains backwards compatibility with existing sync interfaces
Proper resource cleanup and error handling for async operations
Configurable via same parameters as sync client
Beta status means the API and behavior may change in future releases

Testing Notes

Added pytest-asyncio for async test support
New test cases specifically for async functionality
Concurrent request testing with artificial delays

homura-rtzr

LGTM

ancom21c

LGTM 👍

kimdwkimdw added 2 commits November 20, 2024 17:11

Add aio_grpcclient, aio_httpclient

390cc32

add test_multiple_tasks test-case

8cb4904

kimdwkimdw requested review from ancom21c and homura-rtzr November 20, 2024 08:58

homura-rtzr approved these changes Nov 20, 2024

View reviewed changes

ancom21c approved these changes Nov 20, 2024

View reviewed changes

tritony==0.0.19

32f6a0e

kimdwkimdw merged commit 58933e2 into main Dec 4, 2024
1 check passed

kimdwkimdw deleted the feature/support_aio_tritonclient branch December 4, 2024 06:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add asyncio support for tritonclient (beta) #23

Add asyncio support for tritonclient (beta) #23

kimdwkimdw commented Nov 20, 2024 •

edited

Loading

homura-rtzr left a comment

ancom21c left a comment

Add asyncio support for tritonclient (beta) #23

Add asyncio support for tritonclient (beta) #23

Conversation

kimdwkimdw commented Nov 20, 2024 • edited Loading

Overview

Key Changes

Test Results

Usage Example

Implementation Notes

Testing Notes

homura-rtzr left a comment

Choose a reason for hiding this comment

ancom21c left a comment

Choose a reason for hiding this comment

kimdwkimdw commented Nov 20, 2024 •

edited

Loading