Skip to content

Commit

Permalink
fix bugs
Browse files Browse the repository at this point in the history
  • Loading branch information
baptistecolle committed Oct 14, 2024
1 parent 32fb3ed commit db02e45
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion examples/trt_llama.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ backend:
scenario:
input_shapes:
batch_size: 4
sequence_length: 128
sequence_length: 256
generate_kwargs:
max_new_tokens: 100
min_new_tokens: 100
2 changes: 1 addition & 1 deletion optimum_benchmark/backends/tensorrt_llm/config.py
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ class TRTLLMConfig(BackendConfig):
world_size: int = 1
gpus_per_node: int = 1

max_prompt_length: int = 128
max_prompt_length: int = 256
max_new_tokens: int = -1
max_batch_size: int = 1
max_beam_width: int = 1
Expand Down

0 comments on commit db02e45

Please sign in to comment.