-
Notifications
You must be signed in to change notification settings - Fork 431
Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Feature] Lmdeploy推理InternVL2系列是否支持在输入图片层级(假设输入多图)设定max_num以及use_thumbnali的超参
#2824
opened Nov 27, 2024 by
laserwave
[Bug] 4-bit KV cache results in very low performance while 8-bit KV cache is almost lossless
#2822
opened Nov 26, 2024 by
fzyzcjy
3 tasks done
[Bug] Why the example/cpp/llama/llama_triton_example.cc does not call the LlamaTritonModel::createEngine()?
#2803
opened Nov 25, 2024 by
BigFaceBoy
3 tasks done
[Feature] 可以考虑支持THUDM/cogvlm2-llama3-chinese-chat-19B-int4模型吗
#2799
opened Nov 24, 2024 by
Jerryporter
[Bug] Does PytorchEngine Visual Model Support Prefix Caching?
#2789
opened Nov 21, 2024 by
OftenDream
3 tasks
[Bug] Llama-3.2-1B-Instruct and InternVL2-1B does not supported kvin4, is that expected?
#2786
opened Nov 21, 2024 by
zhulinJulia24
3 tasks
[Bug] Response of converted Qwen2-57B-A14B-Instruct-GPTQ-Int4 returns garbled characters
#2785
opened Nov 21, 2024 by
zhulinJulia24
3 tasks
[Bug] SystemExit: 1 asyncio.exceptions.TimeoutError
#2782
opened Nov 20, 2024 by
LIUKAI0815
2 of 3 tasks
[Bug] Qwen2.5无法跑通tools call(官方案例代码)
awaiting response
#2775
opened Nov 20, 2024 by
turkeymz
3 tasks done
[Bug] The quantization process of Qwen/Qwen2-VL-7B-Instruct is getting killed without throwing error.
#2770
opened Nov 19, 2024 by
vjaideep08
3 tasks done
[Bug] 昇腾910B通过lmdeploy镜像,使用qwen2-vl-7b模型,推理过程报错: call aclnnBatchMatMul failed
#2769
opened Nov 18, 2024 by
fusmile0101
1 of 3 tasks
How can I specify the rope scaling type when starting the API server?
#2768
opened Nov 18, 2024 by
snachx
[Bug] The script "profile_generation.py" went haywire and crashed
#2760
opened Nov 15, 2024 by
yuchiwang
3 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.