Skip to content

Issues: NVIDIA/TensorRT-LLM

Beta
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

[AutoDeploy] Investigate DemoLLM Token Generation AutoDeploy bug Something isn't working
#4841 opened Jun 2, 2025 by lucaslie
Title: KeyError: 'gemma3' error in GemmaConfig.from_hugging_face when converting Gemma 3 model bug Something isn't working triaged Issue has been triaged by maintainers
#4825 opened Jun 2, 2025 by bebilli
2 of 4 tasks
Driver crash during warmup of DeepSeek-R1-FP4 bug Something isn't working
#4816 opened May 31, 2025 by pathorn
1 of 4 tasks
The output of Gemma 3 4B for TensorRT and Transformers is not the same, even when using float32 bug Something isn't working triaged Issue has been triaged by maintainers
#4815 opened May 31, 2025 by Alireza3242
1 of 4 tasks
Feature support: eagle multimodal inputs feature request New feature or request. This includes new model, dtype, functionality support
#4787 opened May 30, 2025 by liyi-xia
How is the performance of the model with pytorch as the backend Investigating Performance TRTLLM model inference speed, throughput, efficiency. Latency, benchmarks, regressions, opts. triaged Issue has been triaged by maintainers
#4745 opened May 29, 2025 by oppolll
DeepSeek-R1-FP4 crashes when MTP is enabled bug Something isn't working
#4708 opened May 27, 2025 by Shang-Pin
1 of 4 tasks
PluginConfig object has no attribute _paged_kv_cache question Further information is requested
#4701 opened May 27, 2025 by Yoloex
4 tasks
[AutoDeploy] Weight Fusion Revisited AutoDeploy bug Something isn't working
#4674 opened May 27, 2025 by lucaslie
Support for Devstral with pytorch backend triaged Issue has been triaged by maintainers
#4653 opened May 26, 2025 by ankitmaurya001
ProTip! Follow long discussions with comments:>50.