lix19937/tensorrt-insight

Deep insight into TensorRT, including but not limited to QAT, PTQ, plugins, Triton Inference Server, and CUDA.

TensorRT is NVIDIA's semi-open-source, high-performance AI inference engine framework/library, spanning NVIDIA GPU architectures. It provides C++/Python interfaces and a user-defined plugin mechanism, and it covers the main aspects of AI inference engine technology.
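
As a quick orientation, here is a minimal sketch of the Python interface: parsing an ONNX model and building a serialized engine. It assumes TensorRT 8.x with an explicit-batch network; the file names `model.onnx` and `model.plan` are placeholders.

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)

# Explicit-batch network definition (required by the ONNX parser).
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:          # placeholder model path
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("ONNX parse failed")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)        # allow FP16 kernels where profitable

engine_bytes = builder.build_serialized_network(network, config)
with open("model.plan", "wb") as f:          # serialized engine for the runtime
    f.write(engine_bytes)
```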

| topic | description | notes |
| --- | --- | --- |
| overview | Overview | |
| layout | Memory layout | |
| compute_graph_optimize | Compute-graph optimization | |
| dynamic_shape | Dynamic shapes | see the sketch after this table |
| plugin | Plugins | |
| calibration | Calibration (INT8 PTQ) | see the sketch after this table |
| asp | Sparsity (ASP) | |
| qat | Quantization-aware training | |
| trtexec | OSS auxiliary tool | |
| tool | Helper scripts | |
| runtime | Runtime | |
| inferflow | Model scheduling | |
| mps | MPS (CUDA Multi-Process Service) | |
| deploy | ONNX-based deployment workflow; using the TensorRT tools | |
| py-tensorrt | Python TensorRT bindings; walkthrough of `tensorrt.__init__` | |
| model_benchmark | Model performance benchmarking | |
| cookbook | Cookbook | |
| incubator | Incubator | |
| developer_guide | Developer guide | |
| triton-inference-server | Triton Inference Server | |
| cuda | CUDA programming | |
| onnxruntime op | ONNX Runtime custom ops; graph-optimization aids; per-layer output alignment | |
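
Two of the topics above lend themselves to short illustrations. For dynamic_shape, a minimal sketch continuing the `builder`/`config` objects from the example above: an optimization profile tells the builder which shape range each dynamic input may take. The tensor name "input" and the shape ranges are placeholders.

```python
# Dynamic shapes: declare min/opt/max shapes for each dynamic input.
profile = builder.create_optimization_profile()
profile.set_shape("input",             # placeholder input tensor name
                  (1, 3, 224, 224),    # min shape
                  (8, 3, 224, 224),    # opt shape (kernels tuned for this)
                  (16, 3, 224, 224))   # max shape
config.add_optimization_profile(profile)
```

For calibration (INT8 PTQ), the builder pulls batches through a user-supplied calibrator. A minimal sketch, assuming pycuda for device buffers and a list of preprocessed `np.float32` batches; `calib.cache` is a placeholder cache path.

```python
import numpy as np
import pycuda.autoinit  # noqa: F401 -- creates a default CUDA context
import pycuda.driver as cuda
import tensorrt as trt

class EntropyCalibrator(trt.IInt8EntropyCalibrator2):
    """Feeds preprocessed batches to the builder during INT8 calibration."""

    def __init__(self, batches, cache_file="calib.cache"):
        super().__init__()                     # required by the TensorRT bindings
        self.batch_size = batches[0].shape[0]
        self.device_input = cuda.mem_alloc(batches[0].nbytes)
        self.batches = iter(batches)           # iterable of np.float32 arrays
        self.cache_file = cache_file           # placeholder cache path

    def get_batch_size(self):
        return self.batch_size

    def get_batch(self, names):
        try:
            batch = next(self.batches)
        except StopIteration:
            return None                        # no more data: calibration finishes
        cuda.memcpy_htod(self.device_input, np.ascontiguousarray(batch))
        return [int(self.device_input)]        # one device pointer per input

    def read_calibration_cache(self):
        try:
            with open(self.cache_file, "rb") as f:
                return f.read()                # reuse a previous calibration run
        except FileNotFoundError:
            return None

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)

# Hook the calibrator into the builder config from the first sketch:
# config.set_flag(trt.BuilderFlag.INT8)
# config.int8_calibrator = EntropyCalibrator(batches)
```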

References

https://docs.nvidia.com/deeplearning/tensorrt/archives/
https://developer.nvidia.com/search?page=1&sort=relevance&term=
https://github.com/HeKun-NVIDIA/TensorRT-Developer_Guide_in_Chinese/tree/main
https://docs.nvidia.com/deeplearning/tensorrt/migration-guide/index.html
https://developer.nvidia.com/zh-cn/blog/nvidia-gpu-fp8-training-inference/
