|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
Repository Summary
| Description | Performance benchmarking for NVIDIA-accelerated Isaac ROS packages |
| Checkout URI | https://github.com/nvidia-isaac-ros/isaac_ros_benchmark.git |
| VCS Type | git |
| VCS Version | main |
| Last Updated | 2025-10-26 |
| Dev Status | UNKNOWN |
| Released | UNRELEASED |
| Tags | benchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
| Contributing |
Help Wanted (-)
Good First Issues (-) Pull Requests to Review (-) |
Packages
README
Isaac ROS Benchmark
Performance benchmarking for NVIDIA-accelerated Isaac ROS packages.
Overview
Isaac ROS Benchmark builds upon the ros2_benchmark to provide configurations to benchmark Isaac ROS graphs. Performance results that measure Isaac ROS for throughput, latency, and utilization enable robotics developers to make informed decisions when designing real-time robotics applications. The Isaac ROS performance results can be independently verified, as the method, configuration, and data input used for benchmarking are provided.
The ros2_benchmark playback node plug-in, for type adaptation and
negotiation, is provided for
NITROS, which
optimizes the performance of message transport costs through
RCL with GPU accelerated graphs of
nodes.
The datasets for benchmarking are explicitly not downloaded by default. To pull down the standardized benchmark datasets, refer to the ros2_benchmark Dataset section.
Documentation
Please visit the Isaac ROS Documentation to learn how to use this repository.
Packages
Latest
Update 2025-10-24: Added new benchmarks for Isaac ROS 4.0
Performance
| Node |
Input Size |
AGX Thor |
x86_64 w/ RTX 5090 |
|---|---|---|---|
|
AprilTag Node |
720p |
392 fps 10 ms @ 30Hz |
596 fps 2.3 ms @ 30Hz |
|
FoundationPose Pose Estimation Node |
720p |
3.92 fps 260 ms @ 30Hz |
10.1 fps 89 ms @ 30Hz |
|
DNN Stereo Disparity Node Full |
576p |
178 fps 22 ms @ 30Hz |
350 fps 5.6 ms @ 30Hz |
|
DNN Stereo Disparity Node Light |
288p |
350 fps 9.4 ms @ 30Hz |
350 fps 5.0 ms @ 30Hz |
|
Stereo Disparity Node |
1080p |
217 fps 51 ms @ 30Hz |
863 fps 4.1 ms @ 30Hz |
|
Rectify Node |
1080p |
1800 fps 3.4 ms @ 30Hz |
2500 fps 2.8 ms @ 30Hz |
|
TensorRT Node DOPE |
VGA |
180 fps 7.1 ms @ 30Hz |
321 fps 5.2 ms @ 30Hz |
|
Triton Node DOPE |
VGA |
164 fps 32 ms @ 30Hz |
281 fps 4.5 ms @ 30Hz |
|
TensorRT Node PeopleSemSegNet |
544p |
589 fps 7.6 ms @ 30Hz |
557 fps 3.1 ms @ 30Hz |
|
Triton Node PeopleSemSegNet |
544p |
365 fps 9.6 ms @ 30Hz |
475 fps 5.1 ms @ 30Hz |
|
DNN Image Encoder Node |
VGA |
416 fps 13 ms @ 30Hz |
446 fps 9.4 ms @ 30Hz |
|
Occupancy Grid Localizer Node |
~50 sq. m |
28.3 fps 58 ms @ 30Hz |
50.1 fps 12 ms @ 30Hz |
|
H.264 Decoder Node |
1080p |
596 fps 10 ms @ 30Hz |
596 fps 4.0 ms @ 30Hz |
|
H.264 Encoder Node I-frame Support |
1080p |
390 fps 11 ms @ 30Hz |
596 fps 3.9 ms @ 30Hz |
|
H.264 Encoder Node P-frame Support |
1080p |
429 fps 8.6 ms @ 30Hz |
596 fps 4.4 ms @ 30Hz |
|
Nvblox Node |
– |
4.94 fps 89.8 ms |
4.96 fps 23.1 ms |
| Graph |
Input Size |
AGX Thor |
x86_64 w/ RTX 5090 |
|---|---|---|---|
|
AprilTag Graph |
720p |
366 fps 13 ms @ 30Hz |
596 fps 4.1 ms @ 30Hz |
|
Centerpose Pose Estimation Graph |
VGA |
50.5 fps 50 ms @ 30Hz |
50.2 fps 16 ms @ 30Hz |
|
DOPE Pose Estimation Graph |
VGA |
138 fps 24 ms @ 30Hz |
199 fps 14 ms @ 30Hz |
|
DNN Stereo Disparity Graph Full |
576p |
73.6 fps 29 ms @ 30Hz |
348 fps 8.5 ms @ 30Hz |
|
DNN Stereo Disparity Graph Light |
288p |
219 fps 17 ms @ 30Hz |
350 fps 7.3 ms @ 30Hz |
|
Stereo Disparity Graph |
1080p |
223 fps 5.6 ms @ 30Hz |
634 fps 6.6 ms @ 30Hz |
|
DetectNet Object Detection Graph |
544p |
143 fps 27 ms @ 30Hz |
242 fps 17 ms @ 30Hz |
|
Grounding DINO Object Detection Graph |
544p |
22.9 fps 70 ms @ 30Hz |
144 fps 15 ms @ 30Hz |
|
RT-DETR Object Detection Graph SyntheticaDETR |
720p |
219 fps 23 ms @ 30Hz |
457 fps 8.0 ms @ 30Hz |
|
TensorRT Graph PeopleSemSegNet |
544p |
460 fps 15 ms @ 30Hz |
510 fps 12 ms @ 30Hz |
|
SAM Image Segmentation Graph Full SAM |
720p |
2.23 fps 270 ms @ 30Hz |
20.8 fps 58 ms @ 30Hz |
|
SAM Image Segmentation Graph Mobile SAM |
720p |
21.0 fps 580 ms @ 30Hz |
74.1 fps 26 ms @ 30Hz |
CONTRIBUTING
Isaac ROS Contribution Rules
Any contribution that you make to this repository will be under the Apache 2 License, as dictated by that license:
5. Submission of Contributions. Unless You explicitly state otherwise, any Contribution intentionally submitted for inclusion in the Work by You to the Licensor shall be under the terms and conditions of this License, without any additional terms or conditions. Notwithstanding the above, nothing herein shall supersede or modify the terms of any separate license agreement you may have executed with Licensor regarding such Contributions.
Contributors must sign-off each commit by adding a Signed-off-by: ...
line to commit messages to certify that they have the right to submit
the code they are contributing to the project according to the
Developer Certificate of Origin (DCO).
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
|
isaac_ros_benchmark repositorybenchmarking performance gpu nvidia performance-testing jetson ros2 ros2-humble |
