[Versioning] Bump v0.8.0 #1257

Merged
merged 34 commits into from
Mar 27, 2025

vmoens
Collaborator

@vmoens commented Mar 10, 2025

[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: 76f6b33
Pull Request resolved: #1257
@facebook-github-bot added the CLA Signed label Mar 10, 2025
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: 8ba7372
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: a4353c6
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: aa9617a
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: b44d63e
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: 647851d
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 10, 2025
ghstack-source-id: d3885be
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: 32d4af0
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: 9cc1cea
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: e3cadad
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: a8a339c
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: e7760f1
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: fa34eaa
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: 571c3c4
Pull Request resolved: #1257
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: 1e0e98c
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: 4a45dde
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: d3efcdb
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: 427e064
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 11, 2025
ghstack-source-id: e8d98be
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 12, 2025
ghstack-source-id: 284900f
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 12, 2025
ghstack-source-id: 163d484
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 12, 2025
ghstack-source-id: be2c9b9
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 12, 2025
ghstack-source-id: 340b86a
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 12, 2025
ghstack-source-id: 83b7dfd
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 21, 2025
ghstack-source-id: efe4ed8
Pull Request resolved: #1257
[ghstack-poisoned]
vmoens pushed a commit that referenced this pull request Mar 22, 2025
ghstack-source-id: 817f2bc
Pull Request resolved: #1257
[ghstack-poisoned]
@vmoens merged commit 4374aaa into gh/vmoens/50/base Mar 27, 2025
20 of 34 checks passed
vmoens pushed a commit that referenced this pull request Mar 27, 2025
ghstack-source-id: b95089e
Pull Request resolved: #1257
@vmoens deleted the gh/vmoens/50/head branch March 27, 2025 08:05

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}10$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 45.3050μs 21.2707μs 47.0130 KOps/s 46.7342 KOps/s $\color{#35bf28}+0.60\%$
test_plain_set_stack_nested 58.0880μs 21.4793μs 46.5565 KOps/s 45.9012 KOps/s $\color{#35bf28}+1.43\%$
test_plain_set_nested_inplace 67.6060μs 23.0137μs 43.4524 KOps/s 42.3880 KOps/s $\color{#35bf28}+2.51\%$
test_plain_set_stack_nested_inplace 55.3230μs 23.0236μs 43.4338 KOps/s 42.4318 KOps/s $\color{#35bf28}+2.36\%$
test_items 30.5870μs 4.1841μs 239.0002 KOps/s 236.5207 KOps/s $\color{#35bf28}+1.05\%$
test_items_nested 0.8538ms 0.4081ms 2.4507 KOps/s 2.4766 KOps/s $\color{#d91a1a}-1.05\%$
test_items_nested_locked 1.0283ms 0.4172ms 2.3967 KOps/s 2.4412 KOps/s $\color{#d91a1a}-1.82\%$
test_items_nested_leaf 0.1601ms 76.9509μs 12.9953 KOps/s 13.1231 KOps/s $\color{#d91a1a}-0.97\%$
test_items_stack_nested 0.5969ms 0.4122ms 2.4263 KOps/s 2.4701 KOps/s $\color{#d91a1a}-1.77\%$
test_items_stack_nested_leaf 0.3700ms 77.4923μs 12.9045 KOps/s 13.0478 KOps/s $\color{#d91a1a}-1.10\%$
test_items_stack_nested_locked 0.5661ms 0.4120ms 2.4271 KOps/s 2.4578 KOps/s $\color{#d91a1a}-1.25\%$
test_keys 21.7710μs 3.7130μs 269.3266 KOps/s 275.9065 KOps/s $\color{#d91a1a}-2.38\%$
test_keys_nested 0.2759ms 0.1669ms 5.9924 KOps/s 6.0249 KOps/s $\color{#d91a1a}-0.54\%$
test_keys_nested_locked 1.6777ms 0.1757ms 5.6902 KOps/s 5.7939 KOps/s $\color{#d91a1a}-1.79\%$
test_keys_nested_leaf 0.1976ms 0.1461ms 6.8470 KOps/s 6.9289 KOps/s $\color{#d91a1a}-1.18\%$
test_keys_stack_nested 0.2527ms 0.1675ms 5.9706 KOps/s 6.0539 KOps/s $\color{#d91a1a}-1.38\%$
test_keys_stack_nested_leaf 0.2087ms 0.1464ms 6.8310 KOps/s 6.9587 KOps/s $\color{#d91a1a}-1.83\%$
test_keys_stack_nested_locked 0.3095ms 0.1742ms 5.7402 KOps/s 5.8552 KOps/s $\color{#d91a1a}-1.96\%$
test_values 9.8462μs 1.0551μs 947.7805 KOps/s 941.3581 KOps/s $\color{#35bf28}+0.68\%$
test_values_nested 0.1131ms 66.6354μs 15.0070 KOps/s 15.0193 KOps/s $\color{#d91a1a}-0.08\%$
test_values_nested_locked 0.1271ms 66.7268μs 14.9865 KOps/s 15.1041 KOps/s $\color{#d91a1a}-0.78\%$
test_values_nested_leaf 0.1176ms 74.8434μs 13.3612 KOps/s 13.3484 KOps/s $\color{#35bf28}+0.10\%$
test_values_stack_nested 0.1220ms 67.3631μs 14.8449 KOps/s 15.1684 KOps/s $\color{#d91a1a}-2.13\%$
test_values_stack_nested_leaf 0.1367ms 74.7891μs 13.3709 KOps/s 13.6403 KOps/s $\color{#d91a1a}-1.97\%$
test_values_stack_nested_locked 0.1339ms 66.7379μs 14.9840 KOps/s 15.1265 KOps/s $\color{#d91a1a}-0.94\%$
test_membership 6.4807μs 0.7173μs 1.3941 MOps/s 1.4572 MOps/s $\color{#d91a1a}-4.33\%$
test_membership_nested 23.3330μs 2.9215μs 342.2841 KOps/s 339.3417 KOps/s $\color{#35bf28}+0.87\%$
test_membership_nested_leaf 22.9030μs 2.9491μs 339.0917 KOps/s 337.0741 KOps/s $\color{#35bf28}+0.60\%$
test_membership_stacked_nested 36.5380μs 2.8770μs 347.5891 KOps/s 341.9098 KOps/s $\color{#35bf28}+1.66\%$
test_membership_stacked_nested_leaf 25.6980μs 2.9333μs 340.9170 KOps/s 339.3119 KOps/s $\color{#35bf28}+0.47\%$
test_membership_nested_last 22.4420μs 4.3459μs 230.1017 KOps/s 229.7454 KOps/s $\color{#35bf28}+0.16\%$
test_membership_nested_leaf_last 23.2730μs 4.3611μs 229.3014 KOps/s 230.9647 KOps/s $\color{#d91a1a}-0.72\%$
test_membership_stacked_nested_last 26.7100μs 4.3480μs 229.9908 KOps/s 231.0965 KOps/s $\color{#d91a1a}-0.48\%$
test_membership_stacked_nested_leaf_last 20.5880μs 4.3423μs 230.2927 KOps/s 229.7179 KOps/s $\color{#35bf28}+0.25\%$
test_nested_getleaf 50.0230μs 18.0384μs 55.4373 KOps/s 56.8539 KOps/s $\color{#d91a1a}-2.49\%$
test_nested_get 48.1800μs 17.0679μs 58.5895 KOps/s 59.8581 KOps/s $\color{#d91a1a}-2.12\%$
test_stacked_getleaf 62.3060μs 18.0282μs 55.4687 KOps/s 57.2789 KOps/s $\color{#d91a1a}-3.16\%$
test_stacked_get 42.0180μs 17.1626μs 58.2662 KOps/s 60.0160 KOps/s $\color{#d91a1a}-2.92\%$
test_nested_getitemleaf 49.0210μs 18.6360μs 53.6596 KOps/s 55.0096 KOps/s $\color{#d91a1a}-2.45\%$
test_nested_getitem 48.4100μs 17.7703μs 56.2735 KOps/s 57.3976 KOps/s $\color{#d91a1a}-1.96\%$
test_stacked_getitemleaf 45.5850μs 18.4757μs 54.1250 KOps/s 55.9629 KOps/s $\color{#d91a1a}-3.28\%$
test_stacked_getitem 46.9480μs 17.5623μs 56.9401 KOps/s 57.7128 KOps/s $\color{#d91a1a}-1.34\%$
test_lock_nested 0.6584ms 0.4172ms 2.3967 KOps/s 2.4658 KOps/s $\color{#d91a1a}-2.80\%$
test_lock_stack_nested 0.5432ms 0.4228ms 2.3649 KOps/s 2.3743 KOps/s $\color{#d91a1a}-0.39\%$
test_unlock_nested 0.4401ms 0.3380ms 2.9584 KOps/s 3.0104 KOps/s $\color{#d91a1a}-1.73\%$
test_unlock_stack_nested 0.5517ms 0.3409ms 2.9334 KOps/s 2.9259 KOps/s $\color{#35bf28}+0.26\%$
test_flatten_speed 0.1999ms 99.5254μs 10.0477 KOps/s 9.9552 KOps/s $\color{#35bf28}+0.93\%$
test_unflatten_speed 0.7173ms 0.5999ms 1.6669 KOps/s 1.6879 KOps/s $\color{#d91a1a}-1.25\%$
test_common_ops 1.0971ms 0.8203ms 1.2190 KOps/s 1.2135 KOps/s $\color{#35bf28}+0.45\%$
test_creation 34.8150μs 2.5773μs 388.0060 KOps/s 399.8933 KOps/s $\color{#d91a1a}-2.97\%$
test_creation_empty 67.4330μs 11.9210μs 83.8855 KOps/s 79.0592 KOps/s $\textbf{\color{#35bf28}+6.10\%}$
test_creation_nested_1 64.0900μs 14.8165μs 67.4923 KOps/s 63.7761 KOps/s $\textbf{\color{#35bf28}+5.83\%}$
test_creation_nested_2 55.2130μs 19.2728μs 51.8867 KOps/s 50.1849 KOps/s $\color{#35bf28}+3.39\%$
test_clone 48.4200μs 13.8308μs 72.3022 KOps/s 75.6058 KOps/s $\color{#d91a1a}-4.37\%$
test_getitem[int] 0.8223ms 13.0751μs 76.4810 KOps/s 79.1792 KOps/s $\color{#d91a1a}-3.41\%$
test_getitem[slice_int] 0.1310ms 25.6349μs 39.0093 KOps/s 40.4077 KOps/s $\color{#d91a1a}-3.46\%$
test_getitem[range] 0.1655ms 51.3518μs 19.4735 KOps/s 20.5800 KOps/s $\textbf{\color{#d91a1a}-5.38\%}$
test_getitem[tuple] 0.1203ms 20.4072μs 49.0023 KOps/s 49.7250 KOps/s $\color{#d91a1a}-1.45\%$
test_getitem[list] 0.1617ms 45.7475μs 21.8591 KOps/s 22.1678 KOps/s $\color{#d91a1a}-1.39\%$
test_setitem_dim[int] 48.0300μs 26.3765μs 37.9126 KOps/s 39.6979 KOps/s $\color{#d91a1a}-4.50\%$
test_setitem_dim[slice_int] 91.3100μs 51.6192μs 19.3727 KOps/s 19.8307 KOps/s $\color{#d91a1a}-2.31\%$
test_setitem_dim[range] 0.1631ms 78.1643μs 12.7936 KOps/s 13.4953 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_setitem_dim[tuple] 80.6390μs 40.7926μs 24.5143 KOps/s 25.1217 KOps/s $\color{#d91a1a}-2.42\%$
test_setitem 64.0700μs 21.1195μs 47.3495 KOps/s 48.1913 KOps/s $\color{#d91a1a}-1.75\%$
test_set 56.1350μs 20.5283μs 48.7132 KOps/s 49.1860 KOps/s $\color{#d91a1a}-0.96\%$
test_set_shared 3.5658ms 0.1831ms 5.4609 KOps/s 5.6183 KOps/s $\color{#d91a1a}-2.80\%$
test_update 0.1153ms 26.4461μs 37.8128 KOps/s 37.5329 KOps/s $\color{#35bf28}+0.75\%$
test_update_nested 0.1394ms 42.0531μs 23.7794 KOps/s 23.6360 KOps/s $\color{#35bf28}+0.61\%$
test_update__nested 0.3956ms 35.0279μs 28.5487 KOps/s 29.2012 KOps/s $\color{#d91a1a}-2.23\%$
test_set_nested 67.9470μs 22.7091μs 44.0352 KOps/s 43.5406 KOps/s $\color{#35bf28}+1.14\%$
test_set_nested_new 0.1058ms 27.6780μs 36.1297 KOps/s 36.1206 KOps/s $\color{#35bf28}+0.03\%$
test_select 0.1188ms 44.3707μs 22.5374 KOps/s 22.6279 KOps/s $\color{#d91a1a}-0.40\%$
test_select_nested 0.1310ms 65.1155μs 15.3573 KOps/s 15.6070 KOps/s $\color{#d91a1a}-1.60\%$
test_exclude_nested 0.1544ms 82.0235μs 12.1916 KOps/s 12.4248 KOps/s $\color{#d91a1a}-1.88\%$
test_empty[True] 0.5489ms 0.4088ms 2.4463 KOps/s 2.4620 KOps/s $\color{#d91a1a}-0.64\%$
test_empty[False] 8.2580μs 1.3649μs 732.6747 KOps/s 717.4644 KOps/s $\color{#35bf28}+2.12\%$
test_unbind_speed 0.6034ms 0.2679ms 3.7332 KOps/s 3.7257 KOps/s $\color{#35bf28}+0.20\%$
test_unbind_speed_stack0 0.4570ms 0.2659ms 3.7615 KOps/s 3.7636 KOps/s $\color{#d91a1a}-0.06\%$
test_unbind_speed_stack1 0.1060s 0.7242ms 1.3808 KOps/s 1.2398 KOps/s $\textbf{\color{#35bf28}+11.37\%}$
test_split 0.1071s 1.7868ms 559.6617 Ops/s 575.0286 Ops/s $\color{#d91a1a}-2.67\%$
test_chunk 0.1008s 1.7680ms 565.6039 Ops/s 633.8841 Ops/s $\textbf{\color{#d91a1a}-10.77\%}$
test_consolidate_njt[False-None] 8.2789ms 8.0416ms 124.3528 Ops/s 110.4412 Ops/s $\textbf{\color{#35bf28}+12.60\%}$
test_creation[device0] 0.2155ms 92.6188μs 10.7969 KOps/s 11.0540 KOps/s $\color{#d91a1a}-2.33\%$
test_creation_from_tensor 4.1880ms 96.2115μs 10.3938 KOps/s 10.6099 KOps/s $\color{#d91a1a}-2.04\%$
test_add_one[memmap_tensor0] 84.7380μs 5.1631μs 193.6809 KOps/s 203.7442 KOps/s $\color{#d91a1a}-4.94\%$
test_contiguous[memmap_tensor0] 18.2140μs 0.5028μs 1.9888 MOps/s 1.9521 MOps/s $\color{#35bf28}+1.88\%$
test_stack[memmap_tensor0] 31.2280μs 3.5656μs 280.4594 KOps/s 295.9895 KOps/s $\textbf{\color{#d91a1a}-5.25\%}$
test_memmaptd_index 0.3383ms 0.2333ms 4.2862 KOps/s 4.3822 KOps/s $\color{#d91a1a}-2.19\%$
test_memmaptd_index_astensor 0.4841ms 0.3213ms 3.1123 KOps/s 3.1298 KOps/s $\color{#d91a1a}-0.56\%$
test_memmaptd_index_op 1.0711ms 0.5983ms 1.6715 KOps/s 1.6206 KOps/s $\color{#35bf28}+3.14\%$
test_serialize_model 0.2263s 0.1328s 7.5303 Ops/s 8.7098 Ops/s $\textbf{\color{#d91a1a}-13.54\%}$
test_serialize_model_pickle 0.4523s 0.3858s 2.5922 Ops/s 2.3677 Ops/s $\textbf{\color{#35bf28}+9.48\%}$
test_serialize_weights 0.1231s 0.1162s 8.6026 Ops/s 8.6186 Ops/s $\color{#d91a1a}-0.19\%$
test_serialize_weights_returnearly 0.1690s 0.1593s 6.2787 Ops/s 5.7317 Ops/s $\textbf{\color{#35bf28}+9.54\%}$
test_serialize_weights_pickle 1.1085s 0.7043s 1.4198 Ops/s 2.4028 Ops/s $\textbf{\color{#d91a1a}-40.91\%}$
test_serialize_weights_filesystem 0.1470s 0.1423s 7.0298 Ops/s 6.8403 Ops/s $\color{#35bf28}+2.77\%$
test_serialize_model_filesystem 0.1514s 0.1413s 7.0766 Ops/s 6.5840 Ops/s $\textbf{\color{#35bf28}+7.48\%}$
test_reshape_pytree 72.5550μs 26.3238μs 37.9884 KOps/s 37.8116 KOps/s $\color{#35bf28}+0.47\%$
test_reshape_td 0.1159ms 33.1663μs 30.1511 KOps/s 31.1156 KOps/s $\color{#d91a1a}-3.10\%$
test_view_pytree 58.1780μs 26.4153μs 37.8568 KOps/s 37.8244 KOps/s $\color{#35bf28}+0.09\%$
test_view_td 83.7660μs 41.3861μs 24.1627 KOps/s 24.4103 KOps/s $\color{#d91a1a}-1.01\%$
test_unbind_pytree 90.8290μs 29.3396μs 34.0836 KOps/s 34.1085 KOps/s $\color{#d91a1a}-0.07\%$
test_unbind_td 0.3179ms 40.1469μs 24.9085 KOps/s 25.3416 KOps/s $\color{#d91a1a}-1.71\%$
test_split_pytree 92.8130μs 29.2799μs 34.1531 KOps/s 34.4764 KOps/s $\color{#d91a1a}-0.94\%$
test_split_td 0.5007ms 46.0871μs 21.6981 KOps/s 22.2504 KOps/s $\color{#d91a1a}-2.48\%$
test_add_pytree 87.3430μs 35.7143μs 28.0000 KOps/s 27.7837 KOps/s $\color{#35bf28}+0.78\%$
test_add_td 0.2606ms 59.9498μs 16.6806 KOps/s 16.9684 KOps/s $\color{#d91a1a}-1.70\%$
test_compile_add_one_nested[tensordict-compile] 0.1339ms 67.5203μs 14.8104 KOps/s 14.9749 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_add_one_nested[tensordict-eager] 1.5926ms 0.1815ms 5.5085 KOps/s 5.5543 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_add_one_nested[pytree-compile] 0.1252ms 46.2411μs 21.6258 KOps/s 21.8385 KOps/s $\color{#d91a1a}-0.97\%$
test_compile_add_one_nested[pytree-eager] 0.2258ms 0.1184ms 8.4492 KOps/s 8.4230 KOps/s $\color{#35bf28}+0.31\%$
test_compile_copy_nested[tensordict-compile] 88.6360μs 27.9444μs 35.7854 KOps/s 35.8877 KOps/s $\color{#d91a1a}-0.29\%$
test_compile_copy_nested[tensordict-eager] 0.1353ms 63.7964μs 15.6749 KOps/s 15.5758 KOps/s $\color{#35bf28}+0.64\%$
test_compile_copy_nested[pytree-compile] 0.1651ms 79.9774μs 12.5035 KOps/s 12.2909 KOps/s $\color{#35bf28}+1.73\%$
test_compile_copy_nested[pytree-eager] 0.1827ms 65.8022μs 15.1971 KOps/s 15.0327 KOps/s $\color{#35bf28}+1.09\%$
test_compile_add_one_flat[tensordict-compile] 0.1866ms 0.1084ms 9.2281 KOps/s 9.3704 KOps/s $\color{#d91a1a}-1.52\%$
test_compile_add_one_flat[tensordict-eager] 0.2985ms 0.2202ms 4.5410 KOps/s 4.6177 KOps/s $\color{#d91a1a}-1.66\%$
test_compile_add_one_flat[tensorclass-compile] 0.1235ms 49.3387μs 20.2680 KOps/s 21.7902 KOps/s $\textbf{\color{#d91a1a}-6.99\%}$
test_compile_add_one_flat[tensorclass-eager] 0.6558ms 69.1191μs 14.4678 KOps/s 14.8309 KOps/s $\color{#d91a1a}-2.45\%$
test_compile_add_one_flat[pytree-compile] 0.2322ms 0.1010ms 9.9019 KOps/s 10.0044 KOps/s $\color{#d91a1a}-1.02\%$
test_compile_add_one_flat[pytree-eager] 0.4240ms 0.2061ms 4.8532 KOps/s 4.9776 KOps/s $\color{#d91a1a}-2.50\%$
test_compile_add_self_flat[tensordict-eager] 0.3976ms 0.2349ms 4.2580 KOps/s 4.3212 KOps/s $\color{#d91a1a}-1.46\%$
test_compile_add_self_flat[tensordict-compile] 0.2139ms 0.1116ms 8.9608 KOps/s 9.2623 KOps/s $\color{#d91a1a}-3.26\%$
test_compile_add_self_flat[tensorclass-eager] 0.1215ms 64.4904μs 15.5062 KOps/s 15.8277 KOps/s $\color{#d91a1a}-2.03\%$
test_compile_add_self_flat[tensorclass-compile] 0.1234ms 50.4580μs 19.8185 KOps/s 20.2450 KOps/s $\color{#d91a1a}-2.11\%$
test_compile_add_self_flat[pytree-eager] 0.3617ms 0.1588ms 6.2955 KOps/s 6.4266 KOps/s $\color{#d91a1a}-2.04\%$
test_compile_add_self_flat[pytree-compile] 0.1816ms 99.2706μs 10.0735 KOps/s 10.0183 KOps/s $\color{#35bf28}+0.55\%$
test_compile_copy_flat[tensordict-compile] 82.3930μs 22.0352μs 45.3819 KOps/s 47.0654 KOps/s $\color{#d91a1a}-3.58\%$
test_compile_copy_flat[tensordict-eager] 0.1454ms 68.8482μs 14.5247 KOps/s 15.0557 KOps/s $\color{#d91a1a}-3.53\%$
test_compile_copy_flat[pytree-compile] 0.1709ms 84.3238μs 11.8590 KOps/s 11.8496 KOps/s $\color{#35bf28}+0.08\%$
test_compile_copy_flat[pytree-eager] 0.1357ms 66.7849μs 14.9735 KOps/s 15.0245 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_assign_and_add[tensordict-compile] 0.3243ms 0.2136ms 4.6812 KOps/s 4.6176 KOps/s $\color{#35bf28}+1.38\%$
test_compile_assign_and_add[tensordict-eager] 2.4674ms 1.3882ms 720.3707 Ops/s 716.6900 Ops/s $\color{#35bf28}+0.51\%$
test_compile_assign_and_add[pytree-compile] 0.4107ms 0.2090ms 4.7847 KOps/s 4.7581 KOps/s $\color{#35bf28}+0.56\%$
test_compile_assign_and_add[pytree-eager] 1.3343ms 0.8352ms 1.1974 KOps/s 1.1938 KOps/s $\color{#35bf28}+0.30\%$
test_compile_assign_and_add_stack[compile] 0.5280ms 0.4558ms 2.1941 KOps/s 2.1639 KOps/s $\color{#35bf28}+1.39\%$
test_compile_assign_and_add_stack[eager] 3.5094ms 2.7077ms 369.3184 Ops/s 365.1359 Ops/s $\color{#35bf28}+1.15\%$
test_compile_indexing[tensor-tensordict-compile] 0.1046ms 38.9327μs 25.6853 KOps/s 26.6594 KOps/s $\color{#d91a1a}-3.65\%$
test_compile_indexing[tensor-tensordict-eager] 0.6346ms 34.8141μs 28.7240 KOps/s 28.0934 KOps/s $\color{#35bf28}+2.24\%$
test_compile_indexing[tensor-tensorclass-compile] 87.9140μs 31.8223μs 31.4245 KOps/s 33.0080 KOps/s $\color{#d91a1a}-4.80\%$
test_compile_indexing[tensor-tensorclass-eager] 80.4200μs 23.3112μs 42.8979 KOps/s 43.2218 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_indexing[tensor-pytree-compile] 82.3730μs 31.8577μs 31.3896 KOps/s 31.7205 KOps/s $\color{#d91a1a}-1.04\%$
test_compile_indexing[tensor-pytree-eager] 63.6580μs 23.4811μs 42.5875 KOps/s 42.8863 KOps/s $\color{#d91a1a}-0.70\%$
test_compile_indexing[slice-tensordict-compile] 0.1142ms 54.1580μs 18.4645 KOps/s 18.7721 KOps/s $\color{#d91a1a}-1.64\%$
test_compile_indexing[slice-tensordict-eager] 0.4035ms 21.7375μs 46.0035 KOps/s 46.3771 KOps/s $\color{#d91a1a}-0.81\%$
test_compile_indexing[slice-tensorclass-compile] 0.1160ms 46.8919μs 21.3256 KOps/s 21.7520 KOps/s $\color{#d91a1a}-1.96\%$
test_compile_indexing[slice-tensorclass-eager] 70.6720μs 18.3145μs 54.6015 KOps/s 53.8740 KOps/s $\color{#35bf28}+1.35\%$
test_compile_indexing[slice-pytree-compile] 0.1218ms 47.8018μs 20.9197 KOps/s 21.3354 KOps/s $\color{#d91a1a}-1.95\%$
test_compile_indexing[slice-pytree-eager] 65.1520μs 18.3534μs 54.4857 KOps/s 55.0914 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_indexing[int-tensordict-compile] 0.1164ms 55.4812μs 18.0241 KOps/s 18.5062 KOps/s $\color{#d91a1a}-2.61\%$
test_compile_indexing[int-tensordict-eager] 0.9610ms 21.6099μs 46.2751 KOps/s 46.7699 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_indexing[int-tensorclass-compile] 0.1211ms 47.2107μs 21.1816 KOps/s 21.3415 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_indexing[int-tensorclass-eager] 0.2841ms 18.5071μs 54.0334 KOps/s 54.9363 KOps/s $\color{#d91a1a}-1.64\%$
test_compile_indexing[int-pytree-compile] 0.1160ms 47.6485μs 20.9870 KOps/s 21.6343 KOps/s $\color{#d91a1a}-2.99\%$
test_compile_indexing[int-pytree-eager] 82.0830μs 18.3968μs 54.3573 KOps/s 55.1897 KOps/s $\color{#d91a1a}-1.51\%$
test_mod_add[eager] 88.2250μs 37.1653μs 26.9068 KOps/s 26.9826 KOps/s $\color{#d91a1a}-0.28\%$
test_mod_add[compile] 0.1205ms 66.1942μs 15.1071 KOps/s 15.1058 KOps/s $+0.01\%$
test_mod_add[compile-overhead] 0.1356ms 65.8653μs 15.1825 KOps/s 14.6551 KOps/s $\color{#35bf28}+3.60\%$
test_mod_wrap[eager] 0.3483ms 0.2226ms 4.4932 KOps/s 4.3271 KOps/s $\color{#35bf28}+3.84\%$
test_mod_wrap[compile] 2.2794ms 0.2292ms 4.3636 KOps/s 4.2165 KOps/s $\color{#35bf28}+3.49\%$
test_mod_wrap[compile-overhead] 0.4483ms 0.2280ms 4.3859 KOps/s 4.3642 KOps/s $\color{#35bf28}+0.50\%$
test_mod_wrap_and_backward[eager] 12.2045ms 10.8904ms 91.8244 Ops/s 71.6631 Ops/s $\textbf{\color{#35bf28}+28.13\%}$
test_mod_wrap_and_backward[compile] 13.6961ms 11.0415ms 90.5676 Ops/s 73.7514 Ops/s $\textbf{\color{#35bf28}+22.80\%}$
test_mod_wrap_and_backward[compile-overhead] 13.4703ms 11.3823ms 87.8555 Ops/s 76.6307 Ops/s $\textbf{\color{#35bf28}+14.65\%}$
test_seq_add[eager] 0.2219ms 0.1255ms 7.9712 KOps/s 8.3032 KOps/s $\color{#d91a1a}-4.00\%$
test_seq_add[compile] 0.1419ms 78.2213μs 12.7842 KOps/s 12.8439 KOps/s $\color{#d91a1a}-0.46\%$
test_seq_add[compile-overhead] 0.1385ms 75.6843μs 13.2128 KOps/s 13.4011 KOps/s $\color{#d91a1a}-1.41\%$
test_seq_wrap[eager] 0.8701ms 0.4644ms 2.1535 KOps/s 2.1507 KOps/s $\color{#35bf28}+0.13\%$
test_seq_wrap[compile] 0.4358ms 0.2467ms 4.0539 KOps/s 4.0390 KOps/s $\color{#35bf28}+0.37\%$
test_seq_wrap[compile-overhead] 0.4321ms 0.2444ms 4.0916 KOps/s 4.0688 KOps/s $\color{#35bf28}+0.56\%$
test_func_call_runtime[False-eager] 0.7017ms 0.5395ms 1.8537 KOps/s 1.7941 KOps/s $\color{#35bf28}+3.32\%$
test_func_call_runtime[False-compile] 0.8970ms 0.4465ms 2.2396 KOps/s 2.2323 KOps/s $\color{#35bf28}+0.33\%$
test_func_call_runtime[False-compile-overhead] 0.9630ms 0.4520ms 2.2126 KOps/s 2.2441 KOps/s $\color{#d91a1a}-1.40\%$
test_func_call_runtime[True-eager] 0.9611ms 0.7516ms 1.3305 KOps/s 1.3193 KOps/s $\color{#35bf28}+0.85\%$
test_func_call_runtime[True-compile] 0.9324ms 0.4697ms 2.1288 KOps/s 2.1439 KOps/s $\color{#d91a1a}-0.71\%$
test_func_call_runtime[True-compile-overhead] 0.6352ms 0.4717ms 2.1199 KOps/s 2.1408 KOps/s $\color{#d91a1a}-0.98\%$
test_func_call_cm_runtime[False-eager] 0.6527ms 0.5350ms 1.8690 KOps/s 1.8248 KOps/s $\color{#35bf28}+2.43\%$
test_func_call_cm_runtime[False-compile] 0.8169ms 0.4487ms 2.2287 KOps/s 2.2539 KOps/s $\color{#d91a1a}-1.12\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6548ms 0.4459ms 2.2425 KOps/s 2.2448 KOps/s $\color{#d91a1a}-0.10\%$
test_func_call_cm_runtime[True-eager] 1.2854ms 0.8874ms 1.1269 KOps/s 1.1112 KOps/s $\color{#35bf28}+1.41\%$
test_func_call_cm_runtime[True-compile] 0.9380ms 0.7889ms 1.2676 KOps/s 1.2388 KOps/s $\color{#35bf28}+2.33\%$
test_func_call_cm_runtime[True-compile-overhead] 1.0703ms 0.7980ms 1.2531 KOps/s 1.2322 KOps/s $\color{#35bf28}+1.70\%$
test_vmap_func_call_cm_runtime[eager] 2.6059ms 1.9128ms 522.7874 Ops/s 519.5714 Ops/s $\color{#35bf28}+0.62\%$
test_vmap_func_call_cm_runtime[compile] 0.9144ms 0.5465ms 1.8297 KOps/s 1.8513 KOps/s $\color{#d91a1a}-1.17\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8813ms 0.5426ms 1.8428 KOps/s 1.8459 KOps/s $\color{#d91a1a}-0.17\%$
test_distributed 0.3194ms 0.1291ms 7.7457 KOps/s 7.8894 KOps/s $\color{#d91a1a}-1.82\%$
test_tdmodule 77.3140μs 28.6483μs 34.9061 KOps/s 35.4150 KOps/s $\color{#d91a1a}-1.44\%$
test_tdmodule_dispatch 0.1360ms 57.6947μs 17.3326 KOps/s 19.3442 KOps/s $\textbf{\color{#d91a1a}-10.40\%}$
test_tdseq 67.7860μs 31.0207μs 32.2366 KOps/s 32.9841 KOps/s $\color{#d91a1a}-2.27\%$
test_tdseq_dispatch 0.1004ms 56.7262μs 17.6285 KOps/s 17.7698 KOps/s $\color{#d91a1a}-0.79\%$
test_instantiation_functorch 2.0175ms 1.5377ms 650.3193 Ops/s 656.7756 Ops/s $\color{#d91a1a}-0.98\%$
test_exec_functorch 0.3166ms 0.1807ms 5.5345 KOps/s 5.5981 KOps/s $\color{#d91a1a}-1.14\%$
test_exec_functional_call 0.4009ms 0.1748ms 5.7218 KOps/s 5.5655 KOps/s $\color{#35bf28}+2.81\%$
test_exec_td_decorator 0.5424ms 0.2378ms 4.2053 KOps/s 4.2050 KOps/s $+0.01\%$
test_vmap_mlp_speed_decorator[True-True] 1.2033ms 0.6839ms 1.4621 KOps/s 1.4997 KOps/s $\color{#d91a1a}-2.50\%$
test_vmap_mlp_speed_decorator[True-False] 1.6446ms 0.6877ms 1.4542 KOps/s 1.5115 KOps/s $\color{#d91a1a}-3.79\%$
test_vmap_mlp_speed_decorator[False-True] 0.8292ms 0.5444ms 1.8370 KOps/s 1.8892 KOps/s $\color{#d91a1a}-2.76\%$
test_vmap_mlp_speed_decorator[False-False] 0.8227ms 0.5440ms 1.8384 KOps/s 1.8847 KOps/s $\color{#d91a1a}-2.46\%$
test_to_module_speed[True] 2.6071ms 1.3517ms 739.8363 Ops/s 731.0207 Ops/s $\color{#35bf28}+1.21\%$
test_to_module_speed[False] 1.8294ms 1.3152ms 760.3306 Ops/s 764.7523 Ops/s $\color{#d91a1a}-0.58\%$
test_tc_init 87.3920μs 47.0797μs 21.2406 KOps/s 20.8099 KOps/s $\color{#35bf28}+2.07\%$
test_tc_init_nested 0.1973ms 93.8083μs 10.6600 KOps/s 10.1960 KOps/s $\color{#35bf28}+4.55\%$
test_tc_first_layer_tensor 40.3450μs 1.7120μs 584.0957 KOps/s 593.7859 KOps/s $\color{#d91a1a}-1.63\%$
test_tc_first_layer_nontensor 32.6700μs 4.9549μs 201.8193 KOps/s 203.3808 KOps/s $\color{#d91a1a}-0.77\%$
test_tc_second_layer_tensor 48.2290μs 3.1975μs 312.7401 KOps/s 313.3826 KOps/s $\color{#d91a1a}-0.20\%$
test_tc_second_layer_nontensor 39.2330μs 6.5466μs 152.7515 KOps/s 154.5685 KOps/s $\color{#d91a1a}-1.18\%$
test_unbind 0.2606s 13.9889ms 71.4854 Ops/s 68.5191 Ops/s $\color{#35bf28}+4.33\%$
test_full_like 8.3040ms 5.4439ms 183.6930 Ops/s 285.5508 Ops/s $\textbf{\color{#d91a1a}-35.67\%}$
test_zeros_like 10.1251ms 5.0942ms 196.2999 Ops/s 346.0594 Ops/s $\textbf{\color{#d91a1a}-43.28\%}$
test_ones_like 6.0320ms 3.7863ms 264.1113 Ops/s 276.9189 Ops/s $\color{#d91a1a}-4.63\%$
test_clone 7.4952ms 5.3547ms 186.7526 Ops/s 182.6829 Ops/s $\color{#35bf28}+2.23\%$
test_squeeze 66.1130μs 13.0462μs 76.6509 KOps/s 78.2702 KOps/s $\color{#d91a1a}-2.07\%$
test_unsqueeze 0.3249ms 97.3642μs 10.2707 KOps/s 10.6417 KOps/s $\color{#d91a1a}-3.49\%$
test_split 0.3574ms 0.1974ms 5.0665 KOps/s 5.2295 KOps/s $\color{#d91a1a}-3.12\%$
test_permute 0.3174ms 0.2050ms 4.8790 KOps/s 5.0336 KOps/s $\color{#d91a1a}-3.07\%$
test_stack 29.6607ms 24.9382ms 40.0992 Ops/s 39.1426 Ops/s $\color{#35bf28}+2.44\%$
test_cat 29.6458ms 24.8444ms 40.2505 Ops/s 39.0491 Ops/s $\color{#35bf28}+3.08\%$
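
For readers checking the numbers, the Change column appears to be the relative difference between Ops and Ops on Repo HEAD, and Ops appears to be the reciprocal of Mean. The bolded rows look like those beyond roughly ±5%, which matches the Improved and Worsened counts in the summary. The sketch below reproduces a few rows under those assumptions. The function names and the 5% threshold are illustrative guesses inferred from the table, not taken from the benchmark tooling.

```python
def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean runtime (assumed: Ops = 1 / Mean)."""
    return 1.0 / mean_seconds


def percent_change(ops: float, ops_head: float) -> float:
    """Relative change of this run against repo HEAD, in percent."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a row; the 5% cutoff is an assumption inferred from the bolding."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "within noise"


# (Ops, Ops on Repo HEAD) copied from three CPU rows, same unit within a row.
rows = {
    "test_plain_set_nested": (47.0130, 46.7342),
    "test_chunk": (565.6039, 633.8841),
    "test_mod_wrap_and_backward[eager]": (91.8244, 71.6631),
}

for name, (ops, head) in rows.items():
    change = percent_change(ops, head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Because only the ratio matters, the unit prefix (Ops/s, KOps/s, MOps/s) drops out as long as both values in a row use the same one.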

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}57$. Worsened: $\large\color{#d91a1a}6$.

Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 33.5610μs 11.8928μs 84.0845 KOps/s 76.0824 KOps/s $\textbf{\color{#35bf28}+10.52\%}$
test_plain_set_stack_nested 50.0710μs 12.0700μs 82.8499 KOps/s 75.0658 KOps/s $\textbf{\color{#35bf28}+10.37\%}$
test_plain_set_nested_inplace 42.9900μs 12.9793μs 77.0460 KOps/s 69.8720 KOps/s $\textbf{\color{#35bf28}+10.27\%}$
test_plain_set_stack_nested_inplace 61.9210μs 13.1698μs 75.9312 KOps/s 70.2072 KOps/s $\textbf{\color{#35bf28}+8.15\%}$
test_items 38.6910μs 2.8711μs 348.2964 KOps/s 345.0180 KOps/s $\color{#35bf28}+0.95\%$
test_items_nested 0.4132ms 0.3645ms 2.7437 KOps/s 2.7099 KOps/s $\color{#35bf28}+1.25\%$
test_items_nested_locked 0.4215ms 0.3686ms 2.7133 KOps/s 2.7112 KOps/s $\color{#35bf28}+0.08\%$
test_items_nested_leaf 88.8910μs 62.2869μs 16.0547 KOps/s 15.9663 KOps/s $\color{#35bf28}+0.55\%$
test_items_stack_nested 0.5403ms 0.3633ms 2.7529 KOps/s 2.6973 KOps/s $\color{#35bf28}+2.06\%$
test_items_stack_nested_leaf 94.1210μs 62.5021μs 15.9995 KOps/s 16.0576 KOps/s $\color{#d91a1a}-0.36\%$
test_items_stack_nested_locked 0.5178ms 0.3665ms 2.7285 KOps/s 2.7091 KOps/s $\color{#35bf28}+0.72\%$
test_keys 50.8110μs 3.4249μs 291.9807 KOps/s 293.0005 KOps/s $\color{#d91a1a}-0.35\%$
test_keys_nested 0.1280ms 88.8774μs 11.2515 KOps/s 11.3781 KOps/s $\color{#d91a1a}-1.11\%$
test_keys_nested_locked 0.7070ms 94.1514μs 10.6212 KOps/s 10.6538 KOps/s $\color{#d91a1a}-0.31\%$
test_keys_nested_leaf 0.1171ms 79.2620μs 12.6164 KOps/s 12.6103 KOps/s $\color{#35bf28}+0.05\%$
test_keys_stack_nested 0.1621ms 87.4928μs 11.4295 KOps/s 11.4552 KOps/s $\color{#d91a1a}-0.22\%$
test_keys_stack_nested_leaf 0.1140ms 78.7150μs 12.7041 KOps/s 12.7983 KOps/s $\color{#d91a1a}-0.74\%$
test_keys_stack_nested_locked 0.1399ms 94.4846μs 10.5837 KOps/s 10.6732 KOps/s $\color{#d91a1a}-0.84\%$
test_values 4.2850μs 0.8537μs 1.1714 MOps/s 1.1737 MOps/s $\color{#d91a1a}-0.20\%$
test_values_nested 64.4010μs 38.5152μs 25.9638 KOps/s 26.2124 KOps/s $\color{#d91a1a}-0.95\%$
test_values_nested_locked 64.4210μs 40.2606μs 24.8382 KOps/s 25.0892 KOps/s $\color{#d91a1a}-1.00\%$
test_values_nested_leaf 68.5610μs 43.4745μs 23.0020 KOps/s 23.1680 KOps/s $\color{#d91a1a}-0.72\%$
test_values_stack_nested 70.0610μs 38.4174μs 26.0298 KOps/s 26.0999 KOps/s $\color{#d91a1a}-0.27\%$
test_values_stack_nested_leaf 70.1910μs 43.3635μs 23.0609 KOps/s 23.1743 KOps/s $\color{#d91a1a}-0.49\%$
test_values_stack_nested_locked 70.3010μs 40.6058μs 24.6270 KOps/s 25.1165 KOps/s $\color{#d91a1a}-1.95\%$
test_membership 2.3706μs 0.5001μs 1.9996 MOps/s 1.9951 MOps/s $\color{#35bf28}+0.23\%$
test_membership_nested 18.3500μs 1.9857μs 503.5903 KOps/s 504.0555 KOps/s $\color{#d91a1a}-0.09\%$
test_membership_nested_leaf 15.5350μs 1.9964μs 500.9061 KOps/s 499.9143 KOps/s $\color{#35bf28}+0.20\%$
test_membership_stacked_nested 25.8710μs 2.0592μs 485.6302 KOps/s 485.2543 KOps/s $\color{#35bf28}+0.08\%$
test_membership_stacked_nested_leaf 35.5700μs 2.0558μs 486.4336 KOps/s 485.0675 KOps/s $\color{#35bf28}+0.28\%$
test_membership_nested_last 42.9110μs 3.0354μs 329.4485 KOps/s 330.1285 KOps/s $\color{#d91a1a}-0.21\%$
test_membership_nested_leaf_last 49.0610μs 3.0131μs 331.8866 KOps/s 329.9813 KOps/s $\color{#35bf28}+0.58\%$
test_membership_stacked_nested_last 53.5210μs 3.0347μs 329.5178 KOps/s 331.4650 KOps/s $\color{#d91a1a}-0.59\%$
test_membership_stacked_nested_leaf_last 32.1900μs 3.0275μs 330.3070 KOps/s 331.5549 KOps/s $\color{#d91a1a}-0.38\%$
test_nested_getleaf 49.7900μs 13.4588μs 74.3010 KOps/s 74.2010 KOps/s $\color{#35bf28}+0.13\%$
test_nested_get 38.0810μs 12.8057μs 78.0901 KOps/s 77.4962 KOps/s $\color{#35bf28}+0.77\%$
test_stacked_getleaf 42.2710μs 13.4125μs 74.5573 KOps/s 74.8667 KOps/s $\color{#d91a1a}-0.41\%$
test_stacked_get 43.3200μs 12.7444μs 78.4656 KOps/s 79.0482 KOps/s $\color{#d91a1a}-0.74\%$
test_nested_getitemleaf 55.4810μs 14.0129μs 71.3628 KOps/s 72.1874 KOps/s $\color{#d91a1a}-1.14\%$
test_nested_getitem 36.5100μs 13.1651μs 75.9584 KOps/s 76.4258 KOps/s $\color{#d91a1a}-0.61\%$
test_stacked_getitemleaf 42.6900μs 13.8193μs 72.3627 KOps/s 72.9835 KOps/s $\color{#d91a1a}-0.85\%$
test_stacked_getitem 59.3810μs 13.0170μs 76.8225 KOps/s 77.6344 KOps/s $\color{#d91a1a}-1.05\%$
test_lock_nested 7.8819ms 0.3453ms 2.8963 KOps/s 2.9191 KOps/s $\color{#d91a1a}-0.78\%$
test_lock_stack_nested 0.3896ms 0.3416ms 2.9276 KOps/s 2.8441 KOps/s $\color{#35bf28}+2.94\%$
test_unlock_nested 0.3599ms 0.2853ms 3.5053 KOps/s 3.4754 KOps/s $\color{#35bf28}+0.86\%$
test_unlock_stack_nested 0.3178ms 0.2801ms 3.5697 KOps/s 3.4461 KOps/s $\color{#35bf28}+3.58\%$
test_flatten_speed 0.1149ms 78.6988μs 12.7067 KOps/s 12.8153 KOps/s $\color{#d91a1a}-0.85\%$
test_unflatten_speed 0.4600ms 0.4045ms 2.4722 KOps/s 2.5067 KOps/s $\color{#d91a1a}-1.37\%$
test_common_ops 0.7142ms 0.5731ms 1.7448 KOps/s 1.5766 KOps/s $\textbf{\color{#35bf28}+10.67\%}$
test_creation 89.3410μs 1.7537μs 570.2353 KOps/s 572.2836 KOps/s $\color{#d91a1a}-0.36\%$
test_creation_empty 34.5900μs 6.4577μs 154.8543 KOps/s 116.8732 KOps/s $\textbf{\color{#35bf28}+32.50\%}$
test_creation_nested_1 31.3110μs 8.3032μs 120.4353 KOps/s 97.8051 KOps/s $\textbf{\color{#35bf28}+23.14\%}$
test_creation_nested_2 36.3810μs 10.9683μs 91.1722 KOps/s 76.5730 KOps/s $\textbf{\color{#35bf28}+19.07\%}$
test_clone 56.6410μs 10.4960μs 95.2746 KOps/s 93.0746 KOps/s $\color{#35bf28}+2.36\%$
test_getitem[int] 1.2723ms 10.4584μs 95.6165 KOps/s 92.3427 KOps/s $\color{#35bf28}+3.55\%$
test_getitem[slice_int] 0.1124ms 20.5693μs 48.6161 KOps/s 47.7019 KOps/s $\color{#35bf28}+1.92\%$
test_getitem[range] 0.1289ms 37.5834μs 26.6075 KOps/s 26.0742 KOps/s $\color{#35bf28}+2.05\%$
test_getitem[tuple] 0.1144ms 17.7654μs 56.2891 KOps/s 54.6763 KOps/s $\color{#35bf28}+2.95\%$
test_getitem[list] 0.1286ms 32.5914μs 30.6829 KOps/s 29.4817 KOps/s $\color{#35bf28}+4.07\%$
test_setitem_dim[int] 41.2210μs 18.7333μs 53.3809 KOps/s 50.6845 KOps/s $\textbf{\color{#35bf28}+5.32\%}$
test_setitem_dim[slice_int] 61.1510μs 38.3269μs 26.0913 KOps/s 25.6533 KOps/s $\color{#35bf28}+1.71\%$
test_setitem_dim[range] 92.5810μs 53.4879μs 18.6958 KOps/s 18.6403 KOps/s $\color{#35bf28}+0.30\%$
test_setitem_dim[tuple] 53.2110μs 31.5827μs 31.6629 KOps/s 29.9016 KOps/s $\textbf{\color{#35bf28}+5.89\%}$
test_setitem 57.4310μs 13.8563μs 72.1692 KOps/s 65.5837 KOps/s $\textbf{\color{#35bf28}+10.04\%}$
test_set 48.1110μs 13.1177μs 76.2330 KOps/s 67.7087 KOps/s $\textbf{\color{#35bf28}+12.59\%}$
test_set_shared 0.6018ms 0.1573ms 6.3565 KOps/s 6.3084 KOps/s $\color{#35bf28}+0.76\%$
test_update 0.3387ms 17.0515μs 58.6458 KOps/s 52.0835 KOps/s $\textbf{\color{#35bf28}+12.60\%}$
test_update_nested 58.0410μs 25.6765μs 38.9462 KOps/s 35.2330 KOps/s $\textbf{\color{#35bf28}+10.54\%}$
test_update__nested 0.5233ms 25.4119μs 39.3517 KOps/s 39.5321 KOps/s $\color{#d91a1a}-0.46\%$
test_set_nested 49.2710μs 14.5019μs 68.9565 KOps/s 62.7024 KOps/s $\textbf{\color{#35bf28}+9.97\%}$
test_set_nested_new 57.1810μs 17.0799μs 58.5482 KOps/s 54.4888 KOps/s $\textbf{\color{#35bf28}+7.45\%}$
test_select 71.7710μs 28.9653μs 34.5240 KOps/s 33.2589 KOps/s $\color{#35bf28}+3.80\%$
test_select_nested 77.4310μs 44.1847μs 22.6323 KOps/s 22.7135 KOps/s $\color{#d91a1a}-0.36\%$
test_exclude_nested 92.3920μs 63.5604μs 15.7331 KOps/s 15.9077 KOps/s $\color{#d91a1a}-1.10\%$
test_empty[True] 0.3534ms 0.2976ms 3.3605 KOps/s 3.4056 KOps/s $\color{#d91a1a}-1.32\%$
test_empty[False] 3.3711μs 0.8234μs 1.2145 MOps/s 1.2235 MOps/s $\color{#d91a1a}-0.74\%$
test_to 89.5210μs 57.7621μs 17.3124 KOps/s 16.8728 KOps/s $\color{#35bf28}+2.61\%$
test_to_nonblocking 96.0510μs 49.4778μs 20.2111 KOps/s 20.5324 KOps/s $\color{#d91a1a}-1.57\%$
test_unbind_speed 0.2750ms 0.2367ms 4.2253 KOps/s 4.1030 KOps/s $\color{#35bf28}+2.98\%$
test_unbind_speed_stack0 0.2927ms 0.2349ms 4.2578 KOps/s 4.1231 KOps/s $\color{#35bf28}+3.27\%$
test_unbind_speed_stack1 92.2995ms 0.7319ms 1.3664 KOps/s 1.3526 KOps/s $\color{#35bf28}+1.01\%$
test_split 93.3556ms 1.5731ms 635.7057 Ops/s 623.0582 Ops/s $\color{#35bf28}+2.03\%$
test_chunk 95.3172ms 1.5862ms 630.4207 Ops/s 618.7683 Ops/s $\color{#35bf28}+1.88\%$
test_consolidate[False-None] 95.3325ms 3.0881ms 323.8218 Ops/s 357.1980 Ops/s $\textbf{\color{#d91a1a}-9.34\%}$
test_consolidate[default-None] 1.8096ms 1.7032ms 587.1209 Ops/s 579.2627 Ops/s $\color{#35bf28}+1.36\%$
test_consolidate[reduce-overhead-None] 1.8469ms 1.7289ms 578.3871 Ops/s 569.3619 Ops/s $\color{#35bf28}+1.59\%$
test_consolidate_njt[False-None] 6.5979ms 6.3148ms 158.3593 Ops/s 150.1611 Ops/s $\textbf{\color{#35bf28}+5.46\%}$
test_to[False-False-None] 1.8178ms 1.7247ms 579.8054 Ops/s 551.8505 Ops/s $\textbf{\color{#35bf28}+5.07\%}$
test_to[True-False-None] 1.5410ms 1.3408ms 745.8407 Ops/s 696.1935 Ops/s $\textbf{\color{#35bf28}+7.13\%}$
test_to[within-False-None] 4.2808ms 4.2019ms 237.9875 Ops/s 227.7119 Ops/s $\color{#35bf28}+4.51\%$
test_to[True-default-None] 5.2881ms 5.1301ms 194.9280 Ops/s 187.8191 Ops/s $\color{#35bf28}+3.78\%$
test_to_njt[False-False-None] 6.9208ms 6.8413ms 146.1701 Ops/s 139.4280 Ops/s $\color{#35bf28}+4.84\%$
test_to_njt[True-False-None] 5.4592ms 5.3222ms 187.8905 Ops/s 172.0899 Ops/s $\textbf{\color{#35bf28}+9.18\%}$
test_to_njt[within-False-None] 11.8095ms 11.6799ms 85.6172 Ops/s 81.2041 Ops/s $\textbf{\color{#35bf28}+5.43\%}$
test_creation[device0] 0.4723ms 79.6331μs 12.5576 KOps/s 12.3823 KOps/s $\color{#35bf28}+1.42\%$
test_creation_from_tensor 0.5442ms 82.6395μs 12.1007 KOps/s 11.7868 KOps/s $\color{#35bf28}+2.66\%$
test_add_one[memmap_tensor0] 0.5614ms 6.6989μs 149.2793 KOps/s 139.7282 KOps/s $\textbf{\color{#35bf28}+6.84\%}$
test_contiguous[memmap_tensor0] 4.5040μs 0.4250μs 2.3531 MOps/s 2.2825 MOps/s $\color{#35bf28}+3.10\%$
test_stack[memmap_tensor0] 36.6210μs 4.2818μs 233.5471 KOps/s 208.8466 KOps/s $\textbf{\color{#35bf28}+11.83\%}$
test_memmaptd_index 1.4778ms 0.2364ms 4.2297 KOps/s 4.0230 KOps/s $\textbf{\color{#35bf28}+5.14\%}$
test_memmaptd_index_astensor 0.4206ms 0.2966ms 3.3711 KOps/s 3.1882 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_memmaptd_index_op 0.7125ms 0.5402ms 1.8513 KOps/s 1.6561 KOps/s $\textbf{\color{#35bf28}+11.79\%}$
test_serialize_model 0.1326s 0.1309s 7.6409 Ops/s 7.6643 Ops/s $\color{#d91a1a}-0.31\%$
test_serialize_model_pickle 1.3516s 1.2162s 0.8222 Ops/s 0.8247 Ops/s $\color{#d91a1a}-0.30\%$
test_serialize_weights 0.4057s 0.1695s 5.9002 Ops/s 7.6540 Ops/s $\textbf{\color{#d91a1a}-22.91\%}$
test_serialize_weights_returnearly 0.3321s 53.2655ms 18.7739 Ops/s 23.1384 Ops/s $\textbf{\color{#d91a1a}-18.86\%}$
test_serialize_weights_pickle 1.4371s 1.2283s 0.8142 Ops/s 0.8220 Ops/s $\color{#d91a1a}-0.95\%$
test_reshape_pytree 65.0810μs 22.3422μs 44.7584 KOps/s 44.0504 KOps/s $\color{#35bf28}+1.61\%$
test_reshape_td 61.9810μs 26.4655μs 37.7850 KOps/s 35.4300 KOps/s $\textbf{\color{#35bf28}+6.65\%}$
test_view_pytree 54.4210μs 22.3311μs 44.7807 KOps/s 43.9750 KOps/s $\color{#35bf28}+1.83\%$
test_view_td 65.9010μs 31.9317μs 31.3168 KOps/s 29.7964 KOps/s $\textbf{\color{#35bf28}+5.10\%}$
test_unbind_pytree 51.6210μs 27.5653μs 36.2775 KOps/s 34.2076 KOps/s $\textbf{\color{#35bf28}+6.05\%}$
test_unbind_td 0.7384ms 36.1130μs 27.6909 KOps/s 26.0695 KOps/s $\textbf{\color{#35bf28}+6.22\%}$
test_split_pytree 56.7210μs 29.4268μs 33.9826 KOps/s 30.5886 KOps/s $\textbf{\color{#35bf28}+11.10\%}$
test_split_td 0.8839ms 38.1864μs 26.1873 KOps/s 24.8535 KOps/s $\textbf{\color{#35bf28}+5.37\%}$
test_add_pytree 65.3810μs 34.6835μs 28.8321 KOps/s 25.9520 KOps/s $\textbf{\color{#35bf28}+11.10\%}$
test_add_td 98.6420μs 46.3780μs 21.5620 KOps/s 18.8607 KOps/s $\textbf{\color{#35bf28}+14.32\%}$
test_compile_add_one_nested[tensordict-compile] 0.1827ms 0.1211ms 8.2591 KOps/s 7.8797 KOps/s $\color{#35bf28}+4.81\%$
test_compile_add_one_nested[tensordict-eager] 0.2364ms 0.1412ms 7.0817 KOps/s 6.9905 KOps/s $\color{#35bf28}+1.30\%$
test_compile_add_one_nested[pytree-compile] 0.4818ms 94.3103μs 10.6033 KOps/s 10.3386 KOps/s $\color{#35bf28}+2.56\%$
test_compile_add_one_nested[pytree-eager] 1.0373ms 0.1484ms 6.7396 KOps/s 6.4537 KOps/s $\color{#35bf28}+4.43\%$
test_compile_copy_nested[tensordict-compile] 0.4196ms 24.8464μs 40.2472 KOps/s 41.0266 KOps/s $\color{#d91a1a}-1.90\%$
test_compile_copy_nested[tensordict-eager] 0.4363ms 35.6315μs 28.0650 KOps/s 28.1995 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_copy_nested[pytree-compile] 0.3184ms 64.2239μs 15.5705 KOps/s 15.3465 KOps/s $\color{#35bf28}+1.46\%$
test_compile_copy_nested[pytree-eager] 0.4394ms 49.4047μs 20.2410 KOps/s 20.0624 KOps/s $\color{#35bf28}+0.89\%$
test_compile_add_one_flat[tensordict-compile] 0.1844ms 0.1416ms 7.0643 KOps/s 7.1306 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_add_one_flat[tensordict-eager] 0.6160ms 0.2198ms 4.5487 KOps/s 4.5187 KOps/s $\color{#35bf28}+0.66\%$
test_compile_add_one_flat[tensorclass-compile] 0.5034ms 96.1965μs 10.3954 KOps/s 10.1034 KOps/s $\color{#35bf28}+2.89\%$
test_compile_add_one_flat[tensorclass-eager] 0.4560ms 57.9532μs 17.2553 KOps/s 17.2294 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_flat[pytree-compile] 0.1723ms 0.1339ms 7.4693 KOps/s 7.3007 KOps/s $\color{#35bf28}+2.31\%$
test_compile_add_one_flat[pytree-eager] 0.8837ms 0.4772ms 2.0955 KOps/s 2.0058 KOps/s $\color{#35bf28}+4.47\%$
test_compile_add_self_flat[tensordict-eager] 0.6590ms 0.2618ms 3.8195 KOps/s 3.7638 KOps/s $\color{#35bf28}+1.48\%$
test_compile_add_self_flat[tensordict-compile] 0.1928ms 0.1410ms 7.0927 KOps/s 7.0332 KOps/s $\color{#35bf28}+0.85\%$
test_compile_add_self_flat[tensorclass-eager] 0.4757ms 69.9661μs 14.2926 KOps/s 13.8413 KOps/s $\color{#35bf28}+3.26\%$
test_compile_add_self_flat[tensorclass-compile] 0.1333ms 96.4990μs 10.3628 KOps/s 10.1618 KOps/s $\color{#35bf28}+1.98\%$
test_compile_add_self_flat[pytree-eager] 0.8033ms 0.4040ms 2.4753 KOps/s 2.3652 KOps/s $\color{#35bf28}+4.66\%$
test_compile_add_self_flat[pytree-compile] 0.1712ms 0.1335ms 7.4898 KOps/s 7.3637 KOps/s $\color{#35bf28}+1.71\%$
test_compile_copy_flat[tensordict-compile] 0.1269ms 18.4729μs 54.1335 KOps/s 55.4472 KOps/s $\color{#d91a1a}-2.37\%$
test_compile_copy_flat[tensordict-eager] 57.8810μs 31.8186μs 31.4282 KOps/s 31.5235 KOps/s $\color{#d91a1a}-0.30\%$
test_compile_copy_flat[pytree-compile] 0.1992ms 69.6382μs 14.3599 KOps/s 14.2215 KOps/s $\color{#35bf28}+0.97\%$
test_compile_copy_flat[pytree-eager] 83.2510μs 52.0535μs 19.2110 KOps/s 19.1840 KOps/s $\color{#35bf28}+0.14\%$
test_compile_assign_and_add[tensordict-compile] 1.6336ms 0.3993ms 2.5046 KOps/s 2.1973 KOps/s $\textbf{\color{#35bf28}+13.99\%}$
test_compile_assign_and_add[tensordict-eager] 2.7523ms 2.6095ms 383.2201 Ops/s 365.7788 Ops/s $\color{#35bf28}+4.77\%$
test_compile_assign_and_add[pytree-compile] 1.5943ms 0.3810ms 2.6249 KOps/s 2.2598 KOps/s $\textbf{\color{#35bf28}+16.16\%}$
test_compile_assign_and_add[pytree-eager] 2.7181ms 2.6002ms 384.5890 Ops/s 365.5168 Ops/s $\textbf{\color{#35bf28}+5.22\%}$
test_compile_indexing[tensor-tensordict-compile] 0.5240ms 0.1147ms 8.7161 KOps/s 8.7272 KOps/s $\color{#d91a1a}-0.13\%$
test_compile_indexing[tensor-tensordict-eager] 0.5917ms 82.1409μs 12.1742 KOps/s 11.8857 KOps/s $\color{#35bf28}+2.43\%$
test_compile_indexing[tensor-tensorclass-compile] 0.7710ms 0.1083ms 9.2339 KOps/s 9.2557 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1509ms 68.2989μs 14.6415 KOps/s 14.3270 KOps/s $\color{#35bf28}+2.20\%$
test_compile_indexing[tensor-pytree-compile] 0.2277ms 0.1118ms 8.9433 KOps/s 9.2131 KOps/s $\color{#d91a1a}-2.93\%$
test_compile_indexing[tensor-pytree-eager] 0.1499ms 67.9155μs 14.7242 KOps/s 14.4074 KOps/s $\color{#35bf28}+2.20\%$
test_compile_indexing[slice-tensordict-compile] 0.1378ms 0.1027ms 9.7370 KOps/s 9.9270 KOps/s $\color{#d91a1a}-1.91\%$
test_compile_indexing[slice-tensordict-eager] 0.1493ms 18.1355μs 55.1404 KOps/s 51.3839 KOps/s $\textbf{\color{#35bf28}+7.31\%}$
test_compile_indexing[slice-tensorclass-compile] 0.2187ms 94.9267μs 10.5344 KOps/s 10.3162 KOps/s $\color{#35bf28}+2.12\%$
test_compile_indexing[slice-tensorclass-eager] 55.5610μs 15.2480μs 65.5825 KOps/s 62.2855 KOps/s $\textbf{\color{#35bf28}+5.29\%}$
test_compile_indexing[slice-pytree-compile] 0.1886ms 96.1564μs 10.3997 KOps/s 10.2532 KOps/s $\color{#35bf28}+1.43\%$
test_compile_indexing[slice-pytree-eager] 0.1028ms 15.2539μs 65.5569 KOps/s 62.3636 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_compile_indexing[int-tensordict-compile] 0.1917ms 0.1005ms 9.9489 KOps/s 9.7833 KOps/s $\color{#35bf28}+1.69\%$
test_compile_indexing[int-tensordict-eager] 0.5663ms 17.7666μs 56.2854 KOps/s 52.5345 KOps/s $\textbf{\color{#35bf28}+7.14\%}$
test_compile_indexing[int-tensorclass-compile] 0.1846ms 97.9843μs 10.2057 KOps/s 10.2936 KOps/s $\color{#d91a1a}-0.85\%$
test_compile_indexing[int-tensorclass-eager] 50.6410μs 15.3381μs 65.1969 KOps/s 62.5522 KOps/s $\color{#35bf28}+4.23\%$
test_compile_indexing[int-pytree-compile] 0.1909ms 96.4473μs 10.3684 KOps/s 10.2695 KOps/s $\color{#35bf28}+0.96\%$
test_compile_indexing[int-pytree-eager] 0.1214ms 16.7134μs 59.8322 KOps/s 62.7467 KOps/s $\color{#d91a1a}-4.64\%$
test_mod_add[eager] 0.1348ms 36.7285μs 27.2268 KOps/s 24.1637 KOps/s $\textbf{\color{#35bf28}+12.68\%}$
test_mod_add[compile] 0.1917ms 83.3710μs 11.9946 KOps/s 12.0034 KOps/s $\color{#d91a1a}-0.07\%$
test_mod_add[compile-overhead] 0.3200ms 0.1672ms 5.9808 KOps/s 5.5339 KOps/s $\textbf{\color{#35bf28}+8.08\%}$
test_mod_wrap[eager] 0.3583ms 0.2484ms 4.0259 KOps/s 3.8470 KOps/s $\color{#35bf28}+4.65\%$
test_mod_wrap[compile] 0.3654ms 0.2895ms 3.4539 KOps/s 3.3604 KOps/s $\color{#35bf28}+2.78\%$
test_mod_wrap[compile-overhead] 7.5250ms 3.8039ms 262.8871 Ops/s 262.5623 Ops/s $\color{#35bf28}+0.12\%$
test_mod_wrap_and_backward[eager] 1.6984ms 1.4501ms 689.5880 Ops/s 670.7080 Ops/s $\color{#35bf28}+2.81\%$
test_mod_wrap_and_backward[compile] 1.4790ms 1.3814ms 723.9120 Ops/s 710.9931 Ops/s $\color{#35bf28}+1.82\%$
test_mod_wrap_and_backward[compile-overhead] 1.5713ms 1.0493ms 952.9713 Ops/s 926.7037 Ops/s $\color{#35bf28}+2.83\%$
test_seq_add[eager] 0.1622ms 0.1163ms 8.6010 KOps/s 7.7592 KOps/s $\textbf{\color{#35bf28}+10.85\%}$
test_seq_add[compile] 0.1272ms 89.1197μs 11.2209 KOps/s 10.8240 KOps/s $\color{#35bf28}+3.67\%$
test_seq_add[compile-overhead] 0.1942ms 0.1342ms 7.4507 KOps/s 7.5556 KOps/s $\color{#d91a1a}-1.39\%$
test_seq_wrap[eager] 0.4994ms 0.4276ms 2.3384 KOps/s 2.2068 KOps/s $\textbf{\color{#35bf28}+5.96\%}$
test_seq_wrap[compile] 0.3798ms 0.3060ms 3.2682 KOps/s 3.1931 KOps/s $\color{#35bf28}+2.35\%$
test_seq_wrap[compile-overhead] 0.2820ms 0.2293ms 4.3617 KOps/s 4.3037 KOps/s $\color{#35bf28}+1.35\%$
test_func_call_runtime[False-eager] 1.0407ms 0.7361ms 1.3585 KOps/s 1.3055 KOps/s $\color{#35bf28}+4.05\%$
test_func_call_runtime[False-compile] 1.0909ms 0.7430ms 1.3458 KOps/s 1.2915 KOps/s $\color{#35bf28}+4.20\%$
test_func_call_runtime[False-compile-overhead] 0.4160ms 0.3634ms 2.7520 KOps/s 2.6974 KOps/s $\color{#35bf28}+2.02\%$
test_func_call_runtime[True-eager] 1.1791ms 0.8979ms 1.1137 KOps/s 1.0806 KOps/s $\color{#35bf28}+3.06\%$
test_func_call_runtime[True-compile] 0.8709ms 0.7622ms 1.3120 KOps/s 1.2693 KOps/s $\color{#35bf28}+3.36\%$
test_func_call_runtime[True-compile-overhead] 0.4861ms 0.3881ms 2.5768 KOps/s 2.5553 KOps/s $\color{#35bf28}+0.84\%$
test_func_call_cm_runtime[False-eager] 0.8605ms 0.7336ms 1.3632 KOps/s 1.3102 KOps/s $\color{#35bf28}+4.05\%$
test_func_call_cm_runtime[False-compile] 0.9543ms 0.7380ms 1.3550 KOps/s 1.2964 KOps/s $\color{#35bf28}+4.52\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4317ms 0.3646ms 2.7430 KOps/s 2.6910 KOps/s $\color{#35bf28}+1.93\%$
test_func_call_cm_runtime[True-eager] 1.1548ms 0.9952ms 1.0048 KOps/s 956.5567 Ops/s $\textbf{\color{#35bf28}+5.04\%}$
test_func_call_cm_runtime[True-compile] 1.1103ms 0.9928ms 1.0072 KOps/s 978.2226 Ops/s $\color{#35bf28}+2.97\%$
test_func_call_cm_runtime[True-compile-overhead] 1.0638ms 0.9863ms 1.0139 KOps/s 979.0844 Ops/s $\color{#35bf28}+3.55\%$
test_vmap_func_call_cm_runtime[eager] 2.4775ms 2.0865ms 479.2738 Ops/s 467.5899 Ops/s $\color{#35bf28}+2.50\%$
test_vmap_func_call_cm_runtime[compile] 0.8963ms 0.8183ms 1.2220 KOps/s 1.1867 KOps/s $\color{#35bf28}+2.98\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5689ms 0.4216ms 2.3718 KOps/s 2.3446 KOps/s $\color{#35bf28}+1.16\%$
test_distributed 2.9102ms 0.1838ms 5.4400 KOps/s 8.6985 KOps/s $\textbf{\color{#d91a1a}-37.46\%}$
test_tdmodule 0.2511ms 19.3067μs 51.7956 KOps/s 44.6204 KOps/s $\textbf{\color{#35bf28}+16.08\%}$
test_tdmodule_dispatch 52.4210μs 33.4276μs 29.9154 KOps/s 24.9267 KOps/s $\textbf{\color{#35bf28}+20.01\%}$
test_tdseq 28.7110μs 19.3078μs 51.7925 KOps/s 45.1222 KOps/s $\textbf{\color{#35bf28}+14.78\%}$
test_tdseq_dispatch 57.1010μs 36.2737μs 27.5682 KOps/s 23.5651 KOps/s $\textbf{\color{#35bf28}+16.99\%}$
test_instantiation_functorch 1.6090ms 1.5096ms 662.4298 Ops/s 635.4801 Ops/s $\color{#35bf28}+4.24\%$
test_exec_functorch 0.1837ms 0.1403ms 7.1254 KOps/s 6.8447 KOps/s $\color{#35bf28}+4.10\%$
test_exec_functional_call 0.2089ms 0.1344ms 7.4391 KOps/s 7.0591 KOps/s $\textbf{\color{#35bf28}+5.38\%}$
test_exec_td_decorator 0.3795ms 0.1871ms 5.3453 KOps/s 5.1497 KOps/s $\color{#35bf28}+3.80\%$
test_vmap_mlp_speed_decorator[True-True] 0.7869ms 0.6820ms 1.4662 KOps/s 1.4250 KOps/s $\color{#35bf28}+2.89\%$
test_vmap_mlp_speed_decorator[True-False] 0.8199ms 0.6817ms 1.4670 KOps/s 1.4269 KOps/s $\color{#35bf28}+2.81\%$
test_vmap_mlp_speed_decorator[False-True] 0.7309ms 0.6084ms 1.6437 KOps/s 1.6504 KOps/s $\color{#d91a1a}-0.41\%$
test_vmap_mlp_speed_decorator[False-False] 0.7530ms 0.5980ms 1.6723 KOps/s 1.6565 KOps/s $\color{#35bf28}+0.96\%$
test_vmap_transformer_speed_decorator[True-True] 20.1764ms 19.3924ms 51.5666 Ops/s 51.1612 Ops/s $\color{#35bf28}+0.79\%$
test_vmap_transformer_speed_decorator[True-False] 19.3804ms 19.3029ms 51.8058 Ops/s 51.1916 Ops/s $\color{#35bf28}+1.20\%$
test_vmap_transformer_speed_decorator[False-True] 19.5021ms 19.1880ms 52.1159 Ops/s 51.5651 Ops/s $\color{#35bf28}+1.07\%$
test_vmap_transformer_speed_decorator[False-False] 19.9277ms 19.2146ms 52.0438 Ops/s 51.6723 Ops/s $\color{#35bf28}+0.72\%$
test_to_module_speed[True] 1.5180ms 0.9605ms 1.0412 KOps/s 1.0202 KOps/s $\color{#35bf28}+2.06\%$
test_to_module_speed[False] 1.0114ms 0.9437ms 1.0596 KOps/s 1.0466 KOps/s $\color{#35bf28}+1.24\%$
test_tc_init 70.9210μs 32.7712μs 30.5146 KOps/s 28.3256 KOps/s $\textbf{\color{#35bf28}+7.73\%}$
test_tc_init_nested 0.1086ms 66.0283μs 15.1450 KOps/s 14.4157 KOps/s $\textbf{\color{#35bf28}+5.06\%}$
test_tc_first_layer_tensor 29.2710μs 0.9238μs 1.0825 MOps/s 1.0377 MOps/s $\color{#35bf28}+4.32\%$
test_tc_first_layer_nontensor 30.3310μs 2.4168μs 413.7702 KOps/s 406.7062 KOps/s $\color{#35bf28}+1.74\%$
test_tc_second_layer_tensor 33.5910μs 1.7998μs 555.6137 KOps/s 550.0382 KOps/s $\color{#35bf28}+1.01\%$
test_tc_second_layer_nontensor 31.5600μs 3.2979μs 303.2227 KOps/s 296.7729 KOps/s $\color{#35bf28}+2.17\%$
test_unbind 0.2162s 11.9905ms 83.3996 Ops/s 143.1893 Ops/s $\textbf{\color{#d91a1a}-41.76\%}$
test_full_like 4.5305ms 4.3509ms 229.8356 Ops/s 112.2124 Ops/s $\textbf{\color{#35bf28}+104.82\%}$
test_zeros_like 9.2803ms 7.2904ms 137.1670 Ops/s 230.7453 Ops/s $\textbf{\color{#d91a1a}-40.55\%}$
test_ones_like 5.5364ms 4.3334ms 230.7675 Ops/s 229.7837 Ops/s $\color{#35bf28}+0.43\%$
test_clone 6.9280ms 6.3645ms 157.1210 Ops/s 109.9239 Ops/s $\textbf{\color{#35bf28}+42.94\%}$
test_squeeze 59.7610μs 9.3902μs 106.4940 KOps/s 101.0200 KOps/s $\textbf{\color{#35bf28}+5.42\%}$
test_unsqueeze 0.1224ms 70.8649μs 14.1114 KOps/s 13.5869 KOps/s $\color{#35bf28}+3.86\%$
test_split 0.4026ms 0.1513ms 6.6086 KOps/s 6.1939 KOps/s $\textbf{\color{#35bf28}+6.70\%}$
test_permute 0.2351ms 0.1783ms 5.6077 KOps/s 5.4178 KOps/s $\color{#35bf28}+3.51\%$
test_stack 50.6945ms 50.4853ms 19.8077 Ops/s 19.9291 Ops/s $\color{#d91a1a}-0.61\%$
test_cat 52.4697ms 50.4589ms 19.8181 Ops/s 19.9341 Ops/s $\color{#d91a1a}-0.58\%$
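
The final column in the rows above appears to be the relative difference between the two throughput columns, computed against the second one. A minimal sketch for checking that reading is below. It assumes the first throughput value belongs to this PR and the second to the baseline, which the rows themselves do not state. The helper names `to_ops` and `relative_change` are hypothetical and are not part of the benchmark tooling.

```python
# Hypothetical helpers for re-deriving the percentage column of the table.
# Assumption: change = (first OPS value - second OPS value) / second OPS value.

# Scale factors for the throughput units that appear in the table.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def to_ops(value: float, unit: str) -> float:
    """Convert a throughput reading to plain operations per second."""
    return value * UNIT_SCALE[unit]


def relative_change(first_ops: float, second_ops: float) -> float:
    """Relative difference of the first reading against the second, in percent."""
    return (first_ops - second_ops) / second_ops * 100.0


# Rows taken from the table above; units may differ between the two columns.
full_like = relative_change(to_ops(229.8356, "Ops/s"), to_ops(112.2124, "Ops/s"))
unbind = relative_change(to_ops(83.3996, "Ops/s"), to_ops(143.1893, "Ops/s"))
cm_runtime = relative_change(to_ops(1.0048, "KOps/s"), to_ops(956.5567, "Ops/s"))

print(f"test_full_like: {full_like:+.2f}%")
print(f"test_unbind: {unbind:+.2f}%")
print(f"test_func_call_cm_runtime[True-eager]: {cm_runtime:+.2f}%")
```

Rows such as `test_func_call_cm_runtime[True-eager]` mix `KOps/s` and `Ops/s`, so the values are normalized to a common unit before comparing.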

Labels: CLA Signed, versioning

2 participants