[Versioning] Bump v0.8.0 #1257
Merged
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 45.3050μs | 21.2707μs | 47.0130 KOps/s | 46.7342 KOps/s | |
test_plain_set_stack_nested | 58.0880μs | 21.4793μs | 46.5565 KOps/s | 45.9012 KOps/s | |
test_plain_set_nested_inplace | 67.6060μs | 23.0137μs | 43.4524 KOps/s | 42.3880 KOps/s | |
test_plain_set_stack_nested_inplace | 55.3230μs | 23.0236μs | 43.4338 KOps/s | 42.4318 KOps/s | |
test_items | 30.5870μs | 4.1841μs | 239.0002 KOps/s | 236.5207 KOps/s | |
test_items_nested | 0.8538ms | 0.4081ms | 2.4507 KOps/s | 2.4766 KOps/s | |
test_items_nested_locked | 1.0283ms | 0.4172ms | 2.3967 KOps/s | 2.4412 KOps/s | |
test_items_nested_leaf | 0.1601ms | 76.9509μs | 12.9953 KOps/s | 13.1231 KOps/s | |
test_items_stack_nested | 0.5969ms | 0.4122ms | 2.4263 KOps/s | 2.4701 KOps/s | |
test_items_stack_nested_leaf | 0.3700ms | 77.4923μs | 12.9045 KOps/s | 13.0478 KOps/s | |
test_items_stack_nested_locked | 0.5661ms | 0.4120ms | 2.4271 KOps/s | 2.4578 KOps/s | |
test_keys | 21.7710μs | 3.7130μs | 269.3266 KOps/s | 275.9065 KOps/s | |
test_keys_nested | 0.2759ms | 0.1669ms | 5.9924 KOps/s | 6.0249 KOps/s | |
test_keys_nested_locked | 1.6777ms | 0.1757ms | 5.6902 KOps/s | 5.7939 KOps/s | |
test_keys_nested_leaf | 0.1976ms | 0.1461ms | 6.8470 KOps/s | 6.9289 KOps/s | |
test_keys_stack_nested | 0.2527ms | 0.1675ms | 5.9706 KOps/s | 6.0539 KOps/s | |
test_keys_stack_nested_leaf | 0.2087ms | 0.1464ms | 6.8310 KOps/s | 6.9587 KOps/s | |
test_keys_stack_nested_locked | 0.3095ms | 0.1742ms | 5.7402 KOps/s | 5.8552 KOps/s | |
test_values | 9.8462μs | 1.0551μs | 947.7805 KOps/s | 941.3581 KOps/s | |
test_values_nested | 0.1131ms | 66.6354μs | 15.0070 KOps/s | 15.0193 KOps/s | |
test_values_nested_locked | 0.1271ms | 66.7268μs | 14.9865 KOps/s | 15.1041 KOps/s | |
test_values_nested_leaf | 0.1176ms | 74.8434μs | 13.3612 KOps/s | 13.3484 KOps/s | |
test_values_stack_nested | 0.1220ms | 67.3631μs | 14.8449 KOps/s | 15.1684 KOps/s | |
test_values_stack_nested_leaf | 0.1367ms | 74.7891μs | 13.3709 KOps/s | 13.6403 KOps/s | |
test_values_stack_nested_locked | 0.1339ms | 66.7379μs | 14.9840 KOps/s | 15.1265 KOps/s | |
test_membership | 6.4807μs | 0.7173μs | 1.3941 MOps/s | 1.4572 MOps/s | |
test_membership_nested | 23.3330μs | 2.9215μs | 342.2841 KOps/s | 339.3417 KOps/s | |
test_membership_nested_leaf | 22.9030μs | 2.9491μs | 339.0917 KOps/s | 337.0741 KOps/s | |
test_membership_stacked_nested | 36.5380μs | 2.8770μs | 347.5891 KOps/s | 341.9098 KOps/s | |
test_membership_stacked_nested_leaf | 25.6980μs | 2.9333μs | 340.9170 KOps/s | 339.3119 KOps/s | |
test_membership_nested_last | 22.4420μs | 4.3459μs | 230.1017 KOps/s | 229.7454 KOps/s | |
test_membership_nested_leaf_last | 23.2730μs | 4.3611μs | 229.3014 KOps/s | 230.9647 KOps/s | |
test_membership_stacked_nested_last | 26.7100μs | 4.3480μs | 229.9908 KOps/s | 231.0965 KOps/s | |
test_membership_stacked_nested_leaf_last | 20.5880μs | 4.3423μs | 230.2927 KOps/s | 229.7179 KOps/s | |
test_nested_getleaf | 50.0230μs | 18.0384μs | 55.4373 KOps/s | 56.8539 KOps/s | |
test_nested_get | 48.1800μs | 17.0679μs | 58.5895 KOps/s | 59.8581 KOps/s | |
test_stacked_getleaf | 62.3060μs | 18.0282μs | 55.4687 KOps/s | 57.2789 KOps/s | |
test_stacked_get | 42.0180μs | 17.1626μs | 58.2662 KOps/s | 60.0160 KOps/s | |
test_nested_getitemleaf | 49.0210μs | 18.6360μs | 53.6596 KOps/s | 55.0096 KOps/s | |
test_nested_getitem | 48.4100μs | 17.7703μs | 56.2735 KOps/s | 57.3976 KOps/s | |
test_stacked_getitemleaf | 45.5850μs | 18.4757μs | 54.1250 KOps/s | 55.9629 KOps/s | |
test_stacked_getitem | 46.9480μs | 17.5623μs | 56.9401 KOps/s | 57.7128 KOps/s | |
test_lock_nested | 0.6584ms | 0.4172ms | 2.3967 KOps/s | 2.4658 KOps/s | |
test_lock_stack_nested | 0.5432ms | 0.4228ms | 2.3649 KOps/s | 2.3743 KOps/s | |
test_unlock_nested | 0.4401ms | 0.3380ms | 2.9584 KOps/s | 3.0104 KOps/s | |
test_unlock_stack_nested | 0.5517ms | 0.3409ms | 2.9334 KOps/s | 2.9259 KOps/s | |
test_flatten_speed | 0.1999ms | 99.5254μs | 10.0477 KOps/s | 9.9552 KOps/s | |
test_unflatten_speed | 0.7173ms | 0.5999ms | 1.6669 KOps/s | 1.6879 KOps/s | |
test_common_ops | 1.0971ms | 0.8203ms | 1.2190 KOps/s | 1.2135 KOps/s | |
test_creation | 34.8150μs | 2.5773μs | 388.0060 KOps/s | 399.8933 KOps/s | |
test_creation_empty | 67.4330μs | 11.9210μs | 83.8855 KOps/s | 79.0592 KOps/s | |
test_creation_nested_1 | 64.0900μs | 14.8165μs | 67.4923 KOps/s | 63.7761 KOps/s | |
test_creation_nested_2 | 55.2130μs | 19.2728μs | 51.8867 KOps/s | 50.1849 KOps/s | |
test_clone | 48.4200μs | 13.8308μs | 72.3022 KOps/s | 75.6058 KOps/s | |
test_getitem[int] | 0.8223ms | 13.0751μs | 76.4810 KOps/s | 79.1792 KOps/s | |
test_getitem[slice_int] | 0.1310ms | 25.6349μs | 39.0093 KOps/s | 40.4077 KOps/s | |
test_getitem[range] | 0.1655ms | 51.3518μs | 19.4735 KOps/s | 20.5800 KOps/s | |
test_getitem[tuple] | 0.1203ms | 20.4072μs | 49.0023 KOps/s | 49.7250 KOps/s | |
test_getitem[list] | 0.1617ms | 45.7475μs | 21.8591 KOps/s | 22.1678 KOps/s | |
test_setitem_dim[int] | 48.0300μs | 26.3765μs | 37.9126 KOps/s | 39.6979 KOps/s | |
test_setitem_dim[slice_int] | 91.3100μs | 51.6192μs | 19.3727 KOps/s | 19.8307 KOps/s | |
test_setitem_dim[range] | 0.1631ms | 78.1643μs | 12.7936 KOps/s | 13.4953 KOps/s | |
test_setitem_dim[tuple] | 80.6390μs | 40.7926μs | 24.5143 KOps/s | 25.1217 KOps/s | |
test_setitem | 64.0700μs | 21.1195μs | 47.3495 KOps/s | 48.1913 KOps/s | |
test_set | 56.1350μs | 20.5283μs | 48.7132 KOps/s | 49.1860 KOps/s | |
test_set_shared | 3.5658ms | 0.1831ms | 5.4609 KOps/s | 5.6183 KOps/s | |
test_update | 0.1153ms | 26.4461μs | 37.8128 KOps/s | 37.5329 KOps/s | |
test_update_nested | 0.1394ms | 42.0531μs | 23.7794 KOps/s | 23.6360 KOps/s | |
test_update__nested | 0.3956ms | 35.0279μs | 28.5487 KOps/s | 29.2012 KOps/s | |
test_set_nested | 67.9470μs | 22.7091μs | 44.0352 KOps/s | 43.5406 KOps/s | |
test_set_nested_new | 0.1058ms | 27.6780μs | 36.1297 KOps/s | 36.1206 KOps/s | |
test_select | 0.1188ms | 44.3707μs | 22.5374 KOps/s | 22.6279 KOps/s | |
test_select_nested | 0.1310ms | 65.1155μs | 15.3573 KOps/s | 15.6070 KOps/s | |
test_exclude_nested | 0.1544ms | 82.0235μs | 12.1916 KOps/s | 12.4248 KOps/s | |
test_empty[True] | 0.5489ms | 0.4088ms | 2.4463 KOps/s | 2.4620 KOps/s | |
test_empty[False] | 8.2580μs | 1.3649μs | 732.6747 KOps/s | 717.4644 KOps/s | |
test_unbind_speed | 0.6034ms | 0.2679ms | 3.7332 KOps/s | 3.7257 KOps/s | |
test_unbind_speed_stack0 | 0.4570ms | 0.2659ms | 3.7615 KOps/s | 3.7636 KOps/s | |
test_unbind_speed_stack1 | 0.1060s | 0.7242ms | 1.3808 KOps/s | 1.2398 KOps/s | |
test_split | 0.1071s | 1.7868ms | 559.6617 Ops/s | 575.0286 Ops/s | |
test_chunk | 0.1008s | 1.7680ms | 565.6039 Ops/s | 633.8841 Ops/s | |
test_consolidate_njt[False-None] | 8.2789ms | 8.0416ms | 124.3528 Ops/s | 110.4412 Ops/s | |
test_creation[device0] | 0.2155ms | 92.6188μs | 10.7969 KOps/s | 11.0540 KOps/s | |
test_creation_from_tensor | 4.1880ms | 96.2115μs | 10.3938 KOps/s | 10.6099 KOps/s | |
test_add_one[memmap_tensor0] | 84.7380μs | 5.1631μs | 193.6809 KOps/s | 203.7442 KOps/s | |
test_contiguous[memmap_tensor0] | 18.2140μs | 0.5028μs | 1.9888 MOps/s | 1.9521 MOps/s | |
test_stack[memmap_tensor0] | 31.2280μs | 3.5656μs | 280.4594 KOps/s | 295.9895 KOps/s | |
test_memmaptd_index | 0.3383ms | 0.2333ms | 4.2862 KOps/s | 4.3822 KOps/s | |
test_memmaptd_index_astensor | 0.4841ms | 0.3213ms | 3.1123 KOps/s | 3.1298 KOps/s | |
test_memmaptd_index_op | 1.0711ms | 0.5983ms | 1.6715 KOps/s | 1.6206 KOps/s | |
test_serialize_model | 0.2263s | 0.1328s | 7.5303 Ops/s | 8.7098 Ops/s | |
test_serialize_model_pickle | 0.4523s | 0.3858s | 2.5922 Ops/s | 2.3677 Ops/s | |
test_serialize_weights | 0.1231s | 0.1162s | 8.6026 Ops/s | 8.6186 Ops/s | |
test_serialize_weights_returnearly | 0.1690s | 0.1593s | 6.2787 Ops/s | 5.7317 Ops/s | |
test_serialize_weights_pickle | 1.1085s | 0.7043s | 1.4198 Ops/s | 2.4028 Ops/s | |
test_serialize_weights_filesystem | 0.1470s | 0.1423s | 7.0298 Ops/s | 6.8403 Ops/s | |
test_serialize_model_filesystem | 0.1514s | 0.1413s | 7.0766 Ops/s | 6.5840 Ops/s | |
test_reshape_pytree | 72.5550μs | 26.3238μs | 37.9884 KOps/s | 37.8116 KOps/s | |
test_reshape_td | 0.1159ms | 33.1663μs | 30.1511 KOps/s | 31.1156 KOps/s | |
test_view_pytree | 58.1780μs | 26.4153μs | 37.8568 KOps/s | 37.8244 KOps/s | |
test_view_td | 83.7660μs | 41.3861μs | 24.1627 KOps/s | 24.4103 KOps/s | |
test_unbind_pytree | 90.8290μs | 29.3396μs | 34.0836 KOps/s | 34.1085 KOps/s | |
test_unbind_td | 0.3179ms | 40.1469μs | 24.9085 KOps/s | 25.3416 KOps/s | |
test_split_pytree | 92.8130μs | 29.2799μs | 34.1531 KOps/s | 34.4764 KOps/s | |
test_split_td | 0.5007ms | 46.0871μs | 21.6981 KOps/s | 22.2504 KOps/s | |
test_add_pytree | 87.3430μs | 35.7143μs | 28.0000 KOps/s | 27.7837 KOps/s | |
test_add_td | 0.2606ms | 59.9498μs | 16.6806 KOps/s | 16.9684 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1339ms | 67.5203μs | 14.8104 KOps/s | 14.9749 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.5926ms | 0.1815ms | 5.5085 KOps/s | 5.5543 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1252ms | 46.2411μs | 21.6258 KOps/s | 21.8385 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2258ms | 0.1184ms | 8.4492 KOps/s | 8.4230 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 88.6360μs | 27.9444μs | 35.7854 KOps/s | 35.8877 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1353ms | 63.7964μs | 15.6749 KOps/s | 15.5758 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1651ms | 79.9774μs | 12.5035 KOps/s | 12.2909 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1827ms | 65.8022μs | 15.1971 KOps/s | 15.0327 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1866ms | 0.1084ms | 9.2281 KOps/s | 9.3704 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2985ms | 0.2202ms | 4.5410 KOps/s | 4.6177 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1235ms | 49.3387μs | 20.2680 KOps/s | 21.7902 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.6558ms | 69.1191μs | 14.4678 KOps/s | 14.8309 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2322ms | 0.1010ms | 9.9019 KOps/s | 10.0044 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4240ms | 0.2061ms | 4.8532 KOps/s | 4.9776 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3976ms | 0.2349ms | 4.2580 KOps/s | 4.3212 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2139ms | 0.1116ms | 8.9608 KOps/s | 9.2623 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1215ms | 64.4904μs | 15.5062 KOps/s | 15.8277 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1234ms | 50.4580μs | 19.8185 KOps/s | 20.2450 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3617ms | 0.1588ms | 6.2955 KOps/s | 6.4266 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1816ms | 99.2706μs | 10.0735 KOps/s | 10.0183 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 82.3930μs | 22.0352μs | 45.3819 KOps/s | 47.0654 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1454ms | 68.8482μs | 14.5247 KOps/s | 15.0557 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1709ms | 84.3238μs | 11.8590 KOps/s | 11.8496 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1357ms | 66.7849μs | 14.9735 KOps/s | 15.0245 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3243ms | 0.2136ms | 4.6812 KOps/s | 4.6176 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.4674ms | 1.3882ms | 720.3707 Ops/s | 716.6900 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.4107ms | 0.2090ms | 4.7847 KOps/s | 4.7581 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3343ms | 0.8352ms | 1.1974 KOps/s | 1.1938 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5280ms | 0.4558ms | 2.1941 KOps/s | 2.1639 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.5094ms | 2.7077ms | 369.3184 Ops/s | 365.1359 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1046ms | 38.9327μs | 25.6853 KOps/s | 26.6594 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.6346ms | 34.8141μs | 28.7240 KOps/s | 28.0934 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 87.9140μs | 31.8223μs | 31.4245 KOps/s | 33.0080 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 80.4200μs | 23.3112μs | 42.8979 KOps/s | 43.2218 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 82.3730μs | 31.8577μs | 31.3896 KOps/s | 31.7205 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 63.6580μs | 23.4811μs | 42.5875 KOps/s | 42.8863 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1142ms | 54.1580μs | 18.4645 KOps/s | 18.7721 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4035ms | 21.7375μs | 46.0035 KOps/s | 46.3771 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1160ms | 46.8919μs | 21.3256 KOps/s | 21.7520 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 70.6720μs | 18.3145μs | 54.6015 KOps/s | 53.8740 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1218ms | 47.8018μs | 20.9197 KOps/s | 21.3354 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 65.1520μs | 18.3534μs | 54.4857 KOps/s | 55.0914 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1164ms | 55.4812μs | 18.0241 KOps/s | 18.5062 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9610ms | 21.6099μs | 46.2751 KOps/s | 46.7699 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1211ms | 47.2107μs | 21.1816 KOps/s | 21.3415 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.2841ms | 18.5071μs | 54.0334 KOps/s | 54.9363 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1160ms | 47.6485μs | 20.9870 KOps/s | 21.6343 KOps/s | |
test_compile_indexing[int-pytree-eager] | 82.0830μs | 18.3968μs | 54.3573 KOps/s | 55.1897 KOps/s | |
test_mod_add[eager] | 88.2250μs | 37.1653μs | 26.9068 KOps/s | 26.9826 KOps/s | |
test_mod_add[compile] | 0.1205ms | 66.1942μs | 15.1071 KOps/s | 15.1058 KOps/s | |
test_mod_add[compile-overhead] | 0.1356ms | 65.8653μs | 15.1825 KOps/s | 14.6551 KOps/s | |
test_mod_wrap[eager] | 0.3483ms | 0.2226ms | 4.4932 KOps/s | 4.3271 KOps/s | |
test_mod_wrap[compile] | 2.2794ms | 0.2292ms | 4.3636 KOps/s | 4.2165 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4483ms | 0.2280ms | 4.3859 KOps/s | 4.3642 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.2045ms | 10.8904ms | 91.8244 Ops/s | 71.6631 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.6961ms | 11.0415ms | 90.5676 Ops/s | 73.7514 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.4703ms | 11.3823ms | 87.8555 Ops/s | 76.6307 Ops/s | |
test_seq_add[eager] | 0.2219ms | 0.1255ms | 7.9712 KOps/s | 8.3032 KOps/s | |
test_seq_add[compile] | 0.1419ms | 78.2213μs | 12.7842 KOps/s | 12.8439 KOps/s | |
test_seq_add[compile-overhead] | 0.1385ms | 75.6843μs | 13.2128 KOps/s | 13.4011 KOps/s | |
test_seq_wrap[eager] | 0.8701ms | 0.4644ms | 2.1535 KOps/s | 2.1507 KOps/s | |
test_seq_wrap[compile] | 0.4358ms | 0.2467ms | 4.0539 KOps/s | 4.0390 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4321ms | 0.2444ms | 4.0916 KOps/s | 4.0688 KOps/s | |
test_func_call_runtime[False-eager] | 0.7017ms | 0.5395ms | 1.8537 KOps/s | 1.7941 KOps/s | |
test_func_call_runtime[False-compile] | 0.8970ms | 0.4465ms | 2.2396 KOps/s | 2.2323 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9630ms | 0.4520ms | 2.2126 KOps/s | 2.2441 KOps/s | |
test_func_call_runtime[True-eager] | 0.9611ms | 0.7516ms | 1.3305 KOps/s | 1.3193 KOps/s | |
test_func_call_runtime[True-compile] | 0.9324ms | 0.4697ms | 2.1288 KOps/s | 2.1439 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6352ms | 0.4717ms | 2.1199 KOps/s | 2.1408 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6527ms | 0.5350ms | 1.8690 KOps/s | 1.8248 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8169ms | 0.4487ms | 2.2287 KOps/s | 2.2539 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6548ms | 0.4459ms | 2.2425 KOps/s | 2.2448 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.2854ms | 0.8874ms | 1.1269 KOps/s | 1.1112 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.9380ms | 0.7889ms | 1.2676 KOps/s | 1.2388 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0703ms | 0.7980ms | 1.2531 KOps/s | 1.2322 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6059ms | 1.9128ms | 522.7874 Ops/s | 519.5714 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9144ms | 0.5465ms | 1.8297 KOps/s | 1.8513 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8813ms | 0.5426ms | 1.8428 KOps/s | 1.8459 KOps/s | |
test_distributed | 0.3194ms | 0.1291ms | 7.7457 KOps/s | 7.8894 KOps/s | |
test_tdmodule | 77.3140μs | 28.6483μs | 34.9061 KOps/s | 35.4150 KOps/s | |
test_tdmodule_dispatch | 0.1360ms | 57.6947μs | 17.3326 KOps/s | 19.3442 KOps/s | |
test_tdseq | 67.7860μs | 31.0207μs | 32.2366 KOps/s | 32.9841 KOps/s | |
test_tdseq_dispatch | 0.1004ms | 56.7262μs | 17.6285 KOps/s | 17.7698 KOps/s | |
test_instantiation_functorch | 2.0175ms | 1.5377ms | 650.3193 Ops/s | 656.7756 Ops/s | |
test_exec_functorch | 0.3166ms | 0.1807ms | 5.5345 KOps/s | 5.5981 KOps/s | |
test_exec_functional_call | 0.4009ms | 0.1748ms | 5.7218 KOps/s | 5.5655 KOps/s | |
test_exec_td_decorator | 0.5424ms | 0.2378ms | 4.2053 KOps/s | 4.2050 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2033ms | 0.6839ms | 1.4621 KOps/s | 1.4997 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.6446ms | 0.6877ms | 1.4542 KOps/s | 1.5115 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8292ms | 0.5444ms | 1.8370 KOps/s | 1.8892 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8227ms | 0.5440ms | 1.8384 KOps/s | 1.8847 KOps/s | |
test_to_module_speed[True] | 2.6071ms | 1.3517ms | 739.8363 Ops/s | 731.0207 Ops/s | |
test_to_module_speed[False] | 1.8294ms | 1.3152ms | 760.3306 Ops/s | 764.7523 Ops/s | |
test_tc_init | 87.3920μs | 47.0797μs | 21.2406 KOps/s | 20.8099 KOps/s | |
test_tc_init_nested | 0.1973ms | 93.8083μs | 10.6600 KOps/s | 10.1960 KOps/s | |
test_tc_first_layer_tensor | 40.3450μs | 1.7120μs | 584.0957 KOps/s | 593.7859 KOps/s | |
test_tc_first_layer_nontensor | 32.6700μs | 4.9549μs | 201.8193 KOps/s | 203.3808 KOps/s | |
test_tc_second_layer_tensor | 48.2290μs | 3.1975μs | 312.7401 KOps/s | 313.3826 KOps/s | |
test_tc_second_layer_nontensor | 39.2330μs | 6.5466μs | 152.7515 KOps/s | 154.5685 KOps/s | |
test_unbind | 0.2606s | 13.9889ms | 71.4854 Ops/s | 68.5191 Ops/s | |
test_full_like | 8.3040ms | 5.4439ms | 183.6930 Ops/s | 285.5508 Ops/s | |
test_zeros_like | 10.1251ms | 5.0942ms | 196.2999 Ops/s | 346.0594 Ops/s | |
test_ones_like | 6.0320ms | 3.7863ms | 264.1113 Ops/s | 276.9189 Ops/s | |
test_clone | 7.4952ms | 5.3547ms | 186.7526 Ops/s | 182.6829 Ops/s | |
test_squeeze | 66.1130μs | 13.0462μs | 76.6509 KOps/s | 78.2702 KOps/s | |
test_unsqueeze | 0.3249ms | 97.3642μs | 10.2707 KOps/s | 10.6417 KOps/s | |
test_split | 0.3574ms | 0.1974ms | 5.0665 KOps/s | 5.2295 KOps/s | |
test_permute | 0.3174ms | 0.2050ms | 4.8790 KOps/s | 5.0336 KOps/s | |
test_stack | 29.6607ms | 24.9382ms | 40.0992 Ops/s | 39.1426 Ops/s | |
test_cat | 29.6458ms | 24.8444ms | 40.2505 Ops/s | 39.0491 Ops/s |
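
The Change column is empty in this capture. As a rough sketch, a relative change can be recomputed from the Ops and Ops on Repo HEAD cells. The definition used here, `(ops - ops_head) / ops_head`, is an assumption rather than the benchmark bot's confirmed formula, and `parse_ops` and `relative_change` are hypothetical helper names.

```python
# Multipliers for the throughput units that appear in the tables.
_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '154.8543 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * _UNITS[unit]


def relative_change(row: str) -> float:
    """Relative difference between Ops and Ops on Repo HEAD for one table row.

    Assumed definition (not confirmed by the source): (ops - ops_head) / ops_head.
    """
    cells = [c.strip() for c in row.split("|")]
    # Column order: Name, Max, Mean, Ops, Ops on Repo HEAD, Change.
    ops, ops_head = parse_ops(cells[3]), parse_ops(cells[4])
    return (ops - ops_head) / ops_head


# A row copied verbatim from the results on this page.
row = "test_creation_empty | 34.5900μs | 6.4577μs | 154.8543 KOps/s | 116.8732 KOps/s | |"
print(f"{relative_change(row):+.1%}")
```

Converting to a common unit first matters because some rows mix Ops/s, KOps/s and MOps/s across runs. Single-run differences of a few percent are likely within noise for microbenchmarks like these.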

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.5610μs | 11.8928μs | 84.0845 KOps/s | 76.0824 KOps/s | |
test_plain_set_stack_nested | 50.0710μs | 12.0700μs | 82.8499 KOps/s | 75.0658 KOps/s | |
test_plain_set_nested_inplace | 42.9900μs | 12.9793μs | 77.0460 KOps/s | 69.8720 KOps/s | |
test_plain_set_stack_nested_inplace | 61.9210μs | 13.1698μs | 75.9312 KOps/s | 70.2072 KOps/s | |
test_items | 38.6910μs | 2.8711μs | 348.2964 KOps/s | 345.0180 KOps/s | |
test_items_nested | 0.4132ms | 0.3645ms | 2.7437 KOps/s | 2.7099 KOps/s | |
test_items_nested_locked | 0.4215ms | 0.3686ms | 2.7133 KOps/s | 2.7112 KOps/s | |
test_items_nested_leaf | 88.8910μs | 62.2869μs | 16.0547 KOps/s | 15.9663 KOps/s | |
test_items_stack_nested | 0.5403ms | 0.3633ms | 2.7529 KOps/s | 2.6973 KOps/s | |
test_items_stack_nested_leaf | 94.1210μs | 62.5021μs | 15.9995 KOps/s | 16.0576 KOps/s | |
test_items_stack_nested_locked | 0.5178ms | 0.3665ms | 2.7285 KOps/s | 2.7091 KOps/s | |
test_keys | 50.8110μs | 3.4249μs | 291.9807 KOps/s | 293.0005 KOps/s | |
test_keys_nested | 0.1280ms | 88.8774μs | 11.2515 KOps/s | 11.3781 KOps/s | |
test_keys_nested_locked | 0.7070ms | 94.1514μs | 10.6212 KOps/s | 10.6538 KOps/s | |
test_keys_nested_leaf | 0.1171ms | 79.2620μs | 12.6164 KOps/s | 12.6103 KOps/s | |
test_keys_stack_nested | 0.1621ms | 87.4928μs | 11.4295 KOps/s | 11.4552 KOps/s | |
test_keys_stack_nested_leaf | 0.1140ms | 78.7150μs | 12.7041 KOps/s | 12.7983 KOps/s | |
test_keys_stack_nested_locked | 0.1399ms | 94.4846μs | 10.5837 KOps/s | 10.6732 KOps/s | |
test_values | 4.2850μs | 0.8537μs | 1.1714 MOps/s | 1.1737 MOps/s | |
test_values_nested | 64.4010μs | 38.5152μs | 25.9638 KOps/s | 26.2124 KOps/s | |
test_values_nested_locked | 64.4210μs | 40.2606μs | 24.8382 KOps/s | 25.0892 KOps/s | |
test_values_nested_leaf | 68.5610μs | 43.4745μs | 23.0020 KOps/s | 23.1680 KOps/s | |
test_values_stack_nested | 70.0610μs | 38.4174μs | 26.0298 KOps/s | 26.0999 KOps/s | |
test_values_stack_nested_leaf | 70.1910μs | 43.3635μs | 23.0609 KOps/s | 23.1743 KOps/s | |
test_values_stack_nested_locked | 70.3010μs | 40.6058μs | 24.6270 KOps/s | 25.1165 KOps/s | |
test_membership | 2.3706μs | 0.5001μs | 1.9996 MOps/s | 1.9951 MOps/s | |
test_membership_nested | 18.3500μs | 1.9857μs | 503.5903 KOps/s | 504.0555 KOps/s | |
test_membership_nested_leaf | 15.5350μs | 1.9964μs | 500.9061 KOps/s | 499.9143 KOps/s | |
test_membership_stacked_nested | 25.8710μs | 2.0592μs | 485.6302 KOps/s | 485.2543 KOps/s | |
test_membership_stacked_nested_leaf | 35.5700μs | 2.0558μs | 486.4336 KOps/s | 485.0675 KOps/s | |
test_membership_nested_last | 42.9110μs | 3.0354μs | 329.4485 KOps/s | 330.1285 KOps/s | |
test_membership_nested_leaf_last | 49.0610μs | 3.0131μs | 331.8866 KOps/s | 329.9813 KOps/s | |
test_membership_stacked_nested_last | 53.5210μs | 3.0347μs | 329.5178 KOps/s | 331.4650 KOps/s | |
test_membership_stacked_nested_leaf_last | 32.1900μs | 3.0275μs | 330.3070 KOps/s | 331.5549 KOps/s | |
test_nested_getleaf | 49.7900μs | 13.4588μs | 74.3010 KOps/s | 74.2010 KOps/s | |
test_nested_get | 38.0810μs | 12.8057μs | 78.0901 KOps/s | 77.4962 KOps/s | |
test_stacked_getleaf | 42.2710μs | 13.4125μs | 74.5573 KOps/s | 74.8667 KOps/s | |
test_stacked_get | 43.3200μs | 12.7444μs | 78.4656 KOps/s | 79.0482 KOps/s | |
test_nested_getitemleaf | 55.4810μs | 14.0129μs | 71.3628 KOps/s | 72.1874 KOps/s | |
test_nested_getitem | 36.5100μs | 13.1651μs | 75.9584 KOps/s | 76.4258 KOps/s | |
test_stacked_getitemleaf | 42.6900μs | 13.8193μs | 72.3627 KOps/s | 72.9835 KOps/s | |
test_stacked_getitem | 59.3810μs | 13.0170μs | 76.8225 KOps/s | 77.6344 KOps/s | |
test_lock_nested | 7.8819ms | 0.3453ms | 2.8963 KOps/s | 2.9191 KOps/s | |
test_lock_stack_nested | 0.3896ms | 0.3416ms | 2.9276 KOps/s | 2.8441 KOps/s | |
test_unlock_nested | 0.3599ms | 0.2853ms | 3.5053 KOps/s | 3.4754 KOps/s | |
test_unlock_stack_nested | 0.3178ms | 0.2801ms | 3.5697 KOps/s | 3.4461 KOps/s | |
test_flatten_speed | 0.1149ms | 78.6988μs | 12.7067 KOps/s | 12.8153 KOps/s | |
test_unflatten_speed | 0.4600ms | 0.4045ms | 2.4722 KOps/s | 2.5067 KOps/s | |
test_common_ops | 0.7142ms | 0.5731ms | 1.7448 KOps/s | 1.5766 KOps/s | |
test_creation | 89.3410μs | 1.7537μs | 570.2353 KOps/s | 572.2836 KOps/s | |
test_creation_empty | 34.5900μs | 6.4577μs | 154.8543 KOps/s | 116.8732 KOps/s | |
test_creation_nested_1 | 31.3110μs | 8.3032μs | 120.4353 KOps/s | 97.8051 KOps/s | |
test_creation_nested_2 | 36.3810μs | 10.9683μs | 91.1722 KOps/s | 76.5730 KOps/s | |
test_clone | 56.6410μs | 10.4960μs | 95.2746 KOps/s | 93.0746 KOps/s | |
test_getitem[int] | 1.2723ms | 10.4584μs | 95.6165 KOps/s | 92.3427 KOps/s | |
test_getitem[slice_int] | 0.1124ms | 20.5693μs | 48.6161 KOps/s | 47.7019 KOps/s | |
test_getitem[range] | 0.1289ms | 37.5834μs | 26.6075 KOps/s | 26.0742 KOps/s | |
test_getitem[tuple] | 0.1144ms | 17.7654μs | 56.2891 KOps/s | 54.6763 KOps/s | |
test_getitem[list] | 0.1286ms | 32.5914μs | 30.6829 KOps/s | 29.4817 KOps/s | |
test_setitem_dim[int] | 41.2210μs | 18.7333μs | 53.3809 KOps/s | 50.6845 KOps/s | |
test_setitem_dim[slice_int] | 61.1510μs | 38.3269μs | 26.0913 KOps/s | 25.6533 KOps/s | |
test_setitem_dim[range] | 92.5810μs | 53.4879μs | 18.6958 KOps/s | 18.6403 KOps/s | |
test_setitem_dim[tuple] | 53.2110μs | 31.5827μs | 31.6629 KOps/s | 29.9016 KOps/s | |
test_setitem | 57.4310μs | 13.8563μs | 72.1692 KOps/s | 65.5837 KOps/s | |
test_set | 48.1110μs | 13.1177μs | 76.2330 KOps/s | 67.7087 KOps/s | |
test_set_shared | 0.6018ms | 0.1573ms | 6.3565 KOps/s | 6.3084 KOps/s | |
test_update | 0.3387ms | 17.0515μs | 58.6458 KOps/s | 52.0835 KOps/s | |
test_update_nested | 58.0410μs | 25.6765μs | 38.9462 KOps/s | 35.2330 KOps/s | |
test_update__nested | 0.5233ms | 25.4119μs | 39.3517 KOps/s | 39.5321 KOps/s | |
test_set_nested | 49.2710μs | 14.5019μs | 68.9565 KOps/s | 62.7024 KOps/s | |
test_set_nested_new | 57.1810μs | 17.0799μs | 58.5482 KOps/s | 54.4888 KOps/s | |
test_select | 71.7710μs | 28.9653μs | 34.5240 KOps/s | 33.2589 KOps/s | |
test_select_nested | 77.4310μs | 44.1847μs | 22.6323 KOps/s | 22.7135 KOps/s | |
test_exclude_nested | 92.3920μs | 63.5604μs | 15.7331 KOps/s | 15.9077 KOps/s | |
test_empty[True] | 0.3534ms | 0.2976ms | 3.3605 KOps/s | 3.4056 KOps/s | |
test_empty[False] | 3.3711μs | 0.8234μs | 1.2145 MOps/s | 1.2235 MOps/s | |
test_to | 89.5210μs | 57.7621μs | 17.3124 KOps/s | 16.8728 KOps/s | |
test_to_nonblocking | 96.0510μs | 49.4778μs | 20.2111 KOps/s | 20.5324 KOps/s | |
test_unbind_speed | 0.2750ms | 0.2367ms | 4.2253 KOps/s | 4.1030 KOps/s | |
test_unbind_speed_stack0 | 0.2927ms | 0.2349ms | 4.2578 KOps/s | 4.1231 KOps/s | |
test_unbind_speed_stack1 | 92.2995ms | 0.7319ms | 1.3664 KOps/s | 1.3526 KOps/s | |
test_split | 93.3556ms | 1.5731ms | 635.7057 Ops/s | 623.0582 Ops/s | |
test_chunk | 95.3172ms | 1.5862ms | 630.4207 Ops/s | 618.7683 Ops/s | |
test_consolidate[False-None] | 95.3325ms | 3.0881ms | 323.8218 Ops/s | 357.1980 Ops/s | |
test_consolidate[default-None] | 1.8096ms | 1.7032ms | 587.1209 Ops/s | 579.2627 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8469ms | 1.7289ms | 578.3871 Ops/s | 569.3619 Ops/s | |
test_consolidate_njt[False-None] | 6.5979ms | 6.3148ms | 158.3593 Ops/s | 150.1611 Ops/s | |
test_to[False-False-None] | 1.8178ms | 1.7247ms | 579.8054 Ops/s | 551.8505 Ops/s | |
test_to[True-False-None] | 1.5410ms | 1.3408ms | 745.8407 Ops/s | 696.1935 Ops/s | |
test_to[within-False-None] | 4.2808ms | 4.2019ms | 237.9875 Ops/s | 227.7119 Ops/s | |
test_to[True-default-None] | 5.2881ms | 5.1301ms | 194.9280 Ops/s | 187.8191 Ops/s | |
test_to_njt[False-False-None] | 6.9208ms | 6.8413ms | 146.1701 Ops/s | 139.4280 Ops/s | |
test_to_njt[True-False-None] | 5.4592ms | 5.3222ms | 187.8905 Ops/s | 172.0899 Ops/s | |
test_to_njt[within-False-None] | 11.8095ms | 11.6799ms | 85.6172 Ops/s | 81.2041 Ops/s | |
test_creation[device0] | 0.4723ms | 79.6331μs | 12.5576 KOps/s | 12.3823 KOps/s | |
test_creation_from_tensor | 0.5442ms | 82.6395μs | 12.1007 KOps/s | 11.7868 KOps/s | |
test_add_one[memmap_tensor0] | 0.5614ms | 6.6989μs | 149.2793 KOps/s | 139.7282 KOps/s | |
test_contiguous[memmap_tensor0] | 4.5040μs | 0.4250μs | 2.3531 MOps/s | 2.2825 MOps/s | |
test_stack[memmap_tensor0] | 36.6210μs | 4.2818μs | 233.5471 KOps/s | 208.8466 KOps/s | |
test_memmaptd_index | 1.4778ms | 0.2364ms | 4.2297 KOps/s | 4.0230 KOps/s | |
test_memmaptd_index_astensor | 0.4206ms | 0.2966ms | 3.3711 KOps/s | 3.1882 KOps/s | |
test_memmaptd_index_op | 0.7125ms | 0.5402ms | 1.8513 KOps/s | 1.6561 KOps/s | |
test_serialize_model | 0.1326s | 0.1309s | 7.6409 Ops/s | 7.6643 Ops/s | |
test_serialize_model_pickle | 1.3516s | 1.2162s | 0.8222 Ops/s | 0.8247 Ops/s | |
test_serialize_weights | 0.4057s | 0.1695s | 5.9002 Ops/s | 7.6540 Ops/s | |
test_serialize_weights_returnearly | 0.3321s | 53.2655ms | 18.7739 Ops/s | 23.1384 Ops/s | |
test_serialize_weights_pickle | 1.4371s | 1.2283s | 0.8142 Ops/s | 0.8220 Ops/s | |
test_reshape_pytree | 65.0810μs | 22.3422μs | 44.7584 KOps/s | 44.0504 KOps/s | |
test_reshape_td | 61.9810μs | 26.4655μs | 37.7850 KOps/s | 35.4300 KOps/s | |
test_view_pytree | 54.4210μs | 22.3311μs | 44.7807 KOps/s | 43.9750 KOps/s | |
test_view_td | 65.9010μs | 31.9317μs | 31.3168 KOps/s | 29.7964 KOps/s | |
test_unbind_pytree | 51.6210μs | 27.5653μs | 36.2775 KOps/s | 34.2076 KOps/s | |
test_unbind_td | 0.7384ms | 36.1130μs | 27.6909 KOps/s | 26.0695 KOps/s | |
test_split_pytree | 56.7210μs | 29.4268μs | 33.9826 KOps/s | 30.5886 KOps/s | |
test_split_td | 0.8839ms | 38.1864μs | 26.1873 KOps/s | 24.8535 KOps/s | |
test_add_pytree | 65.3810μs | 34.6835μs | 28.8321 KOps/s | 25.9520 KOps/s | |
test_add_td | 98.6420μs | 46.3780μs | 21.5620 KOps/s | 18.8607 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1827ms | 0.1211ms | 8.2591 KOps/s | 7.8797 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2364ms | 0.1412ms | 7.0817 KOps/s | 6.9905 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.4818ms | 94.3103μs | 10.6033 KOps/s | 10.3386 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.0373ms | 0.1484ms | 6.7396 KOps/s | 6.4537 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.4196ms | 24.8464μs | 40.2472 KOps/s | 41.0266 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.4363ms | 35.6315μs | 28.0650 KOps/s | 28.1995 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.3184ms | 64.2239μs | 15.5705 KOps/s | 15.3465 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.4394ms | 49.4047μs | 20.2410 KOps/s | 20.0624 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1844ms | 0.1416ms | 7.0643 KOps/s | 7.1306 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.6160ms | 0.2198ms | 4.5487 KOps/s | 4.5187 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.5034ms | 96.1965μs | 10.3954 KOps/s | 10.1034 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4560ms | 57.9532μs | 17.2553 KOps/s | 17.2294 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1723ms | 0.1339ms | 7.4693 KOps/s | 7.3007 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.8837ms | 0.4772ms | 2.0955 KOps/s | 2.0058 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.6590ms | 0.2618ms | 3.8195 KOps/s | 3.7638 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1928ms | 0.1410ms | 7.0927 KOps/s | 7.0332 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.4757ms | 69.9661μs | 14.2926 KOps/s | 13.8413 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1333ms | 96.4990μs | 10.3628 KOps/s | 10.1618 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.8033ms | 0.4040ms | 2.4753 KOps/s | 2.3652 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1712ms | 0.1335ms | 7.4898 KOps/s | 7.3637 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1269ms | 18.4729μs | 54.1335 KOps/s | 55.4472 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 57.8810μs | 31.8186μs | 31.4282 KOps/s | 31.5235 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1992ms | 69.6382μs | 14.3599 KOps/s | 14.2215 KOps/s | |
test_compile_copy_flat[pytree-eager] | 83.2510μs | 52.0535μs | 19.2110 KOps/s | 19.1840 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6336ms | 0.3993ms | 2.5046 KOps/s | 2.1973 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7523ms | 2.6095ms | 383.2201 Ops/s | 365.7788 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5943ms | 0.3810ms | 2.6249 KOps/s | 2.2598 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7181ms | 2.6002ms | 384.5890 Ops/s | 365.5168 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5240ms | 0.1147ms | 8.7161 KOps/s | 8.7272 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5917ms | 82.1409μs | 12.1742 KOps/s | 11.8857 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.7710ms | 0.1083ms | 9.2339 KOps/s | 9.2557 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1509ms | 68.2989μs | 14.6415 KOps/s | 14.3270 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2277ms | 0.1118ms | 8.9433 KOps/s | 9.2131 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1499ms | 67.9155μs | 14.7242 KOps/s | 14.4074 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1378ms | 0.1027ms | 9.7370 KOps/s | 9.9270 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1493ms | 18.1355μs | 55.1404 KOps/s | 51.3839 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2187ms | 94.9267μs | 10.5344 KOps/s | 10.3162 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 55.5610μs | 15.2480μs | 65.5825 KOps/s | 62.2855 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1886ms | 96.1564μs | 10.3997 KOps/s | 10.2532 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1028ms | 15.2539μs | 65.5569 KOps/s | 62.3636 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1917ms | 0.1005ms | 9.9489 KOps/s | 9.7833 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5663ms | 17.7666μs | 56.2854 KOps/s | 52.5345 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1846ms | 97.9843μs | 10.2057 KOps/s | 10.2936 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 50.6410μs | 15.3381μs | 65.1969 KOps/s | 62.5522 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1909ms | 96.4473μs | 10.3684 KOps/s | 10.2695 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1214ms | 16.7134μs | 59.8322 KOps/s | 62.7467 KOps/s | |
test_mod_add[eager] | 0.1348ms | 36.7285μs | 27.2268 KOps/s | 24.1637 KOps/s | |
test_mod_add[compile] | 0.1917ms | 83.3710μs | 11.9946 KOps/s | 12.0034 KOps/s | |
test_mod_add[compile-overhead] | 0.3200ms | 0.1672ms | 5.9808 KOps/s | 5.5339 KOps/s | |
test_mod_wrap[eager] | 0.3583ms | 0.2484ms | 4.0259 KOps/s | 3.8470 KOps/s | |
test_mod_wrap[compile] | 0.3654ms | 0.2895ms | 3.4539 KOps/s | 3.3604 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5250ms | 3.8039ms | 262.8871 Ops/s | 262.5623 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6984ms | 1.4501ms | 689.5880 Ops/s | 670.7080 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4790ms | 1.3814ms | 723.9120 Ops/s | 710.9931 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.5713ms | 1.0493ms | 952.9713 Ops/s | 926.7037 Ops/s | |
test_seq_add[eager] | 0.1622ms | 0.1163ms | 8.6010 KOps/s | 7.7592 KOps/s | |
test_seq_add[compile] | 0.1272ms | 89.1197μs | 11.2209 KOps/s | 10.8240 KOps/s | |
test_seq_add[compile-overhead] | 0.1942ms | 0.1342ms | 7.4507 KOps/s | 7.5556 KOps/s | |
test_seq_wrap[eager] | 0.4994ms | 0.4276ms | 2.3384 KOps/s | 2.2068 KOps/s | |
test_seq_wrap[compile] | 0.3798ms | 0.3060ms | 3.2682 KOps/s | 3.1931 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2820ms | 0.2293ms | 4.3617 KOps/s | 4.3037 KOps/s | |
test_func_call_runtime[False-eager] | 1.0407ms | 0.7361ms | 1.3585 KOps/s | 1.3055 KOps/s | |
test_func_call_runtime[False-compile] | 1.0909ms | 0.7430ms | 1.3458 KOps/s | 1.2915 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4160ms | 0.3634ms | 2.7520 KOps/s | 2.6974 KOps/s | |
test_func_call_runtime[True-eager] | 1.1791ms | 0.8979ms | 1.1137 KOps/s | 1.0806 KOps/s | |
test_func_call_runtime[True-compile] | 0.8709ms | 0.7622ms | 1.3120 KOps/s | 1.2693 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4861ms | 0.3881ms | 2.5768 KOps/s | 2.5553 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8605ms | 0.7336ms | 1.3632 KOps/s | 1.3102 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9543ms | 0.7380ms | 1.3550 KOps/s | 1.2964 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4317ms | 0.3646ms | 2.7430 KOps/s | 2.6910 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1548ms | 0.9952ms | 1.0048 KOps/s | 956.5567 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1103ms | 0.9928ms | 1.0072 KOps/s | 978.2226 Ops/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.0638ms | 0.9863ms | 1.0139 KOps/s | 979.0844 Ops/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4775ms | 2.0865ms | 479.2738 Ops/s | 467.5899 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8963ms | 0.8183ms | 1.2220 KOps/s | 1.1867 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5689ms | 0.4216ms | 2.3718 KOps/s | 2.3446 KOps/s | |
test_distributed | 2.9102ms | 0.1838ms | 5.4400 KOps/s | 8.6985 KOps/s | |
test_tdmodule | 0.2511ms | 19.3067μs | 51.7956 KOps/s | 44.6204 KOps/s | |
test_tdmodule_dispatch | 52.4210μs | 33.4276μs | 29.9154 KOps/s | 24.9267 KOps/s | |
test_tdseq | 28.7110μs | 19.3078μs | 51.7925 KOps/s | 45.1222 KOps/s | |
test_tdseq_dispatch | 57.1010μs | 36.2737μs | 27.5682 KOps/s | 23.5651 KOps/s | |
test_instantiation_functorch | 1.6090ms | 1.5096ms | 662.4298 Ops/s | 635.4801 Ops/s | |
test_exec_functorch | 0.1837ms | 0.1403ms | 7.1254 KOps/s | 6.8447 KOps/s | |
test_exec_functional_call | 0.2089ms | 0.1344ms | 7.4391 KOps/s | 7.0591 KOps/s | |
test_exec_td_decorator | 0.3795ms | 0.1871ms | 5.3453 KOps/s | 5.1497 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7869ms | 0.6820ms | 1.4662 KOps/s | 1.4250 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8199ms | 0.6817ms | 1.4670 KOps/s | 1.4269 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7309ms | 0.6084ms | 1.6437 KOps/s | 1.6504 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7530ms | 0.5980ms | 1.6723 KOps/s | 1.6565 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.1764ms | 19.3924ms | 51.5666 Ops/s | 51.1612 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.3804ms | 19.3029ms | 51.8058 Ops/s | 51.1916 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.5021ms | 19.1880ms | 52.1159 Ops/s | 51.5651 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.9277ms | 19.2146ms | 52.0438 Ops/s | 51.6723 Ops/s | |
test_to_module_speed[True] | 1.5180ms | 0.9605ms | 1.0412 KOps/s | 1.0202 KOps/s | |
test_to_module_speed[False] | 1.0114ms | 0.9437ms | 1.0596 KOps/s | 1.0466 KOps/s | |
test_tc_init | 70.9210μs | 32.7712μs | 30.5146 KOps/s | 28.3256 KOps/s | |
test_tc_init_nested | 0.1086ms | 66.0283μs | 15.1450 KOps/s | 14.4157 KOps/s | |
test_tc_first_layer_tensor | 29.2710μs | 0.9238μs | 1.0825 MOps/s | 1.0377 MOps/s | |
test_tc_first_layer_nontensor | 30.3310μs | 2.4168μs | 413.7702 KOps/s | 406.7062 KOps/s | |
test_tc_second_layer_tensor | 33.5910μs | 1.7998μs | 555.6137 KOps/s | 550.0382 KOps/s | |
test_tc_second_layer_nontensor | 31.5600μs | 3.2979μs | 303.2227 KOps/s | 296.7729 KOps/s | |
test_unbind | 0.2162s | 11.9905ms | 83.3996 Ops/s | 143.1893 Ops/s | |
test_full_like | 4.5305ms | 4.3509ms | 229.8356 Ops/s | 112.2124 Ops/s | |
test_zeros_like | 9.2803ms | 7.2904ms | 137.1670 Ops/s | 230.7453 Ops/s | |
test_ones_like | 5.5364ms | 4.3334ms | 230.7675 Ops/s | 229.7837 Ops/s | |
test_clone | 6.9280ms | 6.3645ms | 157.1210 Ops/s | 109.9239 Ops/s | |
test_squeeze | 59.7610μs | 9.3902μs | 106.4940 KOps/s | 101.0200 KOps/s | |
test_unsqueeze | 0.1224ms | 70.8649μs | 14.1114 KOps/s | 13.5869 KOps/s | |
test_split | 0.4026ms | 0.1513ms | 6.6086 KOps/s | 6.1939 KOps/s | |
test_permute | 0.2351ms | 0.1783ms | 5.6077 KOps/s | 5.4178 KOps/s | |
test_stack | 50.6945ms | 50.4853ms | 19.8077 Ops/s | 19.9291 Ops/s | |
test_cat | 52.4697ms | 50.4589ms | 19.8181 Ops/s | 19.9341 Ops/s |
Labels
CLA Signed
versioning
Stack from ghstack (oldest at bottom):