Tutorials contains the code accompanying the HIP Tutorials that can be found in the HIP documentation. For a full overview of the examples, see the section repository contents. Alternatively, instead ...
Abstract: The multiplication of a sparse matrix with a dense vector (SpMV) is a key component in many numerical schemes and its performance is known to be severely limited by main memory access.
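To make the memory-bound claim concrete, here is a minimal sketch of SpMV in plain Python. It assumes the compressed sparse row (CSR) layout, which the abstract does not name; the function name `spmv_csr` and the small example matrix are illustrative only. Each stored nonzero requires loading a value, a column index, and an entry of the dense vector, yet contributes only one multiply and one add, which is why memory traffic rather than arithmetic tends to dominate.

```python
def spmv_csr(row_ptr, col_idx, values, x):
    """Compute y = A @ x for a matrix A stored in CSR format (assumed layout)."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        # Nonzeros of row i occupy positions row_ptr[i] .. row_ptr[i + 1] - 1.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # Per nonzero: load values[k], col_idx[k], x[col_idx[k]]; do 2 flops.
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y


# Illustrative 3x3 matrix (not taken from the source):
# [[4, 0, 1],
#  [0, 2, 0],
#  [3, 0, 5]]
row_ptr = [0, 2, 3, 5]
col_idx = [0, 2, 1, 0, 2]
values = [4.0, 1.0, 2.0, 3.0, 5.0]
x = [1.0, 2.0, 3.0]

y = spmv_csr(row_ptr, col_idx, values, x)
print(y)
```

The access `x[col_idx[k]]` is indirect and irregular, so it often caches poorly on top of the already low ratio of arithmetic to data moved.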
On a B200, the nvjet_tst_16x64_64x16_4x1_v_bz_TNN kernel is used, and it takes roughly 8.1 microseconds. On an H200, the nvjet_tst_64x8_64x16_4x1_v_bz_TNT kernel is ...