Abstract: Non-volatile computing-in-memory (nvCIM) can potentially meet the ever-increasing demands on improving the energy efficiency (EF) for intelligent edge devices. However, it still suffers from ...
Tutorials contains the code accompanying the HIP Tutorials that can be found in the HIP documentation. For a full overview of the examples, see the section repository contents. Alternatively, instead ...
Abstract: The multiplication of a sparse matrix with a dense vector (SpMV) is a key component in many numerical schemes and its performance is known to be severely limited by main memory access.
On a B200, the nvjet_tst_16x64_64x16_4x1_v_bz_TNN kernel is used, and it takes roughly 8.1 microseconds. On an H200, the nvjet_tst_64x8_64x16_4x1_v_bz_TNT kernel is ...