You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: profilers/README.md
+6-7
Original file line number
Diff line number
Diff line change
@@ -59,10 +59,9 @@ The following terms are used in this blog post:
59
59
|[AMD "Zen" Core](https://www.amd.com/en/technologies/zen-core)| AMD's x86-64 processor core architecture design. Used by the AMD EPYC™, AMD Ryzen™, AMD Ryzen™ PRO, and AMD Threadripper™ PRO processor series.|
60
60
|[RDNA™](https://www.amd.com/en/technologies/rdna)| AMD's Traditional GPU architecture optimized for graphically demanding workloads like gaming and visualization. Includes the RX 5000, 6000 and 7000 GPUs. |
61
61
|[CDNA™](https://www.amd.com/en/technologies/cdna)| AMD's Compute dedicated GPU architecture optimized for accelerating HPC, ML/AI, and data center type workloads. Includes the AMD Instinct™ MI50/60, MI100, and MI200 series accelerators.|
|[HIP](https://docs.amd.com/bundle/HIP-Programming-Guide-v5.3/page/Introduction_to_HIP_Programming_Guide.html)| A C++ Runtime API and kernel language that allows developers to create portable compute kernels/applications for AMD and NVIDIA GPUs from a single source code |
64
-
| Timeline Trace | A profiling approach where durations of compute kernels and data transfers between devices are collected and visualized |
65
-
|[Roofline Analysis](https://enccs.github.io/AMD-ROCm-development/hierarchical_roofline/)| Hardware agnostic methodology for quantifying a workload's ability to saturate the given compute architecture in terms of floating-point compute and memory bandwidth |
62
+
|[HIP](https://rocm.docs.amd.com/projects/HIP/en/latest/user_guide/programming_manual.html)| A C++ Runtime API and kernel language that allows developers to create portable compute kernels/applications for AMD and NVIDIA GPUs from a single source code |
63
+
|[Timeline Trace](https://amdresearch.github.io/omnitrace/output.html#perfetto-output)| A profiling approach where durations of compute kernels and data transfers between devices are collected and visualized |
64
+
|[Roofline Analysis](https://amdresearch.github.io/omniperf/profiling.html#standalone-roofline)| Hardware agnostic methodology for quantifying a workload's ability to saturate the given compute architecture in terms of floating-point compute and memory bandwidth |
66
65
| Hardware Counters | Individual metrics which track how many times a certain event occurs in the hardware, such as bytes moved from L2 cache or a 32 bit floating point add performed |
67
66
68
67
## What tools to use?
@@ -223,7 +222,7 @@ and collecting counters during the execution.
223
222
224
223
The `rocprof` utility also depends on the
225
224
[ROC-tracer](https://github.com/ROCm-Developer-Tools/roctracer) and
Copy file name to clipboardExpand all lines: rocm-installation/README.md
+17-15
Original file line number
Diff line number
Diff line change
@@ -23,9 +23,11 @@ SOFTWARE.
23
23
24
24
AMD ROCm™ is the first open-source software development platform for HPC/Hyperscale-class GPU computing. AMD ROCm™ brings the UNIX philosophy of choice, minimalism and modular software development to GPU computing. Please see the AMD [Open Software Platform for GPU Compute](https://www.amd.com/en/graphics/servers-solutions-rocm) and [ROCm Informational Portal](https://rocm.docs.amd.com/) pages for more information.
25
25
26
+
More detailed information on ROCm installation is available on the [Deploy ROCm on Linux](https://rocm.docs.amd.com/en/latest/deploy/linux/index.html) page.
27
+
26
28
Installation of the AMD ROCm™ software package can be challenging without a clear understanding of the pieces involved and the flow of the installation process. This introductory material shows how to install ROCm on a workstation with an AMD GPU card that supports the AMD GFX9 architecture. A follow on blog will discuss installing ROCm in other environments, such as a Docker Container, Linux LXC or a full HPC installation.
27
29
28
-
The website [https://rocm.docs.amd.com](https://rocm.docs.amd.com) contains links to the Release, Support and API documentation for ROCm. Please refer to the [Installation Guide](https://rocm.docs.amd.com/en/latest/deploy/linux/) and [Hardware / Software Support Guide](https://rocm.docs.amd.com/en/latest/release/gpu_os_support.html) for the software and hardware supported by the V 5.4 release of ROCm. This post will be based on an Ubuntu 20.04 operating system and the AMD MI50 GPU card. The full installation process is documentated in the [Installation Guide](https://rocm.docs.amd.com/en/latest/deploy/linux/).
30
+
The website [https://rocm.docs.amd.com](https://rocm.docs.amd.com) contains links to the Release, Support and API documentation for ROCm. Please refer to the [Installation Guide](https://rocm.docs.amd.com/en/latest/deploy/linux/) and [Hardware / Software Support Guide](https://rocm.docs.amd.com/en/latest/release/gpu_os_support.html) for the software and hardware supported by the V 5.7 release of ROCm. This post will be based on an Ubuntu 22.04 operating system and the AMD MI (MI210, MI250, etc.) series GPU card. The full installation process is documentated in the [Installation Guide](https://rocm.docs.amd.com/en/latest/deploy/linux/).
29
31
30
32
AMD ROCm™ is a brand name for the ROCm open software platform supporting GPUs using AMD's CDNA, and RDNA GPU architectures. The platform includes drivers and runtimes for libraries and developer tools.
31
33
@@ -37,12 +39,12 @@ AMD ROCm™ is a brand name for the ROCm open software platform supporting GPUs
37
39
38
40
## Option 1
39
41
40
-
AMD provides an installation script for specific operating system and ROCm versions. The script name and download location can be different for each combination of O/S and ROCm so check the [How to Install page](https://rocm.docs.amd.com/en/latest/deploy/linux/installer/install.html) for your specific combination. We are using Ubuntu 20.04 and installing ROCm 5.4.3 and find that the script is named *amdgpu-install_5.4.50403-1_all*.
42
+
AMD provides an installation script for specific operating system and ROCm versions. The script name and download location can be different for each combination of O/S and ROCm so check the [How to Install page](https://rocm.docs.amd.com/en/latest/deploy/linux/installer/install.html) for your specific combination. We are using Ubuntu 22.04 and installing ROCm 5.7.1 and find that the script is named *amdgpu-install_5.7.50701-1_all*.
Once the amdgpu-install script has been extracted, it can be used to install the kernel code, libraries and developer code. For a typical HPC environment HIP, ROCm and the kernel drivers should be sufficient:
48
50
```bash
@@ -67,27 +69,27 @@ To remove all old versions use:
67
69
sudo amdgpu-uninstall --rocmrelease=all
68
70
```
69
71
70
-
AMD provides an installation script for specific operating system and ROCm versions. The script name and download location can be different for each combination of O/S and ROCm so check the [How to Install page](https://rocm.docs.amd.com/en/latest/deploy/linux/installer/install.html) for your specific combination. We are using Ubuntu 20.04 and installing ROCm 5.4 and find that the script is named *amdgpu-install_5.4.50400-1_all*.
72
+
AMD provides an installation script for specific operating system and ROCm versions. The script name and download location can be different for each combination of O/S and ROCm so check the [How to Install page](https://rocm.docs.amd.com/en/latest/deploy/linux/installer/install.html) for your specific combination. We are using Ubuntu 22.04 and installing ROCm 5.7.1 and find that the script is named *amdgpu-install_5.7.50701-1_all*.
Once the amdgpu-install script has been extracted, it can be used to install the kernel code, libraries and developer code.
78
-
The steps below will install the kernel driver code at level 5.4.0 and the libraries at level 5.4.0, 5.3.2 and 5.2.3. For a typical HPC environment, HIP and the ROCm libraries should be sufficient:
80
+
The steps below will install the kernel driver code at level 5.7.1 and the libraries at level 5.7.0 and 5.7.1. For a typical HPC environment, HIP and the ROCm libraries should be sufficient:
0 commit comments