We have made built the Omniperf without GUI support for use in the exercises
Run srun -n 1 --gpus 1 omniperf profile -h
to see all the options
Now is created a workload in the directory workloads with the name dgemmoh I mean for (the argument of the -n). So, we can analyze it
If you want to only roofline analysis, then execute: srun -n 1 --gpus 1 omniperf profile -n dgemm --roof-only -- ./dgemm -m 8192 -n 8192 -k 8192 -i 1 -r 10 -d 0 -o dgemm.csv
If tou want to know the kernel names, it creates a second pdf with the markers and corresponding names, then run: srun -n 1 --gpus 1 omniperf profile -n dgemm --kernel-names --roof-only -- ./dgemm -m 8192 -n 8192 -k 8192 -i 1 -r 10 -d 0 -o dgemm.csv
There is no need for srun to analyze but we want to avoid everybody to use the login node. Explore the file dgemm_analyze.txt
But you need to know the code of the IP Block
Open the web page: http://IP:8050/ The IP will be displayed in the output
Use another cod, for example: https://github.com/amd/HPCTrainingExamples/blob/main/HIP/saxpy/saxpy.cpp