Advisor: Difference between revisions

Revision as of 19:41, 17 December 2021

Intel® Advisor XE is a threading assistant for C, C++, C# and Fortran. It guides developers through threading design, automating analyses required for fast and correct implementation.

It helps developers to add parallelism to their existing C/C++ or Fortran programs. You can use the Intel Advisor XE to:

Find the most time-consuming serial code regions in your program.
Insert Intel Advisor XE annotations to identify these as possible parallel code regions.
Predict the approximate parallel performance characteristics of the proposed parallel code regions.
Check for data sharing problems that could prevent the application from working correctly when parallelized.


Developer:	Intel
Platforms:
Category:	Performance Analyzer
License:	Commercial
Website:	Intel® Advisor XE homepage

Why Intel Advisor?

Before checking the parallel efficiency of an application, it is necessary to understand how the application behaves at the core and node level. For example,

whether it is memory bound or compute bound

how good is the vectorization

how is the memory access pattern

whether there are dependencies hindering vectorization

where different loops/functions lie on the Roofline plot etc.

Intel Advisor not only provide answers to all the above-mentioned queries, but also suggests solutions, for example, what kind of optimizations one needs to implement in order to improve the performance of the application.

How to use Intel Advisor?

First compile your application with an additional "-g" flag. Then, set up an environment for the Advisor by loading the corresponding module.

For example, on Hawk

 module load advisor

On Vulcan

 module load performance/advisor

If you have installed Intel oneAPI on your laptop then,

 source /opt/intel/oneapi/setvars.sh

Running Advisor on OpenMP parallel application

Select the number of OpenMP threads as,

 export OMP_NUM_THREADS=num_of_threads

and bind them as,

 export OMP_PROC_BIND=spread

Afterwards, collect survey, tripcounts and flops as follows,

  advixe-cl -collect survey -project-dir results_advisor  ./a.out

  advixe-cl -collect tripcounts -flop -project-dir results_advisor  ./a.out

Results can be visualized using Advisor GUI,

  advixe-gui results_advisor/e000/e000.advixeexp

Visualizing results on Hawk could be slow, one may thus alternatively pack up all the results in a read-only file as follows

  advixe-cl --snapshot --project-dir=results_advisor --cache-sources path_to_source_code --cache-binaries path_to_binary

Above command will create a file snapshot000.advixeexpz which can be easily copied to the local machine and can be viewed in GUI as,

  advixe-gui snapshot000.advixeexpz

Running Advisor on MPI parallel application

Collect survey, tripcounts and flops as follows,

  mpirun -np $num_of_mpi_tasks advixe-cl -collect survey -project-dir results_advisor  ./a.out

  mpirun -np $num_of_mpi_tasks advixe-cl -collect tripcounts -flop -project-dir results_advisor  ./a.out

Above command will create Advisor reports for all the ranks. In case, one would like to run Advisor only on the single rank, then do the following,

  mpirun -np $num_of_mpi_tasks-1 ./a.out : -np 1 advixe-cl -collect survey -project-dir results_advisor  ./a.out

  mpirun -np $num_of_mpi_tasks-1 ./a.out : -np 1 advixe-cl -collect tripcounts -flop -project-dir results_advisor  ./a.out

Running Advisor on MPI+OpenMP parallel application

The following example employs 32 MPI tasks distributed uniformly over both the sockets with 2 OpenMP threads per MPI tasks on a Hawk node.

  module load mpt
  export MPI_SHEPHERD=1 
  export MPI_DSM_CPULIST=0-127/2:allhosts
  export OMP_NUM_THREADS=2
  export OMP_PROC_BIND=close
  export MPI_OPENMP_INTEROP=1

One can then run Advisor, same as described in the above section.

  mpirun -np $num_of_mpi_tasks-1 ./a.out : -np 1 advixe-cl -collect survey -project-dir results_advisor  ./a.out

  mpirun -np $num_of_mpi_tasks-1 ./a.out : -np 1 advixe-cl -collect tripcounts -flop -project-dir results_advisor  ./a.out

Additional analysis - memory access pattern and dependencies

While visualizing the results, Advisor might suggest performing additional analysis like memory access pattern and dependencies. One may collect the same, for example, as follows,

  mpirun -np $num_of_mpi_tasks-1 ./a.out : -np 1 advixe-cl -collect map -project-dir results_advisor  ./a.out

  mpirun -np $num_of_mpi_tasks-1 ./a.out : -np 1 advixe-cl -collect dependencies -project-dir results_advisor  ./a.out

Note that above analysis is possible only after collecting survey and tripcounts.

External links

Advisor: Difference between revisions

Revision as of 19:41, 17 December 2021

Contents

Why Intel Advisor?

How to use Intel Advisor?

Running Advisor on OpenMP parallel application

Running Advisor on MPI parallel application

Running Advisor on MPI+OpenMP parallel application

Additional analysis - memory access pattern and dependencies

See also

External links

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools