- Infos im HLRS Wiki sind nicht rechtsverbindlich und ohne Gewähr -

Difference between revisions of "VTune"

From HLRS Platforms
Jump to navigationJump to search
Line 11: Line 11:
 
** C/C++/C#, Fortran, Python and others
 
** C/C++/C#, Fortran, Python and others
  
(Formerly Intel® VTune™ Performance Analyzer with Intel® Thread Profiler)
 
 
| logo = [[Image:intel-logo.png]]
 
| logo = [[Image:intel-logo.png]]
 
| developer              = Intel
 
| developer              = Intel
| available on      =  
+
| available on      = Hawk, Vulcan
 
| category                  = [[:Category:Performance Analyzer | Performance Analyzer]]
 
| category                  = [[:Category:Performance Analyzer | Performance Analyzer]]
 
| license                = Commercial
 
| license                = Commercial
| website                = [http://software.intel.com/en-us/intel-vtune/ Intel® VTune™ Amplifier XE homepage]  
+
| website                = [https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/vtune-profiler.html Intel® VTune™ Profiler homepage]  
 
}}  
 
}}  
  
=== Using Intel VTune on Nehalem cluster ===
+
=== Using Intel VTune ===
 
+
To perform the performance analyse of your application with VTune you don’t need special compiler wrapper or libraries. Just recompile and relink your code with extra –g option in order to include debug information. VTune works well for dynamically linked binaries. [https://software.intel.com/content/www/us/en/develop/documentation/vtune-help/top/set-up-analysis-target/linux-targets/analyzing-statically-linked-binaries-on-linux-targets.html Here] you can find some tips for statically linked binaries
Load the necessary module. For example:
+
Example:
 
<pre>
 
<pre>
 
module load compiler/intel  
 
module load compiler/intel  

Revision as of 16:11, 4 June 2021

Intel® VTune™ Profiler is a performance analysis tool for serial and multithreaded applications. Use VTune Profiler:
  • to determine the most time-consuming (hot) functions in your application
  • to discover on very fine-grained level which section of code (loop, source code line or even data field) do not effectively utilize available processor time
  • to analyse communication behavior that affects threaded performance
  • for accurate analysis with low overhead
  • for serial and multithreaded applications
    • MPI (MPT, OpenMPI)
    • OpenMP, Intel® oneAPI Threading Building Blocks, native threads
  • Languages:
    • C/C++/C#, Fortran, Python and others
Intel-logo.png
Developer: Intel
Platforms: Hawk, Vulcan
Category: Performance Analyzer
License: Commercial
Website: Intel® VTune™ Profiler homepage


Using Intel VTune

To perform the performance analyse of your application with VTune you don’t need special compiler wrapper or libraries. Just recompile and relink your code with extra –g option in order to include debug information. VTune works well for dynamically linked binaries. Here you can find some tips for statically linked binaries Example:

module load compiler/intel 
module load performance/vtune # set up VTune environment
module load mpi/impi          # if MPI needed 

Compilation example:

ifort -g -O2 prog.f90

Analyzing MPI applications:

 mpirun –n4 amplxe-cl -c hotspots –r my_result -- path_to_my_app

Using Intel VTune on Cray machines

Load the necessary module. For example:

module swap PrgEnv-cray PrgEnv-intel # set up MPI environment too 
module load performance/vtune        # set up VTune environment

Compilation example:

ftn -dynamic -g -O2 prog.f90

Analyzing MPI applications:

 aprun –n4 amplxe-cl -c hotspots –r myResult-@@@{at} -- path_to_my_app

See also

External links