Intel® Software Development Tools for HPC (Webinar)

Europe/Berlin
Andre Sternbeck (Friedrich-Schiller-University Jena), Henning Schwanbeck (Technische Universität Ilmenau)
Description

The first day virtual course (webinar) will start with a short update on the latest publicly available Intel Hardware Architectures and introduce the Intel Software development tools starting the various components of Intel Parallel Studio XE suites and the upcoming oneAPI development framework supporting heterogenous Intel environments

Intel HPCThe second day virtual course (webinar) will focus on demonstrating Intel Software Analysis Tools. We will start with the tools for shared memory environments (Intel VTune Profiler and Intel Advisor) and we show the high-level lightweight Application Performance Snapshot tool. Later we turn our attention to shared memory apps using Intel MPI for building and the Intel Trace Analyzer and Collector for analyzing MPI based applications. As time allows new features of the oneAPI development framework are shown.

Prerequisites:

  • Basic knowledge of C/C++, Fortran and parallel programming with OpenMP or MPI
  • This event uses Cisco Webex Meetings. Webex details will be distributed to registered participants.

Note the other webinar on the Intel Development Tools for Artificial Intelligence on November 17.

Surveys
Intel HPC Tools Workshop Day 1
Intel HPC Tools Workshop Day 2
  • Wednesday, 11 November
    • 10:00 10:15
      Welcome and Introduction 15m
    • 10:15 12:00
      Introduction to Intel® Tools: Morning Session
      • 10:15
        The Intel® Architecture for Software Developers 45m

        This session will offer insights into Intel® hardware platforms tailored to the needs of software developers, software architects and HPC experts. Learn how Intel® Software Development Tools will help you to achieve optimal performance via vectorization, memory access tuning, and threading.

      • 11:00
        Intel® oneAPI – the Future of Intel Software Development 1h

        Modern workloads are incredibly diverse—and so are architectures. No single architecture is best for every workload. Maximizing performance takes a mix of scalar, vector, matrix, and spatial (SVMS) architectures deployed in CPU, GPU, FPGA, and other future accelerators. Intel® oneAPI products will deliver the tools you need to deploy your applications and solutions across SVMS architectures.

    • 12:00 13:00
      Lunch Break 1h
    • 13:00 16:30
      Introduction to Intel® Tools: Afternoon Session
      • 13:00
        Intel® Compilers – an Overview - with demos 1h 30m

        C++ and Fortran compilers from Intel produce optimized code that takes advantage of the ever-increasing core count and vector register width in Intel® processors. You will learn most important compiler optimizations helping to boost the performance of your application. A short demo will complement the talk and show a possible speed-up you can gain by controlling compiler options only.

      • 14:30
        Intel® MKL – The Math Kernel Library - with demos 1h

        Intel Math Kernel Library is a key element in many applications for scientific, engineering, and data applications and many others. The session will cover the newest features implemented into the latest version and some of the newest coming updates. We will also describe to you the most efficient ways how to use Intel MKL properly and how to get the best performance out of your MKL applications.

      • 15:30
        Q&A -- Try yourself 1h

        During this session you can try the Intel tools yourself on your favorite cluster or workstation. The Intel trainers are online and will answer your questions.

  • Thursday, 12 November
    • 10:00 10:15
      Welcome to Day 2 15m
    • 10:15 12:00
      Optimizing your Software: Morning Session
      • 10:15
        Intel® MPI Library 1h

        Get the best performance out of your MPI application by leveraging the Intel MPI library. Besides a general introduction on how to use Intel MPI properly, approaches for tuning the Intel MPI library will be shown.

      • 11:15
        Application Performance Snapshot - Intel® VTune™ Profiler 45m

        Intel Application Performance Snapshot (APS) is the first step in a systematic analysis using Intel tools. APS identifies the hotspots of an application and recommends the most appropriate Intel tool for detailed analysis. A small demo will explain the basic steps of APS usage and result interpretation

    • 12:00 13:00
      Lunch Break 1h
    • 13:00 17:00
      Optimizing your Software: Afternoon Session
      • 13:00
        Intel® Advisor - Optimize high-performing code on modern computer architectures- with demos 1h

        SIMD (Single Instruction Multiple Data) instructions are crucial for optimal performance on modern Intel CPUs. On x86 architecture these extensions are known as SSE, AVX, AVX-2 and AVX-512. In general developers will rely on the compiler vectorization capabilities. But the automatic vectorization often needs support from the developer to deal with dependencies, alignment, data type conversions and other topics we will cover here. Compiler optimization reports provide some analysis but don’t give much guidance. Intel Advisor XE delivers an easy and precise way of vectorization analysis plus an evaluation of maximal potential performance for each loop (Roofline Analysis).

      • 14:00
        Intel® VTune™ Profiler - Locate performance bottlenecks fast - with demos 1h

        Without the right data, you’re guessing about how to improve software performance and are unlikely to make the most effective improvements. The VTune Profiler collects key profiling data and presents it with a powerful interface that simplifies its analysis and interpretation. This talk will provide a general introduction to VTune while diving deeper using a memory bandwidth analysis example.

      • 15:00
        Intel® Trace Analyzer and Collector - MPI profiling for cluster applications -with Demos 1h

        Intel Trace Analyzer and Collector provides a detailed analysis and visualization of MPI usage in your application. The graphical display will help to understand bottlenecks of the application. It can also help to understand an unknown MPI algorithm. Another mode of ITAC is called Message-Checker (mc).  Running your application with mc will produce error messages or warnings when MPI related issues are detected. Both modes of ITAC are easy to be used but configuring for trace file reduction and user events are more advanced topics.

      • 16:00
        Q&A -- Try yourself 1h

        During this session you can try the Intel tools yourself on your favorite cluster or workstation. The Intel trainers are online and will answer your questions.