Overview Venue : CMSD, UoH Key-Note/Invited Talks Faculty / Speakers Proceedings Downloads Past Tech. Workshops Target Audience Benefits Organisers Accommodation Local Travel Sponsors Feedback Acknowledgements Contact Home

Topics of Interest Tech. Prog. Schedule Topic : Multi-Core Topic : ARM Proc. Topic : Coprocessors Topic : GPGPUs Topic : HPC Cluster Topic : App. Kernels. Topic : Lab. Session Key-Note / Invited Talks Home

Mode-1 Multi-Core Memory Allocators OpenMP Intel TBB Pthreads Java - Threads Charm++ Prog. Message Passing (MPI) MPI - OpenMP MPI - Intel TBB MPI - Pthreads Compilers - Opt. Features Threads-Perf. Math. Lib. Threads-Prof. & Tools Threads - I/O Perf. PGAS : UPC / CAF/ GA Power & Perf. Home

Mode-2 ARM Prog. Env Benchmarks Power & Perf. Home

Mode-3 Coprocessors Arch. Software Compiler & Vect. Prog. Env. Benchmarks Power & Perf. Home

Mode-4 GPGPUs NVIDIA - CUDA/OpenCL AMD APP - OpenCL GPGPUs - OpenCL GPGPUs : Power & Perf. Home

Mode-5 HPC Cluster HPC MPI Cluster GPU Cluster - NVIDIA GPU Cluster - AMD APP Cluster - Intel Coprocessors Cluster- Power & Perf. Home

Mode-6 App. Kernels PDE Solvers : FDM/FEM Image Processing - FFT Monte Carlo Methods String Srch. Seq. Analy. Video Process. Intr. Detcn. Sys App. Power & Perf. Home

Reg. Overview Pvt. Sector Pub. Sector Govt. Acad. Staff Students Reg. On-line Reg. Accommodation Contact Home

hyPACK-2013 Overview

Downloadable hypack 2013 Poster/Brochure

hypack-2013 Schedule & Tech. Prog. at Glance (pdf)

Center for Development of Advanced Computing (C-DAC) Pune and Centre for Modelling Simulation and Design (CMSD), High-Performance Computing (HPC) Facility University of Hyderabad, Hyderabad are jointly organizing Four days technology workshop on “ Hybrid Computing - Coprocessors & Accelerators - Power-aware Computing & Performance of Application Kernels (hyPACK-2013) ”.

hyPACK-2013 objective is to understand power-aware performance issues of various scientific application kernels and computational mathematics on parallel processing platforms such as computing systems with Intel Xeon-Phi Coprocessors and NVIDIA /AMD GPU accelerators as well as ARM processor based Linux multi-core processor systems. The aim is to achieve the best performance (turnaround time & throughput) and the total power consumption, a device or a system needs in order to solve a problem of given size in High Performance Computing (HPC) application kernels. The focus is to integrate different programming paradigms such as Pthreads, OpenMP 4.X, Intel TBB, Cilk Plus, MPI, Intel Xeon-Phi Offload Pragmas, MPI-OpenMP, & NVIDIA CUDA, OpenACC, OpenCL and extract the best achieved performance for application kernels on systems with coprocessors and accelerators. The workshop gives an opportunity to write, execute and demonstrate computational mathematics and application kernels using different programming paradigms. The workshop is aimed to cover classroom lectures in morning/forenoon session and four hours hands-on in afternoon session on every day.

Topics of interest include the following but not limited to:

Day 1 & Day-2 HPC Cluster - Intel Xeon/Phi coprocessors; ARM multi-core processor & HPC Cluster

Programming on Intel Xeon-Phi Coprocessors; Xeon-Phi Coprocessor usage model : MPI vesus Offload; Compiler and Programming model; Approaches to Vectorization - Complier Offload Pragmas & Directives; Programming Paradigms - OpenMP-4.0, Intel TBB, Intel Cilk Plus, Intel MKL, MPI
Intel Xeon-Phi Coprocessor Architecture; Linux OS on Coprocessor; Coprocessor System software; Tuning Memory Allocation Performance - Huge Page Sizes; Profiling & Tuning Tools- PAPI & MPI tools
Tuning and Performance Issues- Power Consumption for Application Kernels; Measurement of Power Consumption - External Power-Off-Meter; Application Kernels; Programming on ARM processor multi-core processor systems; Energy Efficiency & Performance Issues

Day 3 & Day-4: HPC Cluster - NVIDIA GPUs and AMD GPUs, ARM multi-core processors & HPC Cluster with coprocessors & Accelerators

An Overview of CUDA enabled NVIDIA GPUs : CUDA SDK/APIs; CUDA - Optimization & Performance Issues; Efficient use of different memory types, Libraries-CUBLAS, CUFFT, CUSPARSE; CUDA-OpenACC APIs; Programming - OpenCL; CUDA NVIDIA GPU Cluster
An Overview of AMD Accelerated Parallel Processing (APP) Capabilities; AMD APUs - OpenCL Prog. on Multi-Core CPUs & Multi-GPUs; AMD APP Math Libraries - BLAS & FFTs; AMD APP SDK, AMD tools - Aparapi AP; AMD OpenCL tuning - performance; HPC AMD GPU Cluster: Host CPU (Pthreads, OpenMP, MPI) with OpenCL on AMD GPUs; GPU Cluster - Health Monitoring & Efficient use of AMD GPUs using OpenCL
Programming on ARM Processor multi-core systems; power-aware performance Issues on ARM Multi-Coprocessor systems; Prog. on carma - NVIDIA CUDA on ARM Development Kit
An Overview of FPGA Device Systems; Energy Efficiency - Power-Off Meters and NVML Libraries - Health Monitoring - NVML Power Efficient API - Performance Issues; Efficient use of GPUs in Cluster; Open Source Software using GPUs - MAGMA, & Top-500 Benchmarks

Application Kernels :

Mixed Programming for Numerical /Non-Numerical Computations on multi-core processors with Intel Xeon-Phi coprocessors - and NVIDIA /AMD GPU accelerators and ARM processor systems; Application & System Benchmarks & Performance; Image Processing Applications - Bio-Informatics - String Search Algorithms & Sequence Analysis; Dense /Sparse Matrix Computations on HPC GPU Cluster; Solution of Partial Differential Eqs. (FDM & FEM); FFT Libraries; Invited lectures on Information Sciences; Computational Physics

hypack-2013 Registration

October 15, 2013 8:30 AM - 9:00 AM

The High-Performance Computing - Frontier Technologies Exploration (HPC-FTE) Group Members, c-DAC, Pune & techncial and administrative Staff Centre for Modelling Simulation and Design (CMSD), University of Hyderabad are involved for hyPACK-2013 technology workhsop activities.