plan.txt

   1 Chapter 1: Presentation of the GPU architecture
   2 Author: Raphaël Couturier: University of Franche-Comte, France
   3 This chapter will introduce the GPU architecture and the classical model proposed by CUDA. All backgrounds necessary for the remainder of the book will be first presented here.
   4
   5
   6 Chapter 2: Simple examples with CUDA
   7 Author: Raphaël Couturier: University of Franche-Comte, France
   8
   9
  10 **** Part : image processing
  11
  12 Chapter 3: Fast kernels for image and signal processing
  13 Authors: Gilles Perrot, Raphaël Couturier and Stéphane Domas: University of Franche-Comte, France, Nicolas Bertaux, University of Aix-Marseille, France
  14 In this chapter, we will introduce and present many kernels that can drastically enhance signal and image processing algorithms. Although these kernels seem to be very common, they have not yet been well described in the literature.
  15
  16
  17 Chapter 4: Region Based Algorithm for Large Images Segmentation on GPU
  18 Authors: Gilles Perrot, Raphaël Couturier and Stéphane Domas: University of Franche-Comte, France, Nicolas Bertaux, University of Aix-Marseille, France
  19 In this chapter, we will present an algorithm for region-based active contour techniques (snakes) as they seem to achieve a high level of robustness and fit with a large range of applications.
  20
  21
  22
  23
  24
  25 ****  Part : Software Development
  26
  27
  28 Chapter 5: On the development of high-performance software library for emerging architectures: design and analysis
  29 Authors: Allan P. Engsig-Karup, Bernd Dammann, Jeppe R. Frisvad and Stefan Lemvig: Technical University of Denmark, Denmark
  30 This chapter will present performance portable tuning techniques via a modern parallel programming. Then it will focus on efficient and scalable iterative methods for solution of high-order numerical methods and strategies for efficient implementations on desktop architectures.
  31
  32
  33 Chapter 6: Pertinence and development methodologies for GPU and cluster of GPU
  34 Authors: Sylvain Contassot-Vivier: University of Nancy, France, Stéphane Vialle, Supélec, Metz, France
  35 This chapter proposes to draw the main frontiers of the fields of applicability of GPU acceleration as well as development methodologies to obtain efficient codes in classical scientific applications.
  36
  37
  38
  39 Chapter 7: Fast GPU-accelerated desktop application
  40 Authors: Allan P. Engsig-Karup, Bernd Dammann, Jeppe R. Frisvad and Stefan Lemvig: Technical University of Denmark, Denmark
  41 This chapter will present discussions, analysis and highlights of a new massively parallel engineering tool for nonlinear free surface flows intended for both engineering analysis and interactive real-time computing, e.g. for applications in coastal and offshore engineering and first of its kind physics-based ship simulation.
  42
  43
  44
  45
  46
  47
  48 **** Part : Optimization
  49
  50 Chapter 8: GPU-accelerated Tree-based Exact Optimization Methods
  51 Authors:  Imen Chakroun, Nouredine Melab and El-Ghazali Talbi: INRIA Lille, France
  52 This chapter will present the latest techniques and algorithms for solving tree-based exact optimization methods on GPU.
  53
  54 Chapter 9: Parallel Meta-heuristics for Solving Challenging Problems on GPU Accelerators
  55 Authors:  Thé Van Luong, Nouredine Melab and El-Ghazali Talbi: INRIA Lille, France
  56 This chapter will describe parallel metaheuristics for solving complex problems in science and industry. This work is based on local search metaheuristics.
  57
  58 Chapter 10: Linear programming on a GPU: a study case based on the simplex method and the branch-cut-and bound algorithm
  59 Authors: Paul Albuquerque: HES-SO, Geneva, Switzerland, Xavier Meyer and Bastien Chopard: University of Geneva, Switzerland
  60 This chapter will address the main issues related to programming the simplex method on a GPU. Then it will present how to integrate this GPU-based simplex method in a branch-cut-and-bound framework which will take place between the CPU and the GPU.
  61
  62 Chapter 11: Performing large scale robust regression on GPUs
  63 Authors: Gleb Beliakov and Gang Li: Deakin University, Melbourne, Australia
  64 In this chapter we will report on the use of GPUs for large scale robust data analysis. Identification of outliers in large multivariate data sets is difficult, because outliers shift regression models in their direction so much that they become undetectable by their residuals.
  65
  66
  67
  68 ***** Part : Numerical applications
  69
  70 Chapter 12: Sparse linear system solvers with the GMRES method on gpu clusters
  71 Authors: Lilia Ziane Khodja, Raphaël Couturier and Jacques Bahi: University of Franche Comte, France
  72 In this chapter, the adaptation of the GMRES method will be presented and several techniques (compression, partitioning …) allowing to increase the scalability of this algorithm for GPU cluster will be described.
  73
  74
  75 Chapter 13: Parallel solution of the Obstacle problem on GPU clusters
  76 Authors: Lilia Ziane Khodja, Raphaël Couturier and Jacques Bahi: University of Franche Comte, France, Ming Chau and Pierre Spiteri: University of Toulouse, France
  77 This chapter is devoted to the implementation of the Obstable problem on GPU clusters. This problem is a non linear PDE occurring in financial mathematics (option pricing) and constrained structure mechanics. Synchronous and asynchronous implementations will be analyzed.
  78
  79 Chapter 14: Complex fluid lattice Boltzmann on GPU clusters
  80 Authors: Kevin Stratford and Alan Gray: University of Edinburg, United Kingdom
  81 This chapter will present a complex fluid lattice Boltzmann application such that it can scale and perform excellently on large-scale GPU clusters.
  82
  83 Chapter 15: Deployment on GPU of an atomic physics program
  84 Authors: Pierre Fortin, Rachid Habel, Fabienne Jézéquel and Jean-Luc Lamotte: University of Paris 6, France Stan Scott
  85 This chapter will describe the deployment on GPUs of PROP, a program of the 2DRMP suite which models electron collisions with H-like atoms and ions.
  86
  87 Chapter 16: GPU-based envelop-follow simulation techniques for power converters design
  88 Authors: Sheldon Tan + students: University of California, Riverside, USA
  89 This chapter will introduce a new envelope-following parallel transient analysis method for the general switching power converters. This method exploits the parallelism in the envelope-following method and parallelize the Newton update solving part, which is the most computational expensive, in GPU platforms to boost the simulation performance.
  90
  91 Chapter 17: Domain decomposition method on GPU architecture
  92 Authors: Frédéric Magoules: Ecole centrale, Paris, France
  93 This chapter will present how GPU architecture can increase performances of domain decomposition methods.
  94
  95
  96
  97
  98
  99
 100
 101
 102
 103 **** Part Other
 104
 105 Chapter 18: Pseudo Random Number Generator on GPU
 106 Authors: Raphaël Couturier and Christophe Guyeux: University of Franche-Comte, France
 107 This chapter will present some pseudo random number generators which are essential in many applications. We have proposed a generator which has chaotic properties which are proved, whereas it is not the case for other generators. Our generator succeeds to pass all statistical battery series.
 108
 109
 110 Chapter 19: Solving large sparse linear systems for integer factorization on GPUs
 111 Authors: Bertil Schmidt and Hao Yu Dang: University of Mainz, Germany
 112 This chapter will present the number field sieve (NFS) which is the current state-of-the-art integer factorization method. It will focus on how GPUs can be used to accelerate this highly time consuming operation.