X-Git-Url: https://bilbo.iut-bm.univ-fcomte.fr/and/gitweb/kahina_paper2.git/blobdiff_plain/af5339950d91a3a52a04304214f1350ddac54bba..13cfd00767389ce6a0bcc1e192ff84f8e5ead47c:/paper.tex?ds=sidebyside diff --git a/paper.tex b/paper.tex index 9ebca6f..7064995 100644 --- a/paper.tex +++ b/paper.tex @@ -466,7 +466,8 @@ Finding polynomial roots rapidly and accurately is the main objective of our wor %This paper is organized as follows. In Section~\ref{sec2} we recall the Ehrlich-Aberth method. In section~\ref{sec3} we present EA algorithm on single GPU. In section~\ref{sec4} we propose the EA algorithm implementation on Multi-GPU for (OpenMP-CUDA) approach and (MPI-CUDA) approach. In sectioné\ref{sec5} we present our experiments and discus it. Finally, Section~\ref{sec6} concludes this paper and gives some hints for future research directions in this topic.} -The paper is organized as follows. In Section~\ref{sec2} we present three different parallel programming models OpenMP, MPI and CUDA. +The paper is organized as follows. In Section~\ref{sec2} we present three different parallel programming models OpenMP, MPI and CUDA. In Section~\ref{sec3} we present the Ehrlich-Aberth algorithm on a single GPU. In Section~\ref{sec4} we propose the parallel implementations of the Ehrlich-Aberth algorithm using the OpenMP and MPI approaches. + \LZK{A revoir toute cette organization} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% @@ -505,6 +506,9 @@ CUDA (Compute Unified Device Architecture) is a parallel computing architecture %We introduced three paradigms of parallel programming. Our objective consists in implementing a root finding polynomial algorithm on multiple GPUs. To this end, it is primordial to know how to manage CUDA contexts of different GPUs. A direct method for controlling the various GPUs is to use as many threads or processes as GPU devices. We can choose the GPU index based on the identifier of OpenMP thread or the rank of the MPI process. Both approaches will be investigated. +%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% +%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% + \section{The EA algorithm on a single GPU} \label{sec3} \subsection{The EA method} @@ -860,33 +864,40 @@ This is due to the use of MPI parallel paradigm that divides the problem computa -%\begin{figure}[htbp] -%\centering - % \includegraphics[angle=-90,width=0.5\textwidth]{Sparse} -%\caption{Comparaison between MPI and OpenMP versions of the Ehrlich-Aberth method for solving sparse plynomials on GPUs} -%\label{fig:05} -%\end{figure} +\begin{figure}[htbp] +\centering + \includegraphics[angle=-90,width=0.5\textwidth]{Sparse} +\caption{Comparaison between MPI and OpenMP versions of the Ehrlich-Aberth method for solving sparse plynomials on GPUs.} +\label{fig:05} +\end{figure} -%\begin{figure}[htbp] -%\centering - % \includegraphics[angle=-90,width=0.5\textwidth]{Full} -%\caption{Comparaison between MPI and OpenMP versions of the Ehrlich-Aberth method for solving full polynomials on GPUs} -%\label{fig:06} -%\end{figure} +\begin{figure}[htbp] +\centering + \includegraphics[angle=-90,width=0.5\textwidth]{Full} +\caption{Comparaison between MPI and OpenMP versions of the Ehrlich-Aberth method for solving full polynomials on GPUs.} +\label{fig:06} +\end{figure} -%\begin{figure}[htbp] -%\centering - % \includegraphics[angle=-90,width=0.5\textwidth]{MPI} -%\caption{Comparaison of execution times of the Ehrlich-Aberth method for solving sparse and full polynomials on GPUs with distributed memory paradigm using MPI} -%\label{fig:07} -%\end{figure} +\begin{figure}[htbp] +\centering + \includegraphics[angle=-90,width=0.5\textwidth]{MPI} +\caption{Comparaison of execution times of the Ehrlich-Aberth method for solving sparse and full polynomials on GPUs with distributed memory paradigm using MPI.} +\label{fig:07} +\end{figure} -%\begin{figure}[htbp] -%\centering - % \includegraphics[angle=-90,width=0.5\textwidth]{OMP} -%\caption{Comparaison of execution times of the Ehrlich-Aberth method for solving sparse and full polynomials on GPUs with shared memory paradigm using OpenMP} -%\label{fig:08} -%\end{figure} +\begin{figure}[htbp] +\centering + \includegraphics[angle=-90,width=0.5\textwidth]{OMP} +\caption{Comparaison of execution times of the Ehrlich-Aberth method for solving sparse and full polynomials on GPUs with shared memory paradigm using OpenMP.} +\label{fig:08} +\end{figure} + +\begin{figure}[htbp] +\centering + \includegraphics[angle=-90,width=0.5\textwidth]{big} +\caption{Execution times in seconds of the Ehrlich-Aberth method for solving full polynomials of high degrees on 4 GPUs.} +\label{fig:09} +\end{figure} % An example of a floating figure using the graphicx package. % Note that \label must occur AFTER (or within) \caption.