where the real-valued function $\phi(x,y,z)$ is the solution sought, $f(x,y,z)$ is a known function and $\Omega=[0,1]^3$. The 3D discretization of the Laplace operator $\nabla^2$ with the finite difference scheme includes 7 points stencil on the computational grid. The numerical approximation of the Poisson problem on three-dimensional grid is repeatedly computed as $\phi=\phi^\star$ such that
\begin{equation}
\begin{array}{ll}
-\phi^\star(x,y,z)= & \frac{1}{6}(\phi(x-h,y,z)+\phi(x+h,y,z) \\
- & +\phi(x,y-h,z)+\phi(x,y+h,z) \\
- & +\phi(x,y,z-h)+\phi(x,y,z+h)\\
- & -h^2f(x,y,z))
+\phi^\star(x,y,z)=&\frac{1}{6}(\phi(x-h,y,z)+\phi(x,y-h,z)+\phi(x,y,z-h)\\&+\phi(x+h,y,z)+\phi(x,y+h,z)+\phi(x,y,z+h)\\&-h^2f(x,y,z))
\end{array}
\label{eq:08}
\end{equation}
until convergence where $h$ is the grid spacing between two adjacent elements in the 3D computational grid.
-In the parallel context, the 3D Poisson problem is partitioned into $L\times p$ sub-problems such that $L$ is the number of clusters and $p$ is the number of processors in each cluster. We apply the three-dimensional partitioning instead of the row-by-row one in order to reduce the size of the data shared at the sub-problems boundaries. In this case, each processor is in charge of parallelepipedic sub-problem and has at most six neighbors in the same cluster or in distant clusters with which it shares data at boundaries.
+In the parallel context, the 3D Poisson problem is partitioned into $L\times p$ sub-problems such that $L$ is the number of clusters and $p$ is the number of processors in each cluster. We apply the three-dimensional partitioning instead of the row-by-row one in order to reduce the size of the data shared at the sub-problems boundaries. In this case, each processor is in charge of parallelepipedic block of the problem and has at most six neighbors in the same cluster or in distant clusters with which it shares data at boundaries.
\subsection{Study setup and Simulation Methodology}
speed. The network between distant clusters might be a bottleneck for the
global performance of the application.
-\subsection{Comparison of GMRES and Krylov Multisplitting algorithms in
-synchronous mode}
+\subsection{Comparison of GMRES and Krylov Multisplitting algorithms in synchronous mode}
In the scope of this paper, our first objective is to analyze when the Krylov
Multisplitting method has better performances than the classical GMRES
architecture and scaling up the input matrix size}
\ \\
% environment
-\begin{footnotesize}
+
+\begin{figure} [ht!]
+\begin{center}
\begin{tabular}{r c }
\hline
Grid & 2x16, 4x8, 4x16 and 8x8\\ %\hline
Input matrix size & N$_{x}$ x N$_{y}$ x N$_{z}$ =150 x 150 x 150\\ %\hline
- & N$_{x}$ x N$_{y}$ x N$_{z}$ =170 x 170 x 170 \\ \hline
\end{tabular}
-Table 1 : Clusters x Nodes with N$_{x}$=150 or N$_{x}$=170 \\
+\caption{Clusters x Nodes with N$_{x}$=150 or N$_{x}$=170 \RC{je ne comprends pas la légende... Ca ne serait pas plutot Characteristics of cluster (mais il faudrait lui donner un nom)}}
+\end{center}
+\end{figure}
-\end{footnotesize}
%\RCE{J'ai voulu mettre les tableaux des données mais je pense que c'est inutile et ça va surcharger}
-In this section, we compare the algorithms performance running on various grid configuration (2x16, 4x8, 4x16 and 8x8). First, the results in figure 3 show for all grid configuration the non-variation of the number of iterations of classical GMRES for a given input matrix size; it is not
-the case for the multisplitting method.
+In this section, we analyze the performences of algorithms running on various
+grid configuration (2x16, 4x8, 4x16 and 8x8). First, the results in Figure~\ref{fig:01}
+show for all grid configuration the non-variation of the number of iterations of
+classical GMRES for a given input matrix size; it is not the case for the
+multisplitting method.
+
+\RC{CE attention tu n'as pas mis de label dans tes figures, donc c'est le bordel, j'en mets mais vérifie...}
+\RC{Les légendes ne sont pas explicites...}
+
-%\begin{wrapfigure}{l}{100mm}
\begin{figure} [ht!]
-\centering
-\includegraphics[width=100mm]{cluster_x_nodes_nx_150_and_nx_170.pdf}
-\caption{Cluster x Nodes N$_{x}$=150 and N$_{x}$=170}
-%\label{overflow}}
+ \begin{center}
+ \includegraphics[width=100mm]{cluster_x_nodes_nx_150_and_nx_170.pdf}
+ \end{center}
+ \caption{Cluster x Nodes N$_{x}$=150 and N$_{x}$=170}
+ \label{fig:01}
\end{figure}
-%\end{wrapfigure}
+
The execution time difference between the two algorithms is important when
comparing between different grid architectures, even with the same number of