-However, the iterative methods suffer from scalability problems on parallel computing platforms with many processors due to their need for reduction operations and collective communications to perform matrix-vector multiplications. The communications on large clusters with thousands of cores and large sizes of messages can significantly affect the performances of iterative methods. In practice, Krylov subspace iteration methods are often used with preconditioners in order to increase their convergence and accelerate their performances. However, most of the good preconditioners are not scalable on large clusters.
+However, the iterative methods suffer from scalability problems on parallel computing platforms with many processors due to their need for reduction operations and collective communications to perform matrix-vector multiplications. The communications on large clusters with thousands of cores and large sizes of messages can significantly affect the performances of iterative methods. In practice, Krylov subspace iteration methods are often used with preconditioners in order to increase their convergence and accelerate their performances. However, most of the good preconditioners are not scalable on large clusters.
+
+In this paper we propose a two-stage algorithm, also called inner-outer iteration algorithm, based on two nested iterations.
+
+This paper is organized as follows. In Section~\ref{sec:02} some related works are presented. Section~\ref{sec:03} presents our two-stage algorithm based on Krylov subspace iteration methods. Section~\ref{sec:04} shows some experimental results obtained on large clusters using routines of PETSC toolkit.