also to be reviewed. Finally, some compilation errors on MPI\_waitall and MPI\_Finalise primitives have been fixed with the latest version of Simgrid.
In total, the initial MPI program running on the simulation environment SMPI gave after a very simple adaptation the same results as those obtained in a real
environment. We have tested in synchronous mode with a simulated platform starting from a modest 2 or 3 clusters grid to a larger configuration like simulating
-Grid5000 with more than 1500 hosts with 5000 cores [?]. Once the code debugging and adaptation were complete, the next section shows our methodology and experimental
+Grid5000 with more than 1500 hosts with 5000 cores~\cite{bolze2006grid}. Once the code debugging and adaptation were complete, the next section shows our methodology and experimental
results.
Our work has demonstrated that using such a simulation tool allow us to
reach the following three objectives:
-\newcounter{numberedCntD}
\begin{enumerate}
\item To have a flexible configurable execution platform resolving the
hard exercise to access to very limited but so solicited physical
\item and finally and more importantly, to find the correct combination
of the cluster and network specifications permitting to save time in
executing the algorithm in asynchronous mode.
-\setcounter{numberedCntD}{\theenumi}
\end{enumerate}
Our results have shown that in certain conditions, asynchronous mode is
speeder up to \np[\%]{40} than executing the algorithm in synchronous mode
\section*{Acknowledgment}
This work is partially funded by the Labex ACTION program (contract ANR-11-LABX-01-01).
-The authors would like to thank\dots{}
+\todo[inline]{The authors would like to thank\dots{}}
% trigger a \newpage just before the given reference