-After analyzing the outputs, generally, for the configuration with two or three
-clusters including one hundred hosts (Tables~\ref{tab.cluster.2x50}
-and~\ref{tab.cluster.3x33}), some combinations of the used parameters affecting
-the results have given a relative gain more than 2.5, showing the effectiveness of the
-asynchronous performance compared to the synchronous mode.
-
-In the case of a two clusters configuration, Table~\ref{tab.cluster.2x50} shows
-that with a deterioration of inter cluster network set with \np[Mbit/s]{5} of
-bandwidth, a latency in order of a hundredth of a millisecond and a system power
-of one GFlops, an efficiency of about \np[\%]{40} in asynchronous mode is
-obtained for a matrix size of 62 elements. It is noticed that the result remains
-stable even if we vary the external precision from \np{E-5} to \np{E-9}. By
-increasing the matrix size up to 100 elements, it was necessary to increase the
-CPU power of \np[\%]{50} to \np[GFlops]{1.5} for a convergence of the algorithm
-with the same order of asynchronous mode efficiency. Maintaining such a system
-power but this time, increasing network throughput inter cluster up to
-\np[Mbit/s]{50}, the result of efficiency with a relative gain of 1.5\AG[]{2.5 ?} is obtained with
-high external precision of \np{E-11} for a matrix size from 110 to 150 side
-elements.
-
-For the 3 clusters architecture including a total of 100 hosts,
-Table~\ref{tab.cluster.3x33} shows that it was difficult to have a combination
-which gives a relative gain of asynchronous mode more than 1.2. Indeed, for a
-matrix size of 62 elements, equality between the performance of the two modes
-(synchronous and asynchronous) is achieved with an inter cluster of
-\np[Mbit/s]{10} and a latency of \np[ms]{E-1}. To challenge an efficiency greater than 1.2 with a matrix size of 100 points, it was necessary to degrade the
-inter cluster network bandwidth from 5 to \np[Mbit/s]{2}.
-\AG{Conclusion, on prend une plateforme pourrie pour avoir un bon ratio sync/async ???
- Quelle est la perte de perfs en faisant ça ?}
-
-A last attempt was made for a configuration of three clusters but more powerful
-with 200 nodes in total. The convergence with a relative gain around 1.1 was
-obtained with a bandwidth of \np[Mbit/s]{1} as shown in
-Table~\ref{tab.cluster.3x67}.
-
-\RC{Est ce qu'on sait expliquer pourquoi il y a une telle différence entre les résultats avec 2 et 3 clusters... Avec 3 clusters, ils sont pas très bons... Je me demande s'il ne faut pas les enlever...}
-\RC{En fait je pense avoir la réponse à ma remarque... On voit avec les 2 clusters que le gain est d'autant plus grand qu'on choisit une bonne précision. Donc, plusieurs solutions, lancer rapidement un long test pour confirmer ca, ou enlever des tests... ou on ne change rien :-)}
-\LZK{Ma question est: le bandwidth et latency sont ceux inter-clusters ou pour les deux inter et intra cluster??}
-
+After analyzing the outputs, generally, for the two clusters including one hundred hosts configuration (Tables~\ref{tab.cluster.2x50}), some combinations of parameters affecting
+the results, have given a relative gain of more than 2.5, showing the effectiveness of the
+asynchronous multisplitting compared to GMRES with two distant clusters.
+
+With these settings, Table~\ref{tab.cluster.2x50} shows
+that after setting the bandwidth of the inter cluster network to \np[Mbit/s]{5}, the latency to $20$ millisecond and the processor power
+to one GFlops, an efficiency of about \np[\%]{40} is
+obtained in asynchronous mode for a matrix size of $62^3$ elements. It is noticed that the result remains
+stable even if the residual error precision varies from \np{E-5} to \np{E-9}. By
+increasing the matrix size up to $100^3$ elements, it was necessary to increase the
+CPU power by \np[\%]{50} to \np[GFlops]{1.5} to get the algorithm convergence and the same order of asynchronous mode efficiency. Maintaining a relative gain of $2.5$ and such processor power but increasing network throughput inter cluster up to \np[Mbit/s]{50}, is obtained with
+high external precision of \np{E-11} for a matrix size from $110^3$ to $150^3$ side
+elements.
+
+%For the 3 clusters architecture including a total of 100 hosts,
+%Table~\ref{tab.cluster.3x33} shows that it was difficult to have a combination
+%which gives a relative gain of asynchronous mode more than 1.2. Indeed, for a
+%matrix size of 62 elements, equality between the performance of the two modes
+%(synchronous and asynchronous) is achieved with an inter cluster of
+%\np[Mbit/s]{10} and a latency of \np[ms]{E-1}. To challenge an efficiency greater than 1.2 with a matrix %size of 100 points, it was necessary to degrade the
+%inter cluster network bandwidth from 5 to \np[Mbit/s]{2}.
+%\AG{Conclusion, on prend une plateforme pourrie pour avoir un bon ratio sync/async ???
+ %Quelle est la perte de perfs en faisant ça ?}
+
+%A last attempt was made for a configuration of three clusters but more powerful
+%with 200 nodes in total. The convergence with a relative gain around 1.1 was
+%obtained with a bandwidth of \np[Mbit/s]{1} as shown in
+%Table~\ref{tab.cluster.3x67}.
+
+%\RC{Est ce qu'on sait expliquer pourquoi il y a une telle différence entre les résultats avec 2 et 3 clusters... Avec 3 clusters, ils sont pas très bons... Je me demande s'il ne faut pas les enlever...}
+%\RC{En fait je pense avoir la réponse à ma remarque... On voit avec les 2 clusters que le gain est d'autant plus grand qu'on choisit une bonne précision. Donc, plusieurs solutions, lancer rapidement un long test pour confirmer ca, ou enlever des tests... ou on ne change rien :-)}
+%\LZK{Ma question est: le bandwidth et latency sont ceux inter-clusters ou pour les deux inter et intra cluster??}
+%\CER{Définitivement, les paramètres réseaux variables ici se rapportent au réseau INTER cluster.}