X-Git-Url: https://bilbo.iut-bm.univ-fcomte.fr/and/gitweb/mpi-energy2.git/blobdiff_plain/215b0583e051f6596cbef34868f5e5fe2dcfdbba..d76c3b96fde2dfb310f6f5bb4d244c9d255984fd:/Heter_paper.tex diff --git a/Heter_paper.tex b/Heter_paper.tex index babdfc2..530730c 100644 --- a/Heter_paper.tex +++ b/Heter_paper.tex @@ -632,7 +632,7 @@ which results in bigger energy savings. \EndWhile \State Return $Sopt_1,Sopt_2,\dots,Sopt_N$ \end{algorithmic} - \caption{Heterogeneous scaling algorithm} + \caption{frequency scaling factors selection algorithm} \label{HSA} \end{algorithm} @@ -663,7 +663,8 @@ EQ(\ref{eq:perf}) and the energy model computed by EQ(\ref{eq:energy}). The energy model is also significantly dependent on the execution time model because the static energy is linearly related the execution time and the dynamic energy is related to the computation time. So, all of the work presented in this paper is based on the execution time model. To verify this model, the predicted -execution time was compared to the real execution time over Simgrid for all the NAS parallel benchmarks +execution time was compared to the real execution time over SimGrid/SMPI simulator, v3.10~\cite{casanova+giersch+legrand+al.2014.versatile}, +for all the NAS parallel benchmarks NPB v3.3 \cite{NAS.Parallel.Benchmarks}, running class B on 8 or 9 nodes. The comparison showed that the proposed execution time model is very precise, the maximum normalized difference between the predicted execution time and the real execution time is equal to 0.03 for all the NAS benchmarks. @@ -683,11 +684,11 @@ vector of frequency scaling factors that gives the results of the next sections. \label{sec.expe} To evaluate the efficiency and the overall energy consumption reduction of algorithm~(\ref{HSA}), it was applied to the NAS parallel benchmarks NPB v3.3. The experiments were executed -on the simulator SimGrid/SMPI v3.10~\cite{casanova+giersch+legrand+al.2014.versatile} which offers -easy tools to create a heterogeneous platform and run message passing applications over it. The -heterogeneous platform that was used in the experiments, had one core per node because just one -process was executed per node. The heterogeneous platform was composed of four types of nodes. -Each type of nodes had different characteristics such as the maximum CPU frequency, the number of +on the simulator SimGrid/SMPI which offers easy tools to create a heterogeneous platform and run +message passing applications over it. The heterogeneous platform that was used in the experiments, +had one core per node because just one process was executed per node. +The heterogeneous platform was composed of four types of nodes. Each type of nodes had different +characteristics such as the maximum CPU frequency, the number of available frequencies and the computational power, see table (\ref{table:platform}). The characteristics of these different types of nodes are inspired from the specifications of real Intel processors. The heterogeneous platform had up to 144 nodes and had nodes from the four types in equal proportions,