Indeed, the benchmarks with the same class, C, are executed on different numbers
of nodes, so the computation required for each iteration is divided by the
number of computing nodes. On the other hand, more communications are required
Indeed, the benchmarks with the same class, C, are executed on different numbers
of nodes, so the computation required for each iteration is divided by the
number of computing nodes. On the other hand, more communications are required