---------------------------------------------------------------------------
-- Size of the global domain: 50653 = 37 x 37 x 37
-- Number of processes in the global domain: 1
-- Split the global domain in: X axis=1, Y axis=1, Z axis=1
-- Rank of the present process in the global domain: 0
-- Neighbors in the global domain: GAU=-2, DRO=-2, BAS=-2, HAU=-2, AVN=-2, ARR=-2
--------------------------------------------------------------------------
---------------------------------------------------------------------------
-- Process displaying messages: 0
-- Maximum number of restarts of GMRES: 16
-- Maximum number of iterations in classical gmres: 5000000
-- Maximum number of inner-iterations in gmres two-stage: 5
-- Maximum number of outer-iterations in gmres two-stage: 5000000
-- Tolerance threshold in classical gmres: 1.00e-06
-- Tolerance threshold of the inner-iterations in gmres two-stage: 1.00e-10
-- Tolerance threshold of the outer-iterations in gmres two-stage: 1.00e-06
-- Values on the matrix diagonal: 6.000000
-- Values on the matrix off-diagonal: -1.000000
-- Size of the matrix S: 50653 x 10
-- Maximum number of iterations for CGLS: 20
-- Tolerance threshold for CGLS: 1.00e-25
-- SYNCHRONOUS GMRES TWO STAGE
--------------------------------------------------------------------------
***********************************************
********* CLASSICAL GMRES ON CPU **********
***********************************************
-- CPU normb_begin: 3.75103689368593862241e+01
-- CPU beta_begin: 1.79830552959599913265e+02
** rank 0 iter 1 - r is 1.597e+01
** rank 0 iter 17 - r is 5.350e+00
** rank 0 iter 33 - r is 1.839e+00
** rank 0 iter 49 - r is 6.567e-01
** rank 0 iter 65 - r is 2.355e-01
** rank 0 iter 81 - r is 8.551e-02
** rank 0 iter 97 - r is 3.111e-02
** rank 0 iter 113 - r is 1.144e-02
** rank 0 iter 129 - r is 4.218e-03
** rank 0 iter 145 - r is 1.574e-03
** rank 0 iter 161 - r is 5.889e-04
** rank 0 iter 177 - r is 2.233e-04
** rank 0 iter 193 - r is 8.481e-05
** rank 0 iter 209 - r is 8.481e-05
-- CPU processing time: 2011.869907 (ms)
-- CPU iter_end: 224
-- CPU normb_end: 3.75103689368593862241e+01
-- CPU beta_end: 3.57864724215958064949e-05
-- CPU MAX of residu: 5.914423e-07
-----------------------------------------------------------------------------------------------------
-- Size of the global domain: 50653 = 37 x 37 x 37
-- Number of processes in the global domain: 1
-- Split the global domain: X axis=1, Y axis=1, Z axis=1
-- Rank of the present process in the global domain: 0
-- Rank of the present process in the local cluster: 0
-- Size of the subdomain: 50653 = 37 x 37 x 37
-- Neighbors in the global domain: GAU=-2, DRO=-2, BAS=-2, HAU=-2, AVN=-2, ARR=-2
-- Neighbors in the local cluster: GAU=-2, DRO=-2, BAS=-2, HAU=-2, AVN=-2, ARR=-2
-- Neighbors in the extern clusters: GAU=-2, DRO=-2, BAS=-2, HAU=-2, AVN=-2, ARR=-2
-----------------------------------------------------------------------------------------------------
***********************************************
*** SYNCHRONOUS GMRES TWO STAGES ON CPU ***
***********************************************
-- CPU normb_begin: 3.75103689368593862241e+01
-- CPU beta_begin: 1.79830552959599913265e+02
** rank 0 iter 1 - r is 4.542e+01
** rank 0 iter 11 - r is 6.182e-01
** rank 0 iter 21 - r is 7.913e-03
** rank 0 iter 31 - r is 1.134e-03
** rank 0 iter 41 - r is 1.572e-05
** rank 0 iter 51 - r is 4.193e-06
** rank 0 iter 61 -
-- CPU processing time: 3036.126137 (ms)
-- CPU External iter: 61
-- CPU Total iter: 280
-- CPU normb_end: 3.75103689368593862241e+01
-- CPU beta_end: 5.53073906747368889757e-08
-- CPU MAX of residu: 6.861261e-10
-- Compare gmres & gmres2stages: 6.652595e-05 Speedup: 1.509107
*******************************************************
*** CLASSIC SYNCHRONOUS GMRES TWO STAGES ON CPU ***
*******************************************************
-- CPU normb_begin: 3.75103689368593862241e+01
-- CPU beta_begin: 1.79830552959599913265e+02
** rank 0 iter 1 - r is 1.134e+01
** rank 0 iter 2 - r is 4.105e+00
** rank 0 iter 3 - r is 2.230e+00
** rank 0 iter 4 - r is 2.742e+00
** rank 0 iter 5 - r is 2.493e+00
** rank 0 iter 6 - r is 2.837e+00
** rank 0 iter 7 - r is 2.495e+00
** rank 0 iter 8 - r is 2.768e+00
** rank 0 iter 9 - r is 2.352e+00
** rank 0 iter 10 - r is 2.589e+00
** rank 0 iter 11 - r is 2.152e+00
** rank 0 iter 12 - r is 2.373e+00
** rank 0 iter 13 - r is 1.944e+00
** rank 0 iter 14 - r is 2.155e+00
** rank 0 iter 15 - r is 1.750e+00
** rank 0 iter 16 - r is 1.949e+00
** rank 0 iter 17 - r is 1.572e+00
** rank 0 iter 18 - r is 1.758e+00
** rank 0 iter 19 - r is 1.410e+00
** rank 0 iter 20 - r is 1.584e+00
** rank 0 iter 21 - r is 1.269e+00
** rank 0 iter 22 - r is 1.426e+00
** rank 0 iter 23 - r is 1.141e+00
** rank 0 iter 24 - r is 1.283e+00
** rank 0 iter 25 - r is 1.026e+00
** rank 0 iter 26 - r is 1.154e+00
** rank 0 iter 27 - r is 9.225e-01
** rank 0 iter 28 - r is 1.038e+00
** rank 0 iter 29 - r is 8.291e-01
** rank 0 iter 30 - r is 9.331e-01
** rank 0 iter 31 - r is 7.450e-01
** rank 0 iter 32 - r is 8.387e-01
** rank 0 iter 33 - r is 6.693e-01
** rank 0 iter 34 - r is 7.537e-01
** rank 0 iter 35 - r is 6.012e-01
** rank 0 iter 36 - r is 6.772e-01
** rank 0 iter 37 - r is 5.400e-01
** rank 0 iter 38 - r is 6.084e-01
** rank 0 iter 39 - r is 4.850e-01
** rank 0 iter 40 - r is 5.465e-01
** rank 0 iter 41 - r is 4.355e-01
** rank 0 iter 42 - r is 4.908e-01
** rank 0 iter 43 - r is 3.911e-01
** rank 0 iter 44 - r is 4.408e-01
** rank 0 iter 45 - r is 3.512e-01
** rank 0 iter 46 - r is 3.958e-01
** rank 0 iter 47 - r is 3.153e-01
** rank 0 iter 48 - r is 3.554e-01
** rank 0 iter 49 - r is 2.831e-01
** rank 0 iter 50 - r is 3.191e-01
** rank 0 iter 51 - r is 2.541e-01
** rank 0 iter 52 - r is 2.865e-01
** rank 0 iter 53 - r is 2.282e-01
** rank 0 iter 54 - r is 2.572e-01
** rank 0 iter 55 - r is 2.048e-01
** rank 0 iter 56 - r is 2.309e-01
** rank 0 iter 57 - r is 1.839e-01
** rank 0 iter 58 - r is 2.072e-01
** rank 0 iter 59 - r is 1.650e-01
** rank 0 iter 60 - r is 1.860e-01
** rank 0 iter 61 - r is 1.482e-01
** rank 0 iter 62 - r is 1.670e-01
** rank 0 iter 63 - r is 1.330e-01
** rank 0 iter 64 - r is 1.499e-01
** rank 0 iter 65 - r is 1.194e-01
** rank 0 iter 66 - r is 1.345e-01
** rank 0 iter 67 - r is 1.071e-01
** rank 0 iter 68 - r is 1.207e-01
** rank 0 iter 69 - r is 9.616e-02
** rank 0 iter 70 - r is 1.083e-01
** rank 0 iter 71 - r is 8.630e-02
** rank 0 iter 72 - r is 9.720e-02
** rank 0 iter 73 - r is 7.746e-02
** rank 0 iter 74 - r is 8.723e-02
** rank 0 iter 75 - r is 6.952e-02
** rank 0 iter 76 - r is 7.827e-02
** rank 0 iter 77 - r is 6.239e-02
** rank 0 iter 78 - r is 7.024e-02
** rank 0 iter 79 - r is 5.599e-02
** rank 0 iter 80 - r is 6.302e-02
** rank 0 iter 81 - r is 5.025e-02
** rank 0 iter 82 - r is 5.655e-02
** rank 0 iter 83 - r is 4.509e-02
** rank 0 iter 84 - r is 5.074e-02
** rank 0 iter 85 - r is 4.047e-02
** rank 0 iter 86 - r is 4.552e-02
** rank 0 iter 87 - r is 3.631e-02
** rank 0 iter 88 - r is 4.085e-02
** rank 0 iter 89 - r is 3.259e-02
** rank 0 iter 90 - r is 3.665e-02
** rank 0 iter 91 - r is 2.924e-02
** rank 0 iter 92 - r is 3.288e-02
** rank 0 iter 93 - r is 2.624e-02
** rank 0 iter 94 - r is 2.950e-02
** rank 0 iter 95 - r is 2.354e-02
** rank 0 iter 96 - r is 2.646e-02
** rank 0 iter 97 - r is 2.113e-02
** rank 0 iter 98 - r is 2.374e-02
** rank 0 iter 99 - r is 1.896e-02
** rank 0 iter 100 - r is 2.130e-02
** rank 0 iter 101 - r is 1.701e-02
** rank 0 iter 102 - r is 1.911e-02
** rank 0 iter 103 - r is 1.526e-02
** rank 0 iter 104 - r is 1.714e-02
** rank 0 iter 105 - r is 1.369e-02
** rank 0 iter 106 - r is 1.538e-02
** rank 0 iter 107 - r is 1.229e-02
** rank 0 iter 108 - r is 1.380e-02
** rank 0 iter 109 - r is 1.102e-02
** rank 0 iter 110 - r is 1.238e-02
** rank 0 iter 111 - r is 9.891e-03
** rank 0 iter 112 - r is 1.110e-02
** rank 0 iter 113 - r is 8.874e-03
** rank 0 iter 114 - r is 9.960e-03
** rank 0 iter 115 - r is 7.962e-03
** rank 0 iter 116 - r is 8.934e-03
** rank 0 iter 117 - r is 7.143e-03
** rank 0 iter 118 - r is 8.014e-03
** rank 0 iter 119 - r is 6.409e-03
** rank 0 iter 120 - r is 7.189e-03
** rank 0 iter 121 - r is 5.750e-03
** rank 0 iter 122 - r is 6.449e-03
** rank 0 iter 123 - r is 5.159e-03
** rank 0 iter 124 - r is 5.785e-03
** rank 0 iter 125 - r is 4.628e-03
** rank 0 iter 126 - r is 5.189e-03
** rank 0 iter 127 - r is 4.152e-03
** rank 0 iter 128 - r is 4.655e-03
** rank 0 iter 129 - r is 3.725e-03
** rank 0 iter 130 - r is 4.175e-03
** rank 0 iter 131 - r is 3.342e-03
** rank 0 iter 132 - r is 3.745e-03
** rank 0 iter 133 - r is 2.998e-03
** rank 0 iter 134 - r is 3.360e-03
** rank 0 iter 135 - r is 2.690e-03
** rank 0 iter 136 - r is 3.014e-03
** rank 0 iter 137 - r is 2.413e-03
** rank 0 iter 138 - r is 2.703e-03
** rank 0 iter 139 - r is 2.165e-03
** rank 0 iter 140 - r is 2.425e-03
** rank 0 iter 141 - r is 1.942e-03
** rank 0 iter 142 - r is 2.175e-03
** rank 0 iter 143 - r is 1.742e-03
** rank 0 iter 144 - r is 1.951e-03
** rank 0 iter 145 - r is 1.563e-03
** rank 0 iter 146 - r is 1.750e-03
** rank 0 iter 147 - r is 1.402e-03
** rank 0 iter 148 - r is 1.570e-03
** rank 0 iter 149 - r is 1.258e-03
** rank 0 iter 150 - r is 1.408e-03
** rank 0 iter 151 - r is 1.128e-03
** rank 0 iter 152 - r is 1.263e-03
** rank 0 iter 153 - r is 1.012e-03
** rank 0 iter 154 - r is 1.133e-03
** rank 0 iter 155 - r is 9.079e-04
** rank 0 iter 156 - r is 1.016e-03
** rank 0 iter 157 - r is 8.144e-04
** rank 0 iter 158 - r is 9.113e-04
** rank 0 iter 159 - r is 7.306e-04
** rank 0 iter 160 - r is 8.174e-04
** rank 0 iter 161 - r is 6.554e-04
** rank 0 iter 162 - r is 7.332e-04
** rank 0 iter 163 - r is 5.879e-04
** rank 0 iter 164 - r is 6.576e-04
** rank 0 iter 165 - r is 5.274e-04
** rank 0 iter 166 - r is 5.898e-04
** rank 0 iter 167 - r is 4.731e-04
** rank 0 iter 168 - r is 5.291e-04
** rank 0 iter 169 - r is 4.244e-04
** rank 0 iter 170 - r is 4.745e-04
** rank 0 iter 171 - r is 3.807e-04
** rank 0 iter 172 - r is 4.256e-04
** rank 0 iter 173 - r is 3.415e-04
** rank 0 iter 174 - r is 3.818e-04
** rank 0 iter 175 - r is 3.063e-04
** rank 0 iter 176 - r is 3.424e-04
** rank 0 iter 177 - r is 2.748e-04
** rank 0 iter 178 - r is 3.071e-04
** rank 0 iter 179 - r is 2.465e-04
** rank 0 iter 180 - r is 2.755e-04
** rank 0 iter 181 - r is 2.211e-04
** rank 0 iter 182 - r is 2.471e-04
** rank 0 iter 183 - r is 1.983e-04
** rank 0 iter 184 - r is 2.216e-04
** rank 0 iter 185 - r is 1.779e-04
** rank 0 iter 186 - r is 1.988e-04
** rank 0 iter 187 - r is 1.596e-04
** rank 0 iter 188 - r is 1.783e-04
** rank 0 iter 189 - r is 1.431e-04
** rank 0 iter 190 - r is 1.599e-04
** rank 0 iter 191 - r is 1.284e-04
** rank 0 iter 192 - r is 1.434e-04
** rank 0 iter 193 - r is 1.152e-04
** rank 0 iter 194 - r is 1.287e-04
** rank 0 iter 195 - r is 1.033e-04
** rank 0 iter 196 - r is 1.154e-04
** rank 0 iter 197 - r is 9.267e-05
** rank 0 iter 198 - r is 1.035e-04
** rank 0 iter 199 - r is 8.313e-05
** rank 0 iter 200 - r is 9.284e-05
** rank 0 iter 201 - r is 7.456e-05
** rank 0 iter 202 - r is 8.327e-05
** rank 0 iter 203 - r is 6.688e-05
** rank 0 iter 204 - r is 7.469e-05
** rank 0 iter 205 - r is 5.999e-05
** rank 0 iter 206 - r is 6.699e-05
** rank 0 iter 207 - r is 5.381e-05
** rank 0 iter 208 - r is 6.009e-05
** rank 0 iter 209 - r is 4.827e-05
** rank 0 iter 210 - r is 5.389e-05
** rank 0 iter 211 - r is 4.330e-05
** rank 0 iter 212 - r is 4.834e-05
** rank 0 iter 213 - r is 3.884e-05
** rank 0 iter 214 - r is 4.336e-05
** rank 0 iter 215 - r is 3.484e-05
** rank 0 iter 216 - r is 3.889e-05
** rank 0 iter 217 - r is 3.125e-05
** rank 0 iter 218 - r is 3.488e-05
** rank 0 iter 219 - r is 2.803e-05
** rank 0 iter 220 - r is 3.129e-05
** rank 0 iter 221 - r is 2.514e-05
** rank 0 iter 222 - r is 2.806e-05
** rank 0 iter 223 - r is 2.255e-05
** rank 0 iter 224 - r is 2.517e-05
** rank 0 iter 225 - r is 2.023e-05
** rank 0 iter 226 - r is 2.257e-05
** rank 0 iter 227 - r is 1.815e-05
** rank 0 iter 228 - r is 2.025e-05
** rank 0 iter 229 - r is 1.629e-05
** rank 0 iter 230 - r is 1.816e-05
** rank 0 iter 231 - r is 1.462e-05
** rank 0 iter 232 - r is 1.629e-05
** rank 0 iter 233 - r is 1.312e-05
** rank 0 iter 234 - r is 1.461e-05
** rank 0 iter 235 - r is 1.178e-05
** rank 0 iter 236 - r is 1.310e-05
** rank 0 iter 237 - r is 1.058e-05
** rank 0 iter 238 - r is 1.175e-05
** rank 0 iter 239 - r is 9.495e-06
** rank 0 iter 240 - r is 1.054e-05
** rank 0 iter 241 - r is 8.525e-06
** rank 0 iter 242 - r is 9.456e-06
** rank 0 iter 243 - r is 7.653e-06
** rank 0 iter 244 - r is 8.481e-06
** rank 0 iter 245 - r is 6.871e-06
** rank 0 iter 246 - r is 7.607e-06
** rank 0 iter 247 - r is 6.169e-06
** rank 0 iter 248 - r is 6.823e-06
** rank 0 iter 249 - r is 5.539e-06
** rank 0 iter 250 - r is 6.120e-06
** rank 0 iter 251 - r is 4.973e-06
** rank 0 iter 252 - r is 5.489e-06
** rank 0 iter 253 - r is 4.464e-06
** rank 0 iter 254 - r is 4.923e-06
** rank 0 iter 255 - r is 4.008e-06
** rank 0 iter 256 - r is 4.416e-06
** rank 0 iter 257 - r is 3.599e-06
** rank 0 iter 258 - r is 3.961e-06
** rank 0 iter 259 - r is 3.231e-06
** rank 0 iter 260 - r is 3.553e-06
** rank 0 iter 261 - r is 2.901e-06
** rank 0 iter 262 - r is 3.186e-06
** rank 0 iter 263 - r is 2.604e-06
** rank 0 iter 264 - r is 2.858e-06
** rank 0 iter 265 - r is 2.338e-06
** rank 0 iter 266 - r is 2.563e-06
** rank 0 iter 267 - r is 2.099e-06
** rank 0 iter 268 - r is 2.299e-06
** rank 0 iter 269 - r is 1.885e-06
** rank 0 iter 270 - r is 2.062e-06
** rank 0 iter 271 - r is 1.692e-06
** rank 0 iter 272 - r is 1.850e-06
** rank 0 iter 273 - r is 1.519e-06
** rank 0 iter 274 - r is 1.659e-06
** rank 0 iter 275 - r is 1.364e-06
** rank 0 iter 276 - r is 1.488e-06
** rank 0 iter 277 - r is 1.225e-06
** rank 0 iter 278 - r is 1.335e-06
** rank 0 iter 279 - r is 1.099e-06
** rank 0 iter 280 - r is 1.197e-06
** rank 0 iter 281 -
-- CPU processing time: 7664.911032 (ms)
-- CPU External iter: 281
-- CPU Total iter: 1124
-- CPU normb_end: 3.75103689368593862241e+01
-- CPU beta_end: 5.45896525036652100274e-06
-- CPU MAX of residu: 7.672086e-08
-- Compare gmres & classic gmres2stage: 5.051729e-05 Speedup: 3.809844
-- gmres2stage with minimization is 2.524569 times faster than classic gmres2stage
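
Note on the reported ratios: the three "Speedup" / "times faster" figures in this log appear to be plain quotients of the printed processing times. Both "Speedup" values divide a two-stage time by the classical GMRES time, so a value above 1 means the two-stage variant took longer than classical GMRES in this run. The short Python sketch below recomputes them from the logged times; the variable names are illustrative and do not come from the solver's source.

```python
# Wall-clock times in milliseconds, copied from the "CPU processing time" lines.
t_gmres = 2011.869907              # classical GMRES
t_two_stage_min = 3036.126137      # synchronous two-stage GMRES (with minimization)
t_two_stage_classic = 7664.911032  # classic synchronous two-stage GMRES

# Ratios as they seem to be computed by the program (an assumption based on
# the arithmetic, not on the solver's code).
ratio_min_vs_gmres = t_two_stage_min / t_gmres
ratio_classic_vs_gmres = t_two_stage_classic / t_gmres
ratio_classic_vs_min = t_two_stage_classic / t_two_stage_min

print(f"two-stage (minimization) / gmres : {ratio_min_vs_gmres:.6f}")
print(f"two-stage (classic) / gmres      : {ratio_classic_vs_gmres:.6f}")
print(f"two-stage classic / minimization : {ratio_classic_vs_min:.6f}")
```

The recomputed values agree with the logged 1.509107, 3.809844, and 2.524569 to the printed precision.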