X-Git-Url: https://bilbo.iut-bm.univ-fcomte.fr/and/gitweb/book_gpu.git/blobdiff_plain/1874c46934f4ba7e8c2013d3829f65309456d292..17bff40b83bcdcc39769f9e59c70ffae1c525b72:/BookGPU/Chapters/chapter15/ch15.tex?ds=inline diff --git a/BookGPU/Chapters/chapter15/ch15.tex b/BookGPU/Chapters/chapter15/ch15.tex index 7e25220..64eb771 100644 --- a/BookGPU/Chapters/chapter15/ch15.tex +++ b/BookGPU/Chapters/chapter15/ch15.tex @@ -670,7 +670,7 @@ Fig.~\ref{offdiagonal} for an off-diagonal sector. These copies, along with possible scalings or transpositions, are implemented as CUDA kernels which can be applied to two matrices of any size starting at any offset. - Memory accesses are coalesced\index{coalesced memory accesses} \cite{CUDA_ProgGuide} in order to + Memory accesses are coalesced\index{GPU!coalesced memory accesses} \cite{CUDA_ProgGuide} in order to provide the best performance for such memory-bound kernels. \item[Step 2] (``Local copies''):~data are copied from local $R$-matrices to temporary arrays ($U$, $V$) and to $\Re^{O}$.