new

[book_gpu.git] / BookGPU / Chapters / chapter15 / ch15.tex
diff --git a/BookGPU/Chapters/chapter15/ch15.tex b/BookGPU/Chapters/chapter15/ch15.tex

index 7e25220e38354c0472ab23827a8bf24f8ecbb005..64eb771e3e60d957a2b33a8ec777d13ee5ce958e 100644 (file)
--- a/BookGPU/Chapters/chapter15/ch15.tex
+++ b/BookGPU/Chapters/chapter15/ch15.tex
@@ -670,7 +670,7 @@ Fig.~\ref{offdiagonal} for an off-diagonal sector.
    These copies, along with possible scalings or transpositions, are
    implemented as CUDA kernels which can be applied to two
    matrices of any size starting at any offset. 
    These copies, along with possible scalings or transpositions, are
    implemented as CUDA kernels which can be applied to two
    matrices of any size starting at any offset. 
-  Memory accesses are coalesced\index{coalesced memory accesses} \cite{CUDA_ProgGuide} in order to
+  Memory accesses are coalesced\index{GPU!coalesced memory accesses} \cite{CUDA_ProgGuide} in order to
    provide the best performance for such memory-bound kernels.
  \item[Step 2] (``Local copies''):~data are copied from
    local $R$-matrices to temporary arrays ($U$, $V$) and to $\Re^{O}$.
    provide the best performance for such memory-bound kernels.
  \item[Step 2] (``Local copies''):~data are copied from
    local $R$-matrices to temporary arrays ($U$, $V$) and to $\Re^{O}$.