These copies, along with possible scalings or transpositions, are
implemented as CUDA kernels that can be applied to two
matrices of any size, starting at any offset.
Memory accesses are coalesced\index{GPU!coalesced memory accesses} \cite{CUDA_ProgGuide} in order to
provide the best performance for such memory-bound kernels.
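Such a copy kernel can be sketched as follows. This is a minimal illustration, not the actual implementation: the kernel name \texttt{scaled\_copy}, the parameter names, and the assumption of column-major storage are all hypothetical. The key point is that consecutive threads access consecutive elements along the leading dimension, so global-memory accesses are coalesced.

```cuda
// Hypothetical sketch of a scaled sub-matrix copy kernel.
// Assumes column-major storage with leading dimensions ld_src / ld_dst.
// Thread x-index runs along the fastest-varying (row) dimension, so
// neighboring threads touch contiguous addresses: accesses are coalesced.
__global__ void scaled_copy(const double *src, int ld_src,
                            double *dst, int ld_dst,
                            int rows, int cols, double alpha)
{
    int row = blockIdx.x * blockDim.x + threadIdx.x; // contiguous per warp
    int col = blockIdx.y * blockDim.y + threadIdx.y;
    if (row < rows && col < cols)
        dst[col * ld_dst + row] = alpha * src[col * ld_src + row];
}

// Example launch copying an m x n block starting at offset (i, j) of A
// into offset (k, l) of B (all names illustrative):
//   dim3 threads(128, 1);
//   dim3 blocks((m + 127) / 128, n);
//   scaled_copy<<<blocks, threads>>>(A + (size_t)j * lda + i, lda,
//                                    B + (size_t)l * ldb + k, ldb,
//                                    m, n, alpha);
```

Passing pointers already offset to the sub-matrix origin, together with the leading dimensions, is what lets one kernel handle blocks of any size at any offset.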
\item[Step 2] (``Local copies''):~data are copied from
local $R$-matrices to temporary arrays ($U$, $V$) and to $\Re^{O}$.