instruction in this kernel is used to computed the \texttt{tid} which
representes the thread index. This thread index is computed according to the
values of the block index (it is a variable of CUDA
-called \texttt{blockIdx\index{CUDA~keywords!blockIdx}}). Blocks of threads can
+called \texttt{blockIdx}\index{CUDA~keywords!blockIdx}). Blocks of threads can
be decomposed into 1 dimension, 2 dimensions or 3 dimensions. According to the
dimension of data manipulated, the appropriate dimension can be useful. In our
example, only one dimension is used. Then using notation \texttt{.x} we can