1 \documentclass{article}
2 \usepackage[utf8]{inputenc}
3 \usepackage[T1]{fontenc}
10 \usepackage{algorithm2e}
12 \usepackage[standard]{ntheorem}
14 % Pour mathds : les ensembles IR, IN, etc.
17 % Pour avoir des intervalles d'entiers
21 % Pour faire des sous-figures dans les figures
22 \usepackage{subfigure}
26 \newtheorem{notation}{Notation}
28 \newcommand{\X}{\mathcal{X}}
29 \newcommand{\Go}{G_{f_0}}
30 \newcommand{\B}{\mathds{B}}
31 \newcommand{\N}{\mathds{N}}
32 \newcommand{\BN}{\mathds{B}^\mathsf{N}}
35 \newcommand{\alert}[1]{\begin{color}{blue}\textit{#1}\end{color}}
37 \title{Efficient generation of pseudo random numbers based on chaotic iterations on GPU}
40 \author{Jacques M. Bahi, Rapha\"{e}l Couturier, and Christophe Guyeux\thanks{Authors in alphabetic order}}
48 \section{Introduction}
50 Interet des itérations chaotiques pour générer des nombre alea\\
51 Interet de générer des nombres alea sur GPU
52 \alert{RC, un petit state-of-the-art sur les PRNGs sur GPU ?}
56 \section{Basic Recalls}
57 \label{section:BASIC RECALLS}
58 This section is devoted to basic definitions and terminologies in the fields of topological chaos and chaotic iterations.
59 \subsection{Devaney's chaotic dynamical systems}
61 In the sequel $S^{n}$ denotes the $n^{th}$ term of a sequence $S$ and $V_{i}$ denotes the $i^{th}$ component of a vector $V$. $f^{k}=f\circ ...\circ f$ denotes the $k^{th}$ composition of a function $f$. Finally, the following notation is used: $\llbracket1;N\rrbracket=\{1,2,\hdots,N\}$.
64 Consider a topological space $(\mathcal{X},\tau)$ and a continuous function $f : \mathcal{X} \rightarrow \mathcal{X}$.
67 $f$ is said to be \emph{topologically transitive} if, for any pair of open sets $U,V \subset \mathcal{X}$, there exists $k>0$ such that $f^k(U) \cap V \neq \varnothing$.
71 An element $x$ is a \emph{periodic point} for $f$ of period $n\in \mathds{N}^*$ if $f^{n}(x)=x$.% The set of periodic points of $f$ is denoted $Per(f).$
75 $f$ is said to be \emph{regular} on $(\mathcal{X}, \tau)$ if the set of periodic points for $f$ is dense in $\mathcal{X}$: for any point $x$ in $\mathcal{X}$, any neighborhood of $x$ contains at least one periodic point (without necessarily the same period).
80 $f$ is said to be \emph{chaotic} on $(\mathcal{X},\tau)$ if $f$ is regular and topologically transitive.
83 The chaos property is strongly linked to the notion of ``sensitivity'', defined on a metric space $(\mathcal{X},d)$ by:
86 \label{sensitivity} $f$ has \emph{sensitive dependence on initial conditions}
87 if there exists $\delta >0$ such that, for any $x\in \mathcal{X}$ and any neighborhood $V$ of $x$, there exist $y\in V$ and $n > 0$ such that $d\left(f^{n}(x), f^{n}(y)\right) >\delta $.
89 $\delta$ is called the \emph{constant of sensitivity} of $f$.
92 Indeed, Banks \emph{et al.} have proven in~\cite{Banks92} that when $f$ is chaotic and $(\mathcal{X}, d)$ is a metric space, then $f$ has the property of sensitive dependence on initial conditions (this property was formerly an element of the definition of chaos). To sum up, quoting Devaney in~\cite{Devaney}, a chaotic dynamical system ``is unpredictable because of the sensitive dependence on initial conditions. It cannot be broken down or simplified into two subsystems which do not interact because of topological transitivity. And in the midst of this random behavior, we nevertheless have an element of regularity''. Fundamentally different behaviors are consequently possible and occur in an unpredictable way.
96 \subsection{Chaotic iterations}
97 \label{sec:chaotic iterations}
100 Let us consider a \emph{system} with a finite number $\mathsf{N} \in
101 \mathds{N}^*$ of elements (or \emph{cells}), so that each cell has a
102 Boolean \emph{state}. Having $\mathsf{N}$ Boolean values for these
103 cells leads to the definition of a particular \emph{state of the
104 system}. A sequence which elements belong to $\llbracket 1;\mathsf{N}
105 \rrbracket $ is called a \emph{strategy}. The set of all strategies is
106 denoted by $\mathbb{S}.$
109 \label{Def:chaotic iterations}
110 The set $\mathds{B}$ denoting $\{0,1\}$, let
111 $f:\mathds{B}^{\mathsf{N}}\longrightarrow \mathds{B}^{\mathsf{N}}$ be
112 a function and $S\in \mathbb{S}$ be a strategy. The so-called
113 \emph{chaotic iterations} are defined by $x^0\in
114 \mathds{B}^{\mathsf{N}}$ and
116 \forall n\in \mathds{N}^{\ast }, \forall i\in
117 \llbracket1;\mathsf{N}\rrbracket ,x_i^n=\left\{
119 x_i^{n-1} & \text{ if }S^n\neq i \\
120 \left(f(x^{n-1})\right)_{S^n} & \text{ if }S^n=i.
125 In other words, at the $n^{th}$ iteration, only the $S^{n}-$th cell is
126 \textquotedblleft iterated\textquotedblright . Note that in a more
127 general formulation, $S^n$ can be a subset of components and
128 $\left(f(x^{n-1})\right)_{S^{n}}$ can be replaced by
129 $\left(f(x^{k})\right)_{S^{n}}$, where $k<n$, describing for example,
130 delays transmission~\cite{Robert1986,guyeux10}. Finally, let us remark that
131 the term ``chaotic'', in the name of these iterations, has \emph{a
132 priori} no link with the mathematical theory of chaos, recalled above.
135 Let us now recall how to define a suitable metric space where chaotic iterations are continuous. For further explanations, see, e.g., \cite{guyeux10}.
137 Let $\delta $ be the \emph{discrete Boolean metric}, $\delta (x,y)=0\Leftrightarrow x=y.$ Given a function $f$, define the function:
140 F_{f}: & \llbracket1;\mathsf{N}\rrbracket\times \mathds{B}^{\mathsf{N}} &
141 \longrightarrow & \mathds{B}^{\mathsf{N}} \\
142 & (k,E) & \longmapsto & \left( E_{j}.\delta (k,j)+f(E)_{k}.\overline{\delta
143 (k,j)}\right) _{j\in \llbracket1;\mathsf{N}\rrbracket},%
146 \noindent where + and . are the Boolean addition and product operations.
147 Consider the phase space:
149 \mathcal{X} = \llbracket 1 ; \mathsf{N} \rrbracket^\mathds{N} \times
150 \mathds{B}^\mathsf{N},
152 \noindent and the map defined on $\mathcal{X}$:
154 G_f\left(S,E\right) = \left(\sigma(S), F_f(i(S),E)\right), \label{Gf}
156 \noindent where $\sigma$ is the \emph{shift} function defined by $\sigma (S^{n})_{n\in \mathds{N}}\in \mathbb{S}\longrightarrow (S^{n+1})_{n\in \mathds{N}}\in \mathbb{S}$ and $i$ is the \emph{initial function} $i:(S^{n})_{n\in \mathds{N}} \in \mathbb{S}\longrightarrow S^{0}\in \llbracket 1;\mathsf{N}\rrbracket$. Then the chaotic iterations defined in (\ref{sec:chaotic iterations}) can be described by the following iterations:
160 X^0 \in \mathcal{X} \\
166 With this formulation, a shift function appears as a component of chaotic iterations. The shift function is a famous example of a chaotic map~\cite{Devaney} but its presence is not sufficient enough to claim $G_f$ as chaotic.
168 To study this claim, a new distance between two points $X = (S,E), Y = (\check{S},\check{E})\in
169 \mathcal{X}$ has been introduced in \cite{guyeux10} as follows:
171 d(X,Y)=d_{e}(E,\check{E})+d_{s}(S,\check{S}),
177 \displaystyle{d_{e}(E,\check{E})} & = & \displaystyle{\sum_{k=1}^{\mathsf{N}%
178 }\delta (E_{k},\check{E}_{k})}, \\
179 \displaystyle{d_{s}(S,\check{S})} & = & \displaystyle{\dfrac{9}{\mathsf{N}}%
180 \sum_{k=1}^{\infty }\dfrac{|S^k-\check{S}^k|}{10^{k}}}.%
186 This new distance has been introduced to satisfy the following requirements.
188 \item When the number of different cells between two systems is increasing, then their distance should increase too.
189 \item In addition, if two systems present the same cells and their respective strategies start with the same terms, then the distance between these two points must be small because the evolution of the two systems will be the same for a while. Indeed, the two dynamical systems start with the same initial condition, use the same update function, and as strategies are the same for a while, then components that are updated are the same too.
191 The distance presented above follows these recommendations. Indeed, if the floor value $\lfloor d(X,Y)\rfloor $ is equal to $n$, then the systems $E, \check{E}$ differ in $n$ cells. In addition, $d(X,Y) - \lfloor d(X,Y) \rfloor $ is a measure of the differences between strategies $S$ and $\check{S}$. More precisely, this floating part is less than $10^{-k}$ if and only if the first $k$ terms of the two strategies are equal. Moreover, if the $k^{th}$ digit is nonzero, then the $k^{th}$ terms of the two strategies are different.
193 Finally, it has been established in \cite{guyeux10} that,
196 Let $f$ be a map from $\mathds{B}^n$ to itself. Then $G_{f}$ is continuous in the metric space $(\mathcal{X},d)$.
199 The chaotic property of $G_f$ has been firstly established for the vectorial Boolean negation \cite{guyeux10}. To obtain a characterization, we have secondly introduced the notion of asynchronous iteration graph recalled bellow.
201 Let $f$ be a map from $\mathds{B}^n$ to itself. The
202 {\emph{asynchronous iteration graph}} associated with $f$ is the
203 directed graph $\Gamma(f)$ defined by: the set of vertices is
204 $\mathds{B}^n$; for all $x\in\mathds{B}^n$ and $i\in \llbracket1;n\rrbracket$,
205 the graph $\Gamma(f)$ contains an arc from $x$ to $F_f(i,x)$.
206 The relation between $\Gamma(f)$ and $G_f$ is clear: there exists a
207 path from $x$ to $x'$ in $\Gamma(f)$ if and only if there exists a
208 strategy $s$ such that the parallel iteration of $G_f$ from the
209 initial point $(s,x)$ reaches the point $x'$.
211 We have finally proven in \cite{FCT11} that,
215 \label{Th:Caractérisation des IC chaotiques}
216 Let $f:\mathds{B}^n\to\mathds{B}^n$. $G_f$ is chaotic (according to Devaney)
217 if and only if $\Gamma(f)$ is strongly connected.
220 This result of chaos has lead us to study the possibility to build a pseudo-random number generator (PRNG) based on the chaotic iterations.
221 As $G_f$, defined on the domain $\llbracket 1 ; n \rrbracket^{\mathds{N}} \times \mathds{B}^n$, is build from Boolean networks $f : \mathds{B}^n \rightarrow \mathds{B}^n$, we can preserve the theoretical properties on $G_f$ during implementations (due to the discrete nature of $f$). It is as if $\mathds{B}^n$ represents the memory of the computer whereas $\llbracket 1 ; n \rrbracket^{\mathds{N}}$ is its input stream (the seeds, for instance).
223 \section{Application to Pseudo-Randomness}
225 We have proposed in~\cite{bgw09:ip} a new family of generators that receives
226 two PRNGs as inputs. These two generators are mixed with chaotic iterations,
227 leading thus to a new PRNG that improves the statistical properties of each
228 generator taken alone. Furthermore, our generator
229 possesses various chaos properties
230 that none of the generators used as input present.
232 \begin{algorithm}[h!]
234 \KwIn{a function $f$, an iteration number $b$, an initial configuration $x^0$ ($n$ bits)}
235 \KwOut{a configuration $x$ ($n$ bits)}
237 $k\leftarrow b + \textit{XORshift}(b+1)$\;
238 \For{$i=0,\dots,k-1$}
240 $s\leftarrow{\textit{XORshift}(n)}$\;
241 $x\leftarrow{F_f(s,x)}$\;
245 \caption{PRNG with chaotic functions}
249 \begin{algorithm}[h!]
250 %\SetAlgoLined %%RAPH: cette ligne provoque une erreur chez moi
251 \KwIn{the internal configuration $z$ (a 32-bit word)}
252 \KwOut{$y$ (a 32-bit word)}
253 $z\leftarrow{z\oplus{(z\ll13)}}$\;
254 $z\leftarrow{z\oplus{(z\gg17)}}$\;
255 $z\leftarrow{z\oplus{(z\ll5)}}$\;
259 \caption{An arbitrary round of \textit{XORshift} algorithm}
267 This generator is synthesized in Algorithm~\ref{CI Algorithm}.
268 It takes as input: a function $f$;
269 an integer $b$, ensuring that the number of executed iterations is at least $b$ and at most $2b+1$; and an initial configuration $x^0$.
270 It returns the new generated configuration $x$. Internally, it embeds two
271 \textit{XORshift}$(k)$ PRNGs \cite{Marsaglia2003} that returns integers uniformly distributed
272 into $\llbracket 1 ; k \rrbracket$.
273 \textit{XORshift} is a category of very fast PRNGs designed by George Marsaglia, which repeatedly uses the transform of exclusive or (XOR, $\oplus$) on a number with a bit shifted version of it. This PRNG, which has a period of $2^{32}-1=4.29\times10^9$, is summed up in Algorithm~\ref{XORshift}. It is used in our PRNG to compute the strategy length and the strategy elements.
276 We have proven in \cite{FCT11} that,
279 Let $f: \mathds{B}^{n} \rightarrow \mathds{B}^{n}$, $\Gamma(f)$ its
280 iteration graph, $\check{M}$ its adjacency
281 matrix and $M$ a $n\times n$ matrix defined as in the previous lemma.
282 If $\Gamma(f)$ is strongly connected, then
283 the output of the PRNG detailed in Algorithm~\ref{CI Algorithm} follows
284 a law that tends to the uniform distribution
285 if and only if $M$ is a double stochastic matrix.
290 \alert{Mettre encore un peu de blabla sur le PRNG, puis enchaîner en disant que, ok, on peut préserver le chaos quand on passe sur machine, mais que le chaos dont il s'agit a été prouvé pour une distance bizarroïde sur un espace non moins hémoroïde, d'où ce qui suit}
294 \section{The relativity of disorder}
295 \label{sec:de la relativité du désordre}
297 \subsection{Impact of the topology's finenesse}
299 Let us firstly introduce the following notations.
302 $\mathcal{X}_\tau$ will denote the topological space $\left(\mathcal{X},\tau\right)$, whereas $\mathcal{V}_\tau (x)$ will be the set of all the neighborhoods of $x$ when considering the topology $\tau$ (or simply $\mathcal{V} (x)$, if there is no ambiguity).
308 \label{Th:chaos et finesse}
309 Let $\mathcal{X}$ a set and $\tau, \tau'$ two topologies on $\mathcal{X}$ s.t. $\tau'$ is finer than $\tau$. Let $f:\mathcal{X} \to \mathcal{X}$, continuous both for $\tau$ and $\tau'$.
311 If $(\mathcal{X}_{\tau'},f)$ is chaotic according to Devaney, then $(\mathcal{X}_\tau,f)$ is chaotic too.
315 Let us firstly establish the transitivity of $(\mathcal{X}_\tau,f)$.
317 Let $\omega_1, \omega_2$ two open sets of $\tau$. Then $\omega_1, \omega_2 \in \tau'$, becaus $\tau'$ is finer than $\tau$. As $f$ is $\tau'-$transitive, we can deduce that $\exists n \in \mathds{N}, \omega_1 \cap f^{(n)}(\omega_2) = \varnothing$. Consequently, $f$ is $\tau-$transitive.
319 Let us now consider the regularity of $(\mathcal{X}_\tau,f)$, \emph{i.e.}, for all $x \in \mathcal{X}$, and for all $\tau-$neighborhood $V$ of $x$, there is a periodic point for $f$ into $V$.
321 Let $x \in \mathcal{X}$ and $V \in \mathcal{V}_\tau (x)$ a $\tau-$neighborhood of $x$. By definition, $\exists \omega \in \tau, x \in \omega \subset V$.
323 But $\tau \subset \tau'$, so $\omega \in \tau'$, and then $V \in \mathcal{V}_{\tau'} (x)$. As $(\mathcal{X}_{\tau'},f)$ is regular, there is a periodic point for $f$ into $V$, and the regularity of $(\mathcal{X}_\tau,f)$ is proven.
326 \subsection{A given system can always be claimed as chaotic}
328 Let $f$ an iteration function on $\mathcal{X}$ having at least a fixed point. Then this function is chaotic (in a certain way):
331 Let $\mathcal{X}$ a nonempty set and $f: \mathcal{X} \to \X$ a function having at least a fixed point.
332 Then $f$ is $\tau_0-$chaotic, where $\tau_0$ is the trivial (indiscrete) topology on $\X$.
337 $f$ is transitive when $\forall \omega, \omega' \in \tau_0 \setminus \{\varnothing\}, \exists n \in \mathds{N}, f^{(n)}(\omega) \cap \omega' \neq \varnothing$.
338 As $\tau_0 = \left\{ \varnothing, \X \right\}$, this is equivalent to look for an integer $n$ s.t. $f^{(n)}\left( \X \right) \cap \X \neq \varnothing$. For instance, $n=0$ is appropriate.
340 Let us now consider $x \in \X$ and $V \in \mathcal{V}_{\tau_0} (x)$. Then $V = \mathcal{X}$, so $V$ has at least a fixed point for $f$. Consequently $f$ is regular, and the result is established.
346 \subsection{A given system can always be claimed as non-chaotic}
349 Let $\mathcal{X}$ be a set and $f: \mathcal{X} \to \X$.
350 If $\X$ is infinite, then $\left( \X_{\tau_\infty}, f\right)$ is not chaotic (for the Devaney's formulation), where $\tau_\infty$ is the discrete topology.
354 Let us prove it by contradiction, assuming that $\left(\X_{\tau_\infty}, f\right)$ is both transitive and regular.
356 Let $x \in \X$ and $\{x\}$ one of its neighborhood. This neighborhood must contain a periodic point for $f$, if we want that $\left(\X_{\tau_\infty}, f\right)$ is regular. Then $x$ must be a periodic point of $f$.
358 Let $I_x = \left\{ f^{(n)}(x), n \in \mathds{N}\right\}$. This set is finite because $x$ is periodic, and $\mathcal{X}$ is infinite, then $\exists y \in \mathcal{X}, y \notin I_x$.
360 As $\left(\X_{\tau_\infty}, f\right)$ must be transitive, for all open nonempty sets $A$ and $B$, an integer $n$ must satisfy $f^{(n)}(A) \cap B \neq \varnothing$. However $\{x\}$ and $\{y\}$ are open sets and $y \notin I_x \Rightarrow \forall n, f^{(n)}\left( \{x\} \right) \cap \{y\} = \varnothing$.
368 \section{Chaos on the order topology}
370 \subsection{The phase space is an interval of the real line}
372 \subsubsection{Toward a topological semiconjugacy}
374 In what follows, our intention is to establish, by using a topological semiconjugacy, that chaotic iterations over $\mathcal{X}$ can be described as iterations on a real interval. To do so, we must firstly introduce some notations and terminologies.
376 Let $\mathcal{S}_\mathsf{N}$ be the set of sequences belonging into $\llbracket 1; \mathsf{N}\rrbracket$ and $\mathcal{X}_{\mathsf{N}} = \mathcal{S}_\mathsf{N} \times \B^\mathsf{N}$.
380 The function $\varphi: \mathcal{S}_{10} \times\mathds{B}^{10} \rightarrow \big[ 0, 2^{10} \big[$ is defined by:
383 \varphi: & \mathcal{X}_{10} = \mathcal{S}_{10} \times\mathds{B}^{10}& \longrightarrow & \big[ 0, 2^{10} \big[ \\
384 & (S,E) = \left((S^0, S^1, \hdots ); (E_0, \hdots, E_9)\right) & \longmapsto & \varphi \left((S,E)\right)
387 \noindent where $\varphi\left((S,E)\right)$ is the real number:
389 \item whose integral part $e$ is $\displaystyle{\sum_{k=0}^9 2^{9-k} E_k}$, that is, the binary digits of $e$ are $E_0 ~ E_1 ~ \hdots ~ E_9$.
390 \item whose decimal part $s$ is equal to $s = 0,S^0~ S^1~ S^2~ \hdots = \sum_{k=1}^{+\infty} 10^{-k} S^{k-1}.$
396 $\varphi$ realizes the association between a point of $\mathcal{X}_{10}$ and a real number into $\big[ 0, 2^{10} \big[$. We must now translate the chaotic iterations $\Go$ on this real interval. To do so, two intermediate functions over $\big[ 0, 2^{10} \big[$ must be introduced:
401 Let $x \in \big[ 0, 2^{10} \big[$ and:
403 \item $e_0, \hdots, e_9$ the binary digits of the integral part of $x$: $\displaystyle{\lfloor x \rfloor = \sum_{k=0}^{9} 2^{9-k} e_k}$.
404 \item $(s^k)_{k\in \mathds{N}}$ the digits of $x$, where the chosen decimal decomposition of $x$ is the one that does not have an infinite number of 9:
405 $\displaystyle{x = \lfloor x \rfloor + \sum_{k=0}^{+\infty} s^k 10^{-k-1}}$.
407 $e$ and $s$ are thus defined as follows:
410 e: & \big[ 0, 2^{10} \big[ & \longrightarrow & \mathds{B}^{10} \\
411 & x & \longmapsto & (e_0, \hdots, e_9)
417 s: & \big[ 0, 2^{10} \big[ & \longrightarrow & \llbracket 0, 9 \rrbracket^{\mathds{N}} \\
418 & x & \longmapsto & (s^k)_{k \in \mathds{N}}
423 We are now able to define the function $g$, whose goal is to translate the chaotic iterations $\Go$ on an interval of $\mathds{R}$.
426 $g:\big[ 0, 2^{10} \big[ \longrightarrow \big[ 0, 2^{10} \big[$ is defined by:
429 g: & \big[ 0, 2^{10} \big[ & \longrightarrow & \big[ 0, 2^{10} \big[ \\
431 & x & \longmapsto & g(x)
434 \noindent where g(x) is the real number of $\big[ 0, 2^{10} \big[$ defined bellow:
436 \item its integral part has a binary decomposition equal to $e_0', \hdots, e_9'$, with:
440 e(x)_i & \textrm{ if } i \neq s^0\\
441 e(x)_i + 1 \textrm{ (mod 2)} & \textrm{ if } i = s^0\\
445 \item whose decimal part is $s(x)^1, s(x)^2, \hdots$
452 In other words, if $x = \displaystyle{\sum_{k=0}^{9} 2^{9-k} e_k + \sum_{k=0}^{+\infty} s^{k} ~10^{-k-1}}$, then: $$g(x) = \displaystyle{\sum_{k=0}^{9} 2^{9-k} (e_k + \delta(k,s^0) \textrm{ (mod 2)}) + \sum_{k=0}^{+\infty} s^{k+1} 10^{-k-1}}.$$
454 \subsubsection{Defining a metric on $\big[ 0, 2^{10} \big[$}
456 Numerous metrics can be defined on the set $\big[ 0, 2^{10} \big[$, the most usual one being the Euclidian distance recalled bellow:
459 \index{distance!euclidienne}
460 $\Delta$ is the Euclidian distance on $\big[ 0, 2^{10} \big[$, that is, $\Delta(x,y) = |y-x|^2$.
465 This Euclidian distance does not reproduce exactly the notion of proximity induced by our first distance $d$ on $\X$. Indeed $d$ is finer than $\Delta$. This is the reason why we have to introduce the following metric:
470 Let $x,y \in \big[ 0, 2^{10} \big[$.
471 $D$ denotes the function from $\big[ 0, 2^{10} \big[^2$ to $\mathds{R}^+$ defined by: $D(x,y) = D_e\left(e(x),e(y)\right) + D_s\left(s(x),s(y)\right)$, where:
473 $\displaystyle{D_e(E,\check{E}) = \sum_{k=0}^\mathsf{9} \delta (E_k, \check{E}_k)}$, ~~and~ $\displaystyle{D_s(S,\check{S}) = \sum_{k = 1}^\infty \dfrac{|S^k-\check{S}^k|}{10^k}}$.
478 $D$ is a distance on $\big[ 0, 2^{10} \big[$.
482 The three axioms defining a distance must be checked.
484 \item $D \geqslant 0$, because everything is positive in its definition. If $D(x,y)=0$, then $D_e(x,y)=0$, so the integral parts of $x$ and $y$ are equal (they have the same binary decomposition). Additionally, $D_s(x,y) = 0$, then $\forall k \in \mathds{N}^*, s(x)^k = s(y)^k$. In other words, $x$ and $y$ have the same $k-$th decimal digit, $\forall k \in \mathds{N}^*$. And so $x=y$.
485 \item $D(x,y)=D(y,x)$.
486 \item Finally, the triangular inequality is obtained due to the fact that both $\delta$ and $\Delta(x,y)=|x-y|$ satisfy it.
491 The convergence of sequences according to $D$ is not the same than the usual convergence related to the Euclidian metric. For instance, if $x^n \to x$ according to $D$, then necessarily the integral part of each $x^n$ is equal to the integral part of $x$ (at least after a given threshold), and the decimal part of $x^n$ corresponds to the one of $x$ ``as far as required''.
492 To illustrate this fact, a comparison between $D$ and the Euclidian distance is given Figure \ref{fig:comparaison de distances}. These illustrations show that $D$ is richer and more refined than the Euclidian distance, and thus is more precise.
497 \subfigure[Function $x \to dist(x;1,234) $ on the interval $(0;5)$.]{\includegraphics[scale=.35]{DvsEuclidien.pdf}}\quad
498 \subfigure[Function $x \to dist(x;3) $ on the interval $(0;5)$.]{\includegraphics[scale=.35]{DvsEuclidien2.pdf}}
500 \caption{Comparison between $D$ (in blue) and the Euclidian distane (in green).}
501 \label{fig:comparaison de distances}
507 \subsubsection{The semiconjugacy}
509 It is now possible to define a topological semiconjugacy between $\mathcal{X}$ and an interval of $\mathds{R}$:
512 Chaotic iterations on the phase space $\mathcal{X}$ are simple iterations on $\mathds{R}$, which is illustrated by the semiconjugacy of the diagram bellow:
515 \left(~\mathcal{S}_{10} \times\mathds{B}^{10}, d~\right) @>G_{f_0}>> \left(~\mathcal{S}_{10} \times\mathds{B}^{10}, d~\right)\\
516 @V{\varphi}VV @VV{\varphi}V\\
517 \left( ~\big[ 0, 2^{10} \big[, D~\right) @>>g> \left(~\big[ 0, 2^{10} \big[, D~\right)
523 $\varphi$ has been constructed in order to be continuous and onto.
526 In other words, $\mathcal{X}$ is approximately equal to $\big[ 0, 2^\mathsf{N} \big[$.
533 \subsection{Study of the chaotic iterations described as a real function}
538 \subfigure[ICs on the interval $(0,9;1)$.]{\includegraphics[scale=.35]{ICs09a1.pdf}}\quad
539 \subfigure[ICs on the interval $(0,7;1)$.]{\includegraphics[scale=.35]{ICs07a95.pdf}}\\
540 \subfigure[ICs on the interval $(0,5;1)$.]{\includegraphics[scale=.35]{ICs05a1.pdf}}\quad
541 \subfigure[ICs on the interval $(0;1)$]{\includegraphics[scale=.35]{ICs0a1.pdf}}
543 \caption{Representation of the chaotic iterations.}
552 \subfigure[ICs on the interval $(510;514)$.]{\includegraphics[scale=.35]{ICs510a514.pdf}}\quad
553 \subfigure[ICs on the interval $(1000;1008)$]{\includegraphics[scale=.35]{ICs1000a1008.pdf}}
555 \caption{ICs on small intervals.}
561 \subfigure[ICs on the interval $(0;16)$.]{\includegraphics[scale=.3]{ICs0a16.pdf}}\quad
562 \subfigure[ICs on the interval $(40;70)$.]{\includegraphics[scale=.45]{ICs40a70.pdf}}\quad
564 \caption{General aspect of the chaotic iterations.}
569 We have written a Python program to represent the chaotic iterations with the vectorial negation on the real line $\mathds{R}$. Various representations of these CIs are given in Figures \ref{fig:ICs}, \ref{fig:ICs2} and \ref{fig:ICs3}. It can be remarked that the function $g$ is a piecewise linear function: it is linear on each interval having the form $\left[ \dfrac{n}{10}, \dfrac{n+1}{10}\right[$, $n \in \llbracket 0;2^{10}\times 10 \rrbracket$ and its slope is equal to 10. Let us justify these claims:
572 \label{Prop:derivabilite des ICs}
573 Chaotic iterations $g$ defined on $\mathds{R}$ have derivatives of all orders on $\big[ 0, 2^{10} \big[$, except on the 10241 points in $I$ defined by $\left\{ \dfrac{n}{10} ~\big/~ n \in \llbracket 0;2^{10}\times 10\rrbracket \right\}$.
575 Furthermore, on each interval of the form $\left[ \dfrac{n}{10}, \dfrac{n+1}{10}\right[$, with $n \in \llbracket 0;2^{10}\times 10 \rrbracket$, $g$ is a linear function, having a slope equal to 10: $\forall x \notin I, g'(x)=10$.
580 Let $I_n = \left[ \dfrac{n}{10}, \dfrac{n+1}{10}\right[$, with $n \in \llbracket 0;2^{10}\times 10 \rrbracket$. All the points of $I_n$ have the same integral prat $e$ and the same decimal part $s^0$: on the set $I_n$, functions $e(x)$ and $x \mapsto s(x)^0$ of Definition \ref{def:e et s} only depend on $n$. So all the images $g(x)$ of these points $x$:
582 \item Have the same integral part, which is $e$, except probably the bit number $s^0$. In other words, this integer has approximately the same binary decomposition than $e$, the sole exception being the digit $s^0$ (this number is then either $e+2^{10-s^0}$ or $e-2^{10-s^0}$, depending on the parity of $s^0$, \emph{i.e.}, it is equal to $e+(-1)^{s^0}\times 2^{10-s^0}$).
583 \item A shift to the left has been applied to the decimal part $y$, losing by doing so the common first digit $s^0$. In other words, $y$ has been mapped into $10\times y - s^0$.
585 To sum up, the action of $g$ on the points of $I$ is as follows: first, make a multiplication by 10, and second, add the same constant to each term, which is $\dfrac{1}{10}\left(e+(-1)^{s^0}\times 2^{10-s^0}\right)-s^0$.
589 Finally, chaotic iterations are elements of the large family of functions that are both chaotic and piecewise linear (like the tent map).
594 \subsection{Comparison of the two metrics on $\big[ 0, 2^\mathsf{N} \big[$}
596 The two propositions bellow allow to compare our two distances on $\big[ 0, 2^\mathsf{N} \big[$:
599 Id: $\left(~\big[ 0, 2^\mathsf{N} \big[,\Delta~\right) \to \left(~\big[ 0, 2^\mathsf{N} \big[, D~\right)$ is not continuous.
603 The sequence $x^n = 1,999\hdots 999$ constituted by $n$ 9 as decimal part, is such that:
605 \item $\Delta (x^n,2) \to 0.$
606 \item But $D(x^n,2) \geqslant 1$, then $D(x^n,2)$ does not converge to 0.
609 The sequential characterization of the continuity concludes the demonstration.
617 Id: $\left(~\big[ 0, 2^\mathsf{N} \big[,D~\right) \to \left(~\big[ 0, 2^\mathsf{N} \big[, \Delta ~\right)$ is a continuous fonction.
621 If $D(x^n,x) \to 0$, then $D_e(x^n,x) = 0$ at least for $n$ larger than a given threshold, because $D_e$ only returns integers. So, after this threshold, the integral parts of all the $x^n$ are equal to the integral part of $x$.
623 Additionally, $D_s(x^n, x) \to 0$, then $\forall k \in \mathds{N}^*, \exists N_k \in \mathds{N}, n \geqslant N_k \Rightarrow D_s(x^n,x) \leqslant 10^{-k}$. This means that for all $k$, an index $N_k$ can be found such that, $\forall n \geqslant N_k$, all the $x^n$ have the same $k$ firsts digits, which are the digits of $x$. We can deduce the convergence $\Delta(x^n,x) \to 0$, and thus the result.
626 The conclusion of these propositions is that the proposed metric is more precise than the Euclidian distance, that is:
629 $D$ is finer than the Euclidian distance $\Delta$.
632 This corollary can be reformulated as follows:
635 \item The topology produced by $\Delta$ is a subset of the topology produced by $D$.
636 \item $D$ has more open sets than $\Delta$.
637 \item It is harder to converge for the topology $\tau_D$ inherited by $D$, than to converge with the one inherited by $\Delta$, which is denoted here by $\tau_\Delta$.
641 \subsection{Chaos of the chaotic iterations on $\mathds{R}$}
642 \label{chpt:Chaos des itérations chaotiques sur R}
646 \subsubsection{Chaos according to Devaney}
648 We have recalled previously that the chaotic iterations $\left(\Go, \mathcal{X}_d\right)$ are chaotic according to the formulation of Devaney. We can deduce that they are chaotic on $\mathds{R}$ too, when considering the order topology, because:
650 \item $\left(\Go, \mathcal{X}_d\right)$ and $\left(g, \big[ 0, 2^{10} \big[_D\right)$ are semiconjugate by $\varphi$,
651 \item Then $\left(g, \big[ 0, 2^{10} \big[_D\right)$ is a system chaotic according to Devaney, because the semiconjugacy preserve this character.
652 \item But the topology generated by $D$ is finer than the topology generated by the Euclidian distance $\Delta$ -- which is the order topology.
653 \item According to Theorem \ref{Th:chaos et finesse}, we can deduce that the chaotic iterations $g$ are indeed chaotic, as defined by Devaney, for the order topology on $\mathds{R}$.
656 This result can be formulated as follows.
659 \label{th:IC et topologie de l'ordre}
660 The chaotic iterations $g$ on $\mathds{R}$ are chaotic according to the Devaney's formulation, when $\mathds{R}$ has his usual topology, which is the order topology.
663 Indeed this result is weaker than the theorem establishing the chaos for the finer topology $d$. However the Theorem \ref{th:IC et topologie de l'ordre} still remains important. Indeed, we have studied in our previous works a set different from the usual set of study ($\mathcal{X}$ instead of $\mathds{R}$), in order to be as close as possible from the computer: the properties of disorder proved theoretically will then be preserved when computing. However, we could wonder whether this change does not lead to a disorder of a lower quality. In other words, have we replaced a situation of a good disorder lost when computing, to another situation of a disorder preserved but of bad quality. Theorem \ref{th:IC et topologie de l'ordre} prove exactly the contrary.
668 \section{Efficient prng based on chaotic iterations}
670 In order to implement efficiently a PRNG based on chaotic iterations it is
671 possible to improve previous works [ref]. One solution consists in considering
672 that the strategy used contains all the bits for which the negation is
673 achieved out. Then in order to apply the negation on these bits we can simply
674 apply the xor operator between the current number and the strategy. In
675 order to obtain the strategy we also use a classical PRNG.
677 Here is an example with 16-bits numbers showing how the bitwise operations are
678 applied. Suppose that $x$ and the strategy $S^i$ are defined in binary mode.
679 Then the following table shows the result of $x$ xor $S^i$.
681 \begin{array}{|cc|cccccccccccccccc|}
683 x &=&1&0&1&1&1&0&1&0&1&0&0&1&0&0&1&0\\
685 S^i &=&0&1&1&0&0&1&1&0&1&1&1&0&0&1&1&1\\
687 x \oplus S^i&=&1&1&0&1&1&1&0&0&0&1&1&1&0&1&0&1\\
694 %% \begin{figure}[htbp]
697 %% \begin{minipage}{14cm}
698 %% unsigned int CIprng() \{\\
699 %% static unsigned int x = 123123123;\\
700 %% unsigned long t1 = xorshift();\\
701 %% unsigned long t2 = xor128();\\
702 %% unsigned long t3 = xorwow();\\
703 %% x = x\textasciicircum (unsigned int)t1;\\
704 %% x = x\textasciicircum (unsigned int)(t2$>>$32);\\
705 %% x = x\textasciicircum (unsigned int)(t3$>>$32);\\
706 %% x = x\textasciicircum (unsigned int)t2;\\
707 %% x = x\textasciicircum (unsigned int)(t1$>>$32);\\
708 %% x = x\textasciicircum (unsigned int)t3;\\
714 %% \caption{sequential Chaotic Iteration PRNG}
715 %% \label{algo:seqCIprng}
720 \lstset{language=C,caption={C code of the sequential chaotic iterations based PRNG},label=algo:seqCIprng}
722 unsigned int CIprng() {
723 static unsigned int x = 123123123;
724 unsigned long t1 = xorshift();
725 unsigned long t2 = xor128();
726 unsigned long t3 = xorwow();
727 x = x^(unsigned int)t1;
728 x = x^(unsigned int)(t2>>32);
729 x = x^(unsigned int)(t3>>32);
730 x = x^(unsigned int)t2;
731 x = x^(unsigned int)(t1>>32);
732 x = x^(unsigned int)t3;
741 In listing~\ref{algo:seqCIprng} a sequential version of our chaotic iterations
742 based PRNG is presented. The xor operator is represented by
743 \textasciicircum. This function uses three classical 64-bits PRNG: the
744 \texttt{xorshift}, the \texttt{xor128} and the \texttt{xorwow}. In the
745 following, we call them xor-like PRNGSs. These three PRNGs are presented
746 in~\cite{Marsaglia2003}. As each xor-like PRNG used works with 64-bits and as
747 our PRNG works with 32-bits, the use of \texttt{(unsigned int)} selects the 32
748 least significant bits whereas \texttt{(unsigned int)(t3$>>$32)} selects the 32
749 most significants bits of the variable \texttt{t}. So to produce a random
750 number realizes 6 xor operations with 6 32-bits numbers produced by 3 64-bits
751 PRNG. This version successes the BigCrush of the TestU01 battery [P. L’ecuyer
752 and R. Simard. Testu01].
754 \section{Efficient prng based on chaotic iterations on GPU}
756 In order to benefit from computing power of GPU, a program needs to define
757 independent blocks of threads which can be computed simultaneously. In general,
758 the larger the number of threads is, the more local memory is used and the less
759 branching instructions are used (if, while, ...), the better performance is
760 obtained on GPU. So with algorithm \ref{algo:seqCIprng} presented in the
761 previous section, it is possible to build a similar program which computes PRNG
762 on GPU. In the CUDA [ref] environment, threads have a local identificator,
763 called \texttt{ThreadIdx} relative to the block containing them.
766 \subsection{Naive version for GPU}
768 From the CPU version, it is possible to obtain a quite similar version for GPU.
769 The principe consists in assigning the computation of a PRNG as in sequential to
770 each thread of the GPU. Of course, it is essential that the three xor-like
771 PRNGs used for our computation have different parameters. So we chose them
772 randomly with another PRNG. As the initialisation is performed by the CPU, we
773 have chosen to use the ISAAC PRNG [ref] to initalize all the parameters for the
774 GPU version of our PRNG. The implementation of the three xor-like PRNGs is
775 straightforward as soon as their parameters have been allocated in the GPU
776 memory. Each xor-like PRNGs used works with an internal number $x$ which keeps
777 the last generated random numbers. Other internal variables are also used by the
778 xor-like PRNGs. More precisely, the implementation of the xor128, the xorshift
779 and the xorwow respectively require 4, 5 and 6 unsigned long as internal
784 \KwIn{InternalVarXorLikeArray: array with internal variables of the 3 xor-like PRNGs in global memory\;
785 NumThreads: Number of threads\;}
786 \KwOut{NewNb: array containing random numbers in global memory}
787 \If{threadIdx is concerned by the computation} {
788 retrieve data from InternalVarXorLikeArray[threadIdx] in local variables\;
790 compute a new PRNG as in Listing\ref{algo:seqCIprng}\;
791 store the new PRNG in NewNb[NumThreads*threadIdx+i]\;
793 store internal variables in InternalVarXorLikeArray[threadIdx]\;
796 \caption{main kernel for the chaotic iterations based PRNG GPU naive version}
797 \label{algo:gpu_kernel}
800 Algorithm~\ref{algo:gpu_kernel} presents a naive implementation of PRNG using
801 GPU. According to the available memory in the GPU and the number of threads
802 used simultenaously, the number of random numbers that a thread can generate
803 inside a kernel is limited, i.e. the variable \texttt{n} in
804 algorithm~\ref{algo:gpu_kernel}. For example, if $100,000$ threads are used and
805 if $n=100$\footnote{in fact, we need to add the initial seed (a 32-bits number)}
806 then the memory required to store internals variables of xor-like
807 PRNGs\footnote{we multiply this number by $2$ in order to count 32-bits numbers}
808 and random number of our PRNG is equals to $100,000\times ((4+5+6)\times
809 2+(1+100))=1,310,000$ 32-bits numbers, i.e. about $52$Mb.
811 All the tests performed to pass the BigCrush of TestU01 succeeded. Different
812 number of threads, called \texttt{NumThreads} in our algorithm, have been tested
816 Algorithm~\ref{algo:gpu_kernel} has the advantage to manipulate independent
817 PRNGs, so this version is easily usable on a cluster of computer. The only thing
818 to ensure is to use a single ISAAC PRNG. For this, a simple solution consists in
819 using a master node for the initialization which computes the initial parameters
820 for all the differents nodes involves in the computation.
823 \subsection{Improved version for GPU}
825 As GPU cards using CUDA have shared memory between threads of the same block, it
826 is possible to use this feature in order to simplify the previous algorithm,
827 i.e. using less than 3 xor-like PRNGs. The solution consists in computing only
828 one xor-like PRNG by thread, saving it into shared memory and using the results
829 of some other threads in the same block of threads. In order to define which
830 thread uses the result of which other one, we can use a permutation array which
831 contains the indexes of all threads and for which a permutation has been
832 performed. In Algorithm~\ref{algo:gpu_kernel2}, 2 permutations arrays are used.
833 The variable \texttt{offset} is computed using the value of
834 \texttt{permutation\_size}. Then we can compute \texttt{o1} and \texttt{o2}
835 which represent the indexes of the other threads for which the results are used
836 by the current thread. In the algorithm, we consider that a 64-bits xor-like
837 PRNG is used, that is why both 32-bits parts are used.
839 This version also succeed to the BigCrush batteries of tests.
843 \KwIn{InternalVarXorLikeArray: array with internal variables of 1 xor-like PRNGs in global memory\;
844 NumThreads: Number of threads\;
845 tab1, tab2: Arrays containing permutations of size permutation\_size\;}
847 \KwOut{NewNb: array containing random numbers in global memory}
848 \If{threadId is concerned} {
849 retrieve data from InternalVarXorLikeArray[threadId] in local variables\;
850 offset = threadIdx\%permutation\_size\;
851 o1 = threadIdx-offset+tab1[offset]\;
852 o2 = threadIdx-offset+tab2[offset]\;
855 shared\_mem[threadId]=(unsigned int)t\;
856 x = x $\oplus$ (unsigned int) t\;
857 x = x $\oplus$ (unsigned int) (t>>32)\;
858 x = x $\oplus$ shared[o1]\;
859 x = x $\oplus$ shared[o2]\;
861 store the new PRNG in NewNb[NumThreads*threadId+i]\;
863 store internal variables in InternalVarXorLikeArray[threadId]\;
866 \caption{main kernel for the chaotic iterations based PRNG GPU efficient version}
867 \label{algo:gpu_kernel2}
872 \section{Experiments}
874 Differents experiments have been performed in order to measure the generation speed.
877 \includegraphics[scale=.5]{curve_time_gpu.pdf}
880 \caption{Number of random numbers generated per second}
881 \label{fig:time_naive_gpu}
884 First of all we have compared the time to generate X random numbers with both the CPU version and the GPU version.
886 Faire une courbe du nombre de random en fonction du nombre de threads, éventuellement en fonction du nombres de threads par bloc.
890 \bibliographystyle{plain}
891 \bibliography{mabase}