ourapproach.tex

   1 This section first presents the embedding scheme through its
   2 four main steps: the data encryption (Sect.~\ref{sub:bbs}),
   3 the cover pixel selection (Sect.~\ref{sub:edge}),
   4 the adaptive payload considerations (Sect.~\ref{sub:adaptive}),
   5 and how the distortion has been minimized (Sect.~\ref{sub:stc}).
   6 The message extraction is then presented  (Sect.~\ref{sub:extract}) while a running example ends this section (Sect.~\ref{sub:xpl}).
   7
   8
   9 The flowcharts given in Fig.~\ref{fig:sch}
  10 summarize our steganography scheme denoted by
  11 STABYLO, which stands for STe\-ga\-no\-gra\-phy with
  12 Adaptive, Bbs, binarY embedding at LOw cost.
  13 What follows are successively some details of the inner steps and the flows both inside
  14  the embedding stage (Fig.~\ref{fig:sch:emb})
  15 and inside the extraction one (Fig.~\ref{fig:sch:ext}).
  16 Let us first focus on the data embedding.
  17
  18 \begin{figure*}%[t]
  19   \begin{center}
  20     \subfloat[Data Embedding]{
  21       \begin{minipage}{0.4\textwidth}
  22         \begin{center}
  23             %\includegraphics[scale=0.45]{emb}
  24             \includegraphics[scale=0.4]{emb}
  25         \end{center}
  26       \end{minipage}
  27       \label{fig:sch:emb}
  28     }
  29 \hfill
  30     \subfloat[Data Extraction]{
  31       \begin{minipage}{0.49\textwidth}
  32         \begin{center}
  33             \includegraphics[scale=0.4]{dec}
  34         \end{center}
  35       \end{minipage}
  36       \label{fig:sch:ext}
  37     }%\hfill
  38   \end{center}
  39   \caption{The STABYLO scheme}
  40   \label{fig:sch}
  41 \end{figure*}
  42
  43
  44
  45
  46
  47
  48
  49
  50 \subsection{Security considerations}\label{sub:bbs}
  51 Among the methods of  message encryption/decryption
  52 (see~\cite{DBLP:journals/ejisec/FontaineG07} for a survey)
  53 we implement the asymmetric
  54 Blum-Goldwasser cryptosystem~\cite{Blum:1985:EPP:19478.19501}
  55 that is based on the Blum Blum Shub~\cite{DBLP:conf/crypto/ShubBB82}
  56 pseudorandom number generator (PRNG) and the
  57 XOR binary function.
  58 The main justification of this choice
  59 is that it has been proven~\cite{DBLP:conf/crypto/ShubBB82} that this PRNG
  60 has the property of cryptographical security, \textit{i.e.},
  61 for any sequence of $L$ output bits $x_i$, $x_{i+1}$, \ldots, $x_{i+L-1}$,
  62 there is no algorithm, whose time complexity is polynomial  in $L$, and
  63 which allows to find $x_{i-1}$ or $x_{i+L}$ with a probability greater
  64 than $1/2$.
  65 Equivalent formulations of such a property can
  66 be found. They all lead to the fact that,
  67 even if the encrypted message is extracted,
  68 it is impossible to retrieve the original one in
  69 polynomial time.
  70
  71 Starting thus with a key $k$ and the message \textit{mess} to hide,
  72 this step computes a message $m$, which is the encrypted version  of \textit{mess}.
  73
  74
  75 \subsection{Edge-based image steganography}\label{sub:edge}
  76
  77
  78 The edge-based image
  79 steganography schemes
  80 already presented \cite{Luo:2010:EAI:1824719.1824720,DBLP:journals/eswa/ChenCL10} differ
  81 in how carefully they select edge pixels, and
  82 how they modify them.
  83
  84 %Image Quality: Edge Image Steganography
  85 %\JFC{Raphael, les fuzzy edge detection sont souvent utilisés.
  86 %  il faudrait comparer les approches en terme de nombre de bits retournés,
  87 %  en terme de complexité. } \cite{KF11}
  88 %\RC{Ben, à voir car on peut choisir le nombre de pixel avec Canny. Supposons que les fuzzy edge soient retourne un peu plus de points, on sera probablement plus détectable...  Finalement on devrait surement vendre notre truc en : on a choisi cet algo car il est performant en vitesse/qualité. Mais on peut aussi en utilisé d'autres :-)}
  89
  90 Many techniques have been proposed in the literature to  detect
  91 edges in  images (whose noise has been initially reduced).
  92 They can be separated in two categories: first and second order detection
  93 methods on the one hand, and fuzzy detectors on the other  hand~\cite{KF11}.
  94 In first order methods like Sobel, Canny~\cite{Canny:1986:CAE:11274.11275}, and so on,
  95 a first-order derivative (gradient magnitude, etc.) is computed
  96 to search for local maxima, whereas in second order ones, zero crossings in a second-order derivative, like the Laplacian computed from the image,
  97 are searched in order to find edges.
  98 As far as fuzzy edge methods are concerned, they are obviously based on fuzzy logic to highlight edges.
  99
 100 Canny filters, on their parts, are an old family of algorithms still remaining a state of the art edge detector. They can be well-approximated by first-order derivatives of Gaussians.
 101 As the Canny algorithm is fast, well known, has been studied in depth, and is implementable
 102 on many  kinds of architectures like FPGAs, smart phones,  desktop machines, and
 103 GPUs, we have chosen this edge detector for illustrative purpose.
 104
 105
 106
 107
 108 This edge detection is applied on a filtered version of the image given
 109 as input.
 110 More precisely, only $b$ most significant bits are concerned by this step,
 111 where the parameter $b$ is practically set with $6$ or $7$.
 112 Notice that only the 2 LSBs of pixels in the set of edges
 113 are returned if $b$ is 6, and the LSB of pixels if $b$ is 7.
 114 If set with the same value $b$, the edge detection returns thus the same
 115 set of pixels for both the cover and the stego image.
 116 Moreover, to provide edge gradient value,
 117 the Canny algorithm computes derivatives
 118 in the two directions with respect to a mask of size $T$.
 119 The higher $T$ is, the coarse the approach is. Practically,
 120 $T$ is set with $3$, $5$, or $7$.
 121 In our flowcharts, this step is represented by ``Edge Detection(b, T, X)''.
 122
 123
 124 Let $x$ be the sequence of these bits.
 125 The next  section presents how to adapt our scheme
 126 with respect to the size
 127 of the message $m$ to embed and the size of the cover $x$.
 128
 129
 130
 131
 132
 133
 134
 135
 136 \subsection{Adaptive embedding rate}\label{sub:adaptive}
 137 Two strategies have been developed in our approach,
 138 depending on the embedding rate that is either \emph{Adaptive} or \emph{Fixed}.
 139 In the former the embedding rate depends on the number of edge pixels.
 140 The higher it is, the larger the message length that can be inserted is.
 141 Practically, a set of edge pixels is computed according to the
 142 Canny algorithm with parameters $b=7$ and $T=3$.
 143 The message length is thus defined to be lesser than
 144 half of this set cardinality.
 145 If $x$ is too short for $m$, the message is split into sufficient parts
 146 and a new cover image should be used for the remaining part of the message.
 147
 148 In the latter, the embedding rate is defined as a percentage between the
 149 number of modified pixels and the length of the bit message.
 150 This is the classical approach adopted in steganography.
 151 Practically, the Canny algorithm generates
 152 a set of edge pixels related to increasing values of $T$ and
 153 until its cardinality
 154 is sufficient. Even in this situation, our scheme adapts
 155 its algorithm to meet all the user's requirements.
 156
 157
 158 Once the map of possibly modified pixels is computed,
 159 two methods may further be applied to extract bits that
 160 are really changed.
 161 The first one randomly chooses the subset of pixels to modify by
 162 applying the BBS PRNG again. This method is further denoted  as a \emph{sample}.
 163 Once this set is selected, a classical LSB replacement is applied to embed the
 164 stego content.
 165 The second method considers the last significant bits of all the pixels
 166 inside the previous map. It next directly applies the STC
 167 algorithm~\cite{DBLP:journals/tifs/FillerJF11}.
 168 It  is further referred to as \emph{STC} and is detailed in the next section.
 169
 170
 171
 172
 173
 174
 175
 176
 177 \subsection{Minimizing distortion with Syndrome-Trellis Codes}\label{sub:stc}
 178 \input{stc}
 179
 180
 181
 182 % Edge Based Image Steganography schemes
 183 % already studied~\cite{Luo:2010:EAI:1824719.1824720,DBLP:journals/eswa/ChenCL10,DBLP:conf/ih/PevnyFB10} differ
 184 % how they select edge pixels, and
 185 % how they modify these ones.
 186
 187 % First of all, let us discuss about compexity of edge detetction methods.
 188 % Let then $M$ and $N$ be the dimension of the original image.
 189 % According to~\cite{Hu:2007:HPE:1282866.1282944},
 190 % even if the fuzzy logic based edge detection methods~\cite{Tyan1993}
 191 % have promising results, its complexity is in $C_3 \times O(M \times N)$
 192 % whereas the complexity on the Canny method~\cite{Canny:1986:CAE:11274.11275}
 193 % is in $C_1 \times O(M \times N)$ where  $C_1 < C_3$.
 194 % \JFC{Verifier ceci...}
 195 % In experiments detailled in this article, the Canny method has been retained
 196 % but the whole approach can be updated to consider
 197 % the fuzzy logic edge detector.
 198
 199 For a given set of parameters,
 200 the Canny algorithm returns a numerical value and
 201 states whether a given pixel is an edge or not.
 202 In this article, in the Adaptive strategy
 203 we consider that all the edge pixels that
 204 have been selected by this algorithm have the same
 205 distortion cost \textit{i.e.} $\rho_X$ is always 1 for these bits.
 206 In the Fixed strategy, since pixels that are detected to be edge
 207 with small values of $T$ (e.g. when $T=3$)
 208 are more accurate than these with higher values of $T$,
 209 we give to STC the following distortion map of the corresponding bits
 210 $$
 211 \rho_X= \left\{
 212 \begin{array}{l}
 213 1 \textrm{ if an edge for $T=3$} \\
 214 10 \textrm{ if an edge for $T=5$} \\
 215 100 \textrm{ if an edge for $T=7$}
 216 \end{array}
 217 \right.
 218 $$.
 219
 220
 221
 222
 223 \subsection{Data extraction}\label{sub:extract}
 224 The message extraction summarized in Fig.~\ref{fig:sch:ext}
 225 follows the data embedding approach
 226 since there exists a reverse function for all its steps.
 227
 228 More precisely,  let $b$ be the most significant bits and
 229 $T$ be the size of the Canny mask, both be given as a key.
 230 Thus, the same edge detection is applied on a stego content $Y$ to
 231 produce the sequence $y$ of LSBs.
 232 If the STC approach has been selected in embedding, the STC reverse
 233 algorithm is directly executed to retrieve the encrypted message.
 234 This inverse function takes the $\hat{H}$ matrix as a parameter.
 235 Otherwise, \textit{i.e.}, if the \emph{sample} strategy is retained,
 236 the same random bit selection than in the embedding step
 237 is executed with the same seed, given as a key.
 238 Finally, the Blum-Goldwasser decryption function is executed and the original
 239 message is extracted.
 240
 241
 242 \subsection{Running example}\label{sub:xpl}
 243 In this example, the cover image is  Lena,
 244 which is a $512\times512$  image with 256 grayscale levels.
 245 The message is the poem Ulalume (E. A. Poe), which is constituted by 104 lines, 667
 246 words, and 3,754 characters, \textit{i.e.},  30,032 bits.
 247 Lena and the first verses are given in Fig.~\ref{fig:lena}.
 248
 249 \begin{figure}
 250 \begin{center}
 251 \begin{minipage}{0.49\linewidth}
 252 \begin{center}
 253 \includegraphics[scale=0.20]{lena512}
 254 \end{center}
 255 \end{minipage}
 256 \begin{minipage}{0.49\linewidth}
 257 \begin{flushleft}
 258 \begin{scriptsize}
 259 The skies they were ashen and sober;\linebreak
 260 $\qquad$ The leaves they were crisped and sere—\linebreak
 261 $\qquad$ The leaves they were withering and sere;\linebreak
 262 It was night in the lonesome October\linebreak
 263 $\qquad$ Of my most immemorial year;\linebreak
 264 It was hard by the dim lake of Auber,\linebreak
 265 $\qquad$ In the misty mid region of Weir—\linebreak
 266 It was down by the dank tarn of Auber,\linebreak
 267 $\qquad$ In the ghoul-haunted woodland of Weir.
 268 \end{scriptsize}
 269 \end{flushleft}
 270 \end{minipage}
 271 \end{center}
 272 \caption{Cover and message examples} \label{fig:lena}
 273 \end{figure}
 274
 275 The edge detection returns 18,641 and 18,455 pixels when $b$ is
 276 respectively 7 and 6 and $T=3$.
 277 These edges are represented in Figure~\ref{fig:edge}.
 278 When $b$ is 7, it remains one bit per pixel to build the cover vector.
 279 This configuration leads to a cover vector of size  18,641 if b is 7
 280 and 36,910 if $b$ is 6.
 281
 282 \begin{figure}[t]
 283   \begin{center}
 284     \subfloat[$b$ is 7.]{
 285       \begin{minipage}{0.49\linewidth}
 286         \begin{center}
 287           %\includegraphics[width=5cm]{emb.pdf}
 288           \includegraphics[scale=0.20]{edge7}
 289         \end{center}
 290       \end{minipage}
 291       %\label{fig:sch:emb}
 292     }%\hfill
 293     \subfloat[$b$ is 6.]{
 294       \begin{minipage}{0.49\linewidth}
 295         \begin{center}
 296           %\includegraphics[width=5cm]{rec.pdf}
 297           \includegraphics[scale=0.20]{edge6}
 298         \end{center}
 299       \end{minipage}
 300       %\label{fig:sch:ext}
 301     }%\hfill
 302   \end{center}
 303   \caption{Edge detection wrt $b$ with $T=3$}
 304   \label{fig:edge}
 305 \end{figure}
 306
 307
 308
 309 The STC algorithm is optimized when the rate between message length and
 310 cover vector length is lower than 1/2.
 311 So, only 9,320 bits  are available for embedding
 312 in the  configuration where $b$ is 7.
 313
 314 When $b$ is 6, we could have considered 18,455 bits for the message.
 315 However, first experiments have shown that modifying this number of bits is too
 316 easily detectable.
 317 So, we choose to modify the same amount of bits (9,320) and keep STC optimizing
 318 which bits to change among  the 36,910 ones.
 319
 320 In the two cases, about the third part of the poem is hidden into the cover.
 321 Results with {Adaptive} and {STC} strategies are presented in
 322 Fig.~\ref{fig:lenastego}.
 323
 324 \begin{figure}[t]
 325   \begin{center}
 326     \subfloat[$b$ is 7.]{
 327       \begin{minipage}{0.49\linewidth}
 328         \begin{center}
 329           %\includegraphics[width=5cm]{emb.pdf}
 330           \includegraphics[scale=0.20]{lena7}
 331         \end{center}
 332       \end{minipage}
 333       %\label{fig:sch:emb}
 334     }%\hfill
 335     \subfloat[$b$ is 6.]{
 336       \begin{minipage}{0.49\linewidth}
 337         \begin{center}
 338           %\includegraphics[width=5cm]{rec.pdf}
 339           \includegraphics[scale=0.20]{lena6}
 340         \end{center}
 341       \end{minipage}
 342       %\label{fig:sch:ext}
 343     }%\hfill
 344   \end{center}
 345   \caption{Stego images wrt $b$}
 346   \label{fig:lenastego}
 347 \end{figure}
 348
 349
 350 Finally, differences between the original cover and the stego images
 351 are presented in Fig.~\ref{fig:lenadiff}. For each pair of pixel $X_{ij}$ and  $Y_{ij}$ ($X$ and $Y$ being the cover and the stego content respectively),
 352 the pixel value $V_{ij}$ of the difference is defined with the following map
 353 $$
 354 V_{ij}= \left\{
 355 \begin{array}{rcl}
 356 0 & \textrm{if} &  X_{ij} = Y_{ij} \\
 357 75 & \textrm{if} &  \vert X_{ij} - Y_{ij} \vert = 1 \\
 358 150 & \textrm{if} &  \vert X_{ij} - Y_{ij} \vert = 2 \\
 359 225 & \textrm{if} &  \vert X_{ij} - Y_{ij} \vert = 3
 360 \end{array}
 361 \right.
 362 $$
 363 This function allows to emphasize differences between contents.
 364 Notice that since $b$ is 7 in Fig.~\ref{fig:diff7}, the embedding is binary
 365 and this image only contains 0 and 75 values.
 366 Similarly, if $b$ is 6 as in Fig.~\ref{fig:diff6}, the embedding is ternary
 367 and the image contains all the values in $\{0,75,150,225\}$.
 368
 369
 370
 371 \begin{figure}[t]
 372   \begin{center}
 373     \subfloat[$b$ is 7.]{
 374       \begin{minipage}{0.49\linewidth}
 375         \begin{center}
 376           %\includegraphics[width=5cm]{emb.pdf}
 377           \includegraphics[scale=0.20]{diff7}
 378         \end{center}
 379       \end{minipage}
 380       \label{fig:diff7}
 381     }%\hfill
 382     \subfloat[$b$ is 6.]{
 383       \begin{minipage}{0.49\linewidth}
 384         \begin{center}
 385           %\includegraphics[width=5cm]{rec.pdf}
 386           \includegraphics[scale=0.20]{diff6}
 387         \end{center}
 388       \end{minipage}
 389       \label{fig:diff6}
 390     }%\hfill
 391   \end{center}
 392   \caption{Differences  with Lena's cover  wrt $b$}
 393   \label{fig:lenadiff}
 394 \end{figure}
 395
 396
 397