ourapproach.tex

   1 This section first presents the embedding scheme through its
   2 four main steps: the data encryption (Sect.~\ref{sub:bbs}),
   3 the cover pixel selection (Sect.~\ref{sub:edge}),
   4 the adaptive payload considerations (Sect.~\ref{sub:adaptive}),
   5 and how the distortion has been minimized (Sect.~\ref{sub:stc}).
   6 The message extraction is then presented  (Sect.~\ref{sub:extract}) and a running example ends this section (Sect.~\ref{sub:xpl}).
   7
   8
   9 The flowcharts given in Fig.~\ref{fig:sch}
  10 summarize our steganography scheme denoted by
  11 STABYLO, which stands for STeganography with
  12 Adaptive, Bbs, binarY embedding at LOw cost.
  13 What follows are successively some details of the inner steps and the flows both inside
  14  the embedding stage (Fig.~\ref{fig:sch:emb})
  15 and inside the extraction one (Fig.~\ref{fig:sch:ext}).
  16 Let us first focus on the data embedding.
  17
  18 \begin{figure*}%[t]
  19   \begin{center}
  20     \subfloat[Data Embedding.]{
  21       \begin{minipage}{0.49\textwidth}
  22         \begin{center}
  23           %\includegraphics[width=5cm]{emb.pdf}
  24           \includegraphics[scale=0.45]{emb.ps}
  25         \end{center}
  26       \end{minipage}
  27       \label{fig:sch:emb}
  28     }
  29
  30     \subfloat[Data Extraction.]{
  31       \begin{minipage}{0.49\textwidth}
  32         \begin{center}
  33           %\includegraphics[width=5cm]{rec.pdf}
  34           \includegraphics[scale=0.45]{rec.ps}
  35         \end{center}
  36       \end{minipage}
  37       \label{fig:sch:ext}
  38     }%\hfill
  39   \end{center}
  40   \caption{The STABYLO scheme}
  41   \label{fig:sch}
  42 \end{figure*}
  43
  44
  45
  46
  47
  48
  49
  50
  51 \subsection{Security considerations}\label{sub:bbs}
  52 Among the methods of  message encryption/decryption
  53 (see~\cite{DBLP:journals/ejisec/FontaineG07} for a survey)
  54 we implement the Blum-Goldwasser cryptosystem~\cite{Blum:1985:EPP:19478.19501}
  55 that is based on the Blum Blum Shub~\cite{DBLP:conf/crypto/ShubBB82}
  56 pseudorandom number generator (PRNG) and the
  57 XOR binary function.
  58 It has been indeed proven~\cite{DBLP:conf/crypto/ShubBB82} that this PRNG
  59 has the property of cryptographical security, \textit{i.e.},
  60 for any sequence of $L$ output bits $x_i$, $x_{i+1}$, \ldots, $x_{i+L-1}$,
  61 there is no algorithm, whose time complexity is polynomial  in $L$, and
  62 which allows to find $x_{i-1}$ or $x_{i+L}$ with a probability greater
  63 than $1/2$.
  64 Equivalent formulations of such a property can
  65 be found. They all lead to the fact that,
  66 even if the encrypted message is extracted,
  67 it is impossible to retrieve the original one in
  68 polynomial time.
  69
  70 Starting thus with a key $k$ and the message \textit{mess} to hide,
  71 this step computes a message $m$, which is the encrypted version  of \textit{mess}.
  72
  73
  74 \subsection{Edge-based image steganography}\label{sub:edge}
  75
  76
  77 The edge-based image
  78 steganography schemes
  79 already presented \cite{Luo:2010:EAI:1824719.1824720,DBLP:journals/eswa/ChenCL10} differ
  80 in how carefully they select edge pixels, and
  81 how they modify them.
  82
  83 %Image Quality: Edge Image Steganography
  84 %\JFC{Raphael, les fuzzy edge detection sont souvent utilisés.
  85 %  il faudrait comparer les approches en terme de nombre de bits retournés,
  86 %  en terme de complexité. } \cite{KF11}
  87 %\RC{Ben, à voir car on peut choisir le nombre de pixel avec Canny. Supposons que les fuzzy edge soient retourne un peu plus de points, on sera probablement plus détectable...  Finalement on devrait surement vendre notre truc en : on a choisi cet algo car il est performant en vitesse/qualité. Mais on peut aussi en utilisé d'autres :-)}
  88
  89 Many techniques have been proposed in the literature to  detect
  90 edges in  images (whose noise has been initially reduced).
  91 They can be separated into two categories: first and second order detection
  92 methods on the one hand, and fuzzy detectors on the other  hand~\cite{KF11}.
  93 In first order methods like Sobel, Canny~\cite{Canny:1986:CAE:11274.11275}, \ldots,
  94 a first-order derivative (gradient magnitude, etc.) is computed
  95 to search for local maxima, whereas in second order ones, zero crossings in a second-order derivative, like the Laplacian computed from the image,
  96 are searched in order to find edges.
  97 As far as fuzzy edge methods are concerned, they are obviously based on fuzzy logic to highlight edges.
  98
  99 Canny filters, on their parts, are an old family of algorithms still remaining a state of the art edge detector. They can be well-approximated by first-order derivatives of Gaussians.
 100 As the Canny algorithm is fast, well known, has been studied in depth, and is implementable
 101 on many  kinds of architectures like FPGAs, smartphones,  desktop machines, and
 102 GPUs, we have chosen this edge detector for illustrative purpose.
 103
 104 %\JFC{il faudrait comparer les complexites des algo fuzy and canny}
 105
 106
 107 This edge detection is applied on a filtered version of the image given
 108 as input.
 109 More precisely, only $b$ most
 110 significant bits are concerned by this step, where
 111 the parameter $b$ is practically set with $6$ or $7$.
 112 If set with the same value $b$, the edge detection returns thus the same
 113 set of pixels for both the cover and the stego image.
 114 In our flowcharts, this is represented by ``edgeDetection(b bits)''.
 115 Then only the 2 LSBs of pixels in the set of edges are returned if $b$ is 6,
 116 and the LSB of pixels if $b$ is 7.
 117
 118
 119
 120
 121
 122 Let $x$ be the sequence of these bits.
 123 The next  section presents how our scheme
 124 adapts  when the size of $x$  is not sufficient for the message $m$ to embed.
 125
 126
 127
 128
 129
 130
 131
 132 \subsection{Adaptive embedding rate}\label{sub:adaptive}
 133 Two strategies have been developed in our scheme,
 134 depending on the embedding rate that is either \emph{adaptive} or \emph{fixed}.
 135 In the former the embedding rate depends on the number of edge pixels.
 136 The higher it is, the larger the message length that can be inserted is.
 137 Practically, a set of edge pixels is computed according to the
 138 Canny algorithm with a high threshold.
 139 The message length is thus defined to be less than
 140 half of this set cardinality.
 141 If $x$ is then too short for $m$, the message is split into sufficient parts
 142 and a new cover image should be used for the remaining part of the message.
 143
 144
 145 In the latter, the embedding rate is defined as a percentage between the
 146 number of modified pixels and the length of the bit message.
 147 This is the classical approach adopted in steganography.
 148 Practically, the Canny algorithm generates
 149 a set of edge pixels related to a threshold that is decreasing
 150 until its cardinality
 151 is sufficient. Even in this situation, our scheme is adapting
 152 its algorithm to meet all the user's requirements.
 153
 154
 155 Once the map of possibly modified pixels is computed,
 156 two methods may further be applied to extract bits that
 157 are really modified.
 158 The first one randomly chooses the subset of pixels to modify by
 159 applying the BBS PRNG again. This method is further denoted  as a \emph{sample}.
 160 Once this set is selected, a classical LSB replacement is applied to embed the
 161 stego content.
 162 The second method is a direct application of the
 163 STC algorithm~\cite{DBLP:journals/tifs/FillerJF11}.
 164 It  is further referred to as \emph{STC} and is detailed in the next section.
 165
 166
 167
 168
 169
 170 % First of all, let us discuss about compexity of edge detetction methods.
 171 % Let then $M$ and $N$ be the dimension of the original image.
 172 % According to~\cite{Hu:2007:HPE:1282866.1282944},
 173 % even if the fuzzy logic based edge detection methods~\cite{Tyan1993}
 174 % have promising results, its complexity is in $C_3 \times O(M \times N)$
 175 % whereas the complexity on the Canny method~\cite{Canny:1986:CAE:11274.11275}
 176 % is in $C_1 \times O(M \times N)$ where  $C_1 < C_3$.
 177 % \JFC{Verifier ceci...}
 178 % In experiments detailled in this article, the Canny method has been retained
 179 % but the whole approach can be updated to consider
 180 % the fuzzy logic edge detector.
 181
 182
 183
 184
 185
 186
 187
 188 \subsection{Minimizing distortion with syndrome-trellis codes}\label{sub:stc}
 189 \input{stc}
 190
 191
 192
 193 % Edge Based Image Steganography schemes
 194 % already studied~\cite{Luo:2010:EAI:1824719.1824720,DBLP:journals/eswa/ChenCL10,DBLP:conf/ih/PevnyFB10} differ
 195 % how they select edge pixels, and
 196 % how they modify these ones.
 197
 198 % First of all, let us discuss about compexity of edge detetction methods.
 199 % Let then $M$ and $N$ be the dimension of the original image.
 200 % According to~\cite{Hu:2007:HPE:1282866.1282944},
 201 % even if the fuzzy logic based edge detection methods~\cite{Tyan1993}
 202 % have promising results, its complexity is in $C_3 \times O(M \times N)$
 203 % whereas the complexity on the Canny method~\cite{Canny:1986:CAE:11274.11275}
 204 % is in $C_1 \times O(M \times N)$ where  $C_1 < C_3$.
 205 % \JFC{Verifier ceci...}
 206 % In experiments detailled in this article, the Canny method has been retained
 207 % but the whole approach can be updated to consider
 208 % the fuzzy logic edge detector.
 209
 210 % Next, following~\cite{Luo:2010:EAI:1824719.1824720}, our scheme automatically
 211 % modifies Canny parameters to get a sufficiently large set of edge bits: this
 212 % one is practically enlarged untill its size is at least twice as many larger
 213 % than the size of embedded message.
 214
 215
 216
 217 %%RAPH: paragraphe en double :-)
 218
 219
 220
 221
 222 \subsection{Data extraction}\label{sub:extract}
 223 The message extraction summarized in Fig.~\ref{fig:sch:ext}
 224 follows the data embedding approach
 225 since there exists a reverse function for all its steps.
 226
 227 More precisely, the same edge detection is applied on the $b$ first bits  to
 228 produce the sequence $y$ of LSBs.
 229 If the STC approach has been selected in embedding, the STC reverse
 230 algorithm is directly executed to retrieve the encrypted message.
 231 This inverse function takes the $H$ matrix as a parameter.
 232 Otherwise, \textit{i.e.}, if the \emph{sample} strategy is retained,
 233 the same random bit selection than in the embedding step
 234 is executed with the same seed, given as a key.
 235 Finally, the Blum-Goldwasser decryption function is executed and the original
 236 message is extracted.
 237
 238
 239 \subsection{Running example}\label{sub:xpl}
 240 In this example, the cover image is  Lena,
 241 which is a $512\times512$  image with 256 grayscale levels.
 242 The message is the poem Ulalume (E. A. Poe), which is constituted by 104 lines, 667
 243 words, and 3,754 characters, \textit{i.e.},  30,032 bits.
 244 Lena and the first verses are given in Fig.~\ref{fig:lena}.
 245
 246 \begin{figure}
 247 \begin{center}
 248 \begin{minipage}{0.49\linewidth}
 249 \begin{center}
 250 \includegraphics[scale=0.20]{Lena.eps}
 251 \end{center}
 252 \end{minipage}
 253 \begin{minipage}{0.49\linewidth}
 254 \begin{flushleft}
 255 \begin{scriptsize}
 256 The skies they were ashen and sober;\linebreak
 257 $\qquad$ The leaves they were crisped and sere—\linebreak
 258 $\qquad$ The leaves they were withering and sere;\linebreak
 259 It was night in the lonesome October\linebreak
 260 $\qquad$ Of my most immemorial year;\linebreak
 261 It was hard by the dim lake of Auber,\linebreak
 262 $\qquad$ In the misty mid region of Weir—\linebreak
 263 It was down by the dank tarn of Auber,\linebreak
 264 $\qquad$ In the ghoul-haunted woodland of Weir.
 265 \end{scriptsize}
 266 \end{flushleft}
 267 \end{minipage}
 268 \end{center}
 269 \caption{Cover and message examples} \label{fig:lena}
 270 \end{figure}
 271
 272 The edge detection returns 18,641 and 18,455 pixels when $b$ is
 273 respectively 7 and 6. These edges are represented in Figure~\ref{fig:edge}.
 274 When $b$ is 7, it remains one bit per pixel to build the cover vector.
 275 in this configuration, this leads to a cover vector of size  18,641 and
 276 36,910 if $b$ is 6.
 277
 278 \begin{figure}[t]
 279   \begin{center}
 280     \subfloat[$b$ is 7.]{
 281       \begin{minipage}{0.49\linewidth}
 282         \begin{center}
 283           %\includegraphics[width=5cm]{emb.pdf}
 284           \includegraphics[scale=0.20]{edge7.eps}
 285         \end{center}
 286       \end{minipage}
 287       %\label{fig:sch:emb}
 288     }%\hfill
 289     \subfloat[$b$ is 6.]{
 290       \begin{minipage}{0.49\linewidth}
 291         \begin{center}
 292           %\includegraphics[width=5cm]{rec.pdf}
 293           \includegraphics[scale=0.20]{edge6.eps}
 294         \end{center}
 295       \end{minipage}
 296       %\label{fig:sch:ext}
 297     }%\hfill
 298   \end{center}
 299   \caption{Edge detection wrt $b$}
 300   \label{fig:edge}
 301 \end{figure}
 302
 303
 304
 305 The STC algorithm is optimized when the rate between message length and
 306 cover vector length is less than 1/2.
 307 So, only 9,320 bits (resp. 18,455 bits) are available for embedding
 308 in the former configuration where $b$ is 7 (resp. where $b$ is 6).
 309 In the first cases, about the third part of the poem is hidden into the cover
 310 whereas the latter allows to embed more than the half part of it.
 311 Results with \emph{adaptive+STC} strategy are presented in
 312 Fig.~\ref{fig:lenastego}.
 313
 314 \begin{figure}[t]
 315   \begin{center}
 316     \subfloat[$b$ is 7.]{
 317       \begin{minipage}{0.49\linewidth}
 318         \begin{center}
 319           %\includegraphics[width=5cm]{emb.pdf}
 320           \includegraphics[scale=0.20]{lena7.eps}
 321         \end{center}
 322       \end{minipage}
 323       %\label{fig:sch:emb}
 324     }%\hfill
 325     \subfloat[$b$ is 6.]{
 326       \begin{minipage}{0.49\linewidth}
 327         \begin{center}
 328           %\includegraphics[width=5cm]{rec.pdf}
 329           \includegraphics[scale=0.20]{lena6.eps}
 330         \end{center}
 331       \end{minipage}
 332       %\label{fig:sch:ext}
 333     }%\hfill
 334   \end{center}
 335   \caption{Stego images wrt $b$}
 336   \label{fig:lenastego}
 337 \end{figure}
 338
 339
 340 Finally, differences between the original cover and the stego images
 341 are presented in Fig.~\ref{fig:lenadiff}. For each pair of pixel $X_{ij}$ and  $Y_{ij}$ ($X$ and $Y$ being the cover and the stego content respectively),
 342 the pixel value $V_{ij}$ of the difference is defined with the following map
 343 $$
 344 V_{ij}= \left\{
 345 \begin{array}{rcl}
 346 0 & \textrm{if} &  X_{ij} = Y_{ij} \\
 347 75 & \textrm{if} &  \vert X_{ij} - Y_{ij} \vert = 1 \\
 348 150 & \textrm{if} &  \vert X_{ij} - Y_{ij} \vert = 2 \\
 349 225 & \textrm{if} &  \vert X_{ij} - Y_{ij} \vert = 3
 350 \end{array}
 351 \right..
 352 $$
 353 This function allows to emphasize differences between contents.
 354
 355 \begin{figure}[t]
 356   \begin{center}
 357     \subfloat[$b$ is 7.]{
 358       \begin{minipage}{0.49\linewidth}
 359         \begin{center}
 360           %\includegraphics[width=5cm]{emb.pdf}
 361           \includegraphics[scale=0.20]{diff7.eps}
 362         \end{center}
 363       \end{minipage}
 364       %\label{fig:sch:emb}
 365     }%\hfill
 366     \subfloat[$b$ is 6.]{
 367       \begin{minipage}{0.49\linewidth}
 368         \begin{center}
 369           %\includegraphics[width=5cm]{rec.pdf}
 370           \includegraphics[scale=0.20]{diff6.eps}
 371         \end{center}
 372       \end{minipage}
 373       %\label{fig:sch:ext}
 374     }%\hfill
 375   \end{center}
 376   \caption{Differences  with Lena's cover  wrt $b$}
 377   \label{fig:lenadiff}
 378 \end{figure}
 379
 380
 381