1 This section first presents the embedding scheme through its
2 four main steps: the data encryption (Sect.~\ref{sub:bbs}),
3 the cover pixel selection (Sect.~\ref{sub:edge}),
4 the adaptive payload considerations (Sect.~\ref{sub:adaptive}),
5 and how the distortion has been minimized (Sect.~\ref{sub:stc}).
6 The message extraction is then presented (Sect.~\ref{sub:extract}) and a running example ends this section (Sect.~\ref{sub:xpl}).
9 The flowcharts given in Fig.~\ref{fig:sch}
10 summarize our steganography scheme denoted by
11 STABYLO, which stands for STeganography with cAnny, Bbs, binarY embedding at LOw cost.
12 What follows are successively some details of the inner steps and the flows both inside
13 the embedding stage (Fig.~\ref{fig:sch:emb})
14 and inside the extraction one (Fig.~\ref{fig:sch:ext}).
15 Let us first focus on the data embedding.
19 \subfloat[Data Embedding.]{
20 \begin{minipage}{0.49\textwidth}
22 %\includegraphics[width=5cm]{emb.pdf}
23 \includegraphics[scale=0.45]{emb.ps}
29 \subfloat[Data Extraction.]{
30 \begin{minipage}{0.49\textwidth}
32 %\includegraphics[width=5cm]{rec.pdf}
33 \includegraphics[scale=0.45]{rec.ps}
39 \caption{The STABYLO scheme}
50 \subsection{Security considerations}\label{sub:bbs}
51 Among methods of the message encryption/decryption
52 (see~\cite{DBLP:journals/ejisec/FontaineG07} for a survey)
53 we implement the Blum-Goldwasser cryptosystem~\cite{Blum:1985:EPP:19478.19501}
54 that is based on the Blum Blum Shub~\cite{DBLP:conf/crypto/ShubBB82}
55 pseudorandom number generator (PRNG) and the
57 It has been indeed proven~\cite{DBLP:conf/crypto/ShubBB82} that this PRNG
58 has the property of cryptographical security, \textit{i.e.},
59 for any sequence of $L$ output bits $x_i$, $x_{i+1}$, \ldots, $x_{i+L-1}$,
60 there is no algorithm, whose time complexity is polynomial in $L$, and
61 which allows to find $x_{i-1}$ or $x_{i+L}$ with a probability greater
63 Equivalent formulations of such a property can
64 be found. They all lead to the fact that,
65 even if the encrypted message is extracted,
66 it is impossible to retrieve the original one in
69 Starting thus with a key $k$ and the message \textit{mess} to hide,
70 this step computes a message $m$, which is the encrypted version of \textit{mess}.
73 \subsection{Edge-based image steganography}\label{sub:edge}
78 already presented \cite{Luo:2010:EAI:1824719.1824720,DBLP:journals/eswa/ChenCL10} differ
79 in how carefully they select edge pixels, and
82 %Image Quality: Edge Image Steganography
83 %\JFC{Raphael, les fuzzy edge detection sont souvent utilisés.
84 % il faudrait comparer les approches en terme de nombre de bits retournés,
85 % en terme de complexité. } \cite{KF11}
86 %\RC{Ben, à voir car on peut choisir le nombre de pixel avec Canny. Supposons que les fuzzy edge soient retourne un peu plus de points, on sera probablement plus détectable... Finalement on devrait surement vendre notre truc en : on a choisi cet algo car il est performant en vitesse/qualité. Mais on peut aussi en utilisé d'autres :-)}
88 Many techniques have been proposed in the literature to detect
89 edges in images (whose noise has been initially reduced).
90 They can be separated in two categories: first and second order detection
91 methods on the one hand, and fuzzy detectors on the other hand~\cite{KF11}.
92 In first order methods like Sobel, Canny~\cite{Canny:1986:CAE:11274.11275}, \ldots,
93 a first-order derivative (gradient magnitude, etc.) is computed
94 to search for local maxima, whereas in second order ones, zero crossings in a second-order derivative, like the Laplacian computed from the image,
95 are searched in order to find edges.
96 As far as fuzzy edge methods are concerned, they are obviously based on fuzzy logic to highlight edges.
98 Canny filters, on their parts, are an old family of algorithms still remaining a state of the art edge detector. They can be well-approximated by first-order derivatives of Gaussians.
99 As the Canny algorithm is fast, well known, has been studied in depth, and is implementable
100 on many kinds of architectures like FPGAs, smartphones, desktop machines, and
101 GPUs, we have chosen this edge detector for illustrative purpose.
103 %\JFC{il faudrait comparer les complexites des algo fuzy and canny}
106 This edge detection is applied on a filtered version of the image given
108 More precisely, only $b$ most
109 significant bits are concerned by this step, where
110 the parameter $b$ is practically set with $6$ or $7$.
111 If set with the same value $b$, the edge detection returns thus the same
112 set of pixels for both the cover and the stego image.
113 In our flowcharts, this is represented by ``edgeDetection(b bits)''.
114 Then only the 2 LSBs of pixels in the set of edges are returned if $b$ is 6,
115 and the LSB of pixels if $b$ is 7.
121 Let $x$ be the sequence of these bits.
122 The next section presents how our scheme
123 adapts when the size of $x$ is not sufficient for the message $m$ to embed.
131 \subsection{Adaptive embedding rate}\label{sub:adaptive}
132 Two strategies have been developed in our scheme,
133 depending on the embedding rate that is either \emph{adaptive} or \emph{fixed}.
134 In the former the embedding rate depends on the number of edge pixels.
135 The higher it is, the larger the message length that can be inserted is.
136 Practically, a set of edge pixels is computed according to the
137 Canny algorithm with a high threshold.
138 The message length is thus defined to be less than
139 half of this set cardinality.
140 If $x$ is then too short for $m$, the message is split into sufficient parts
141 and a new cover image should be used for the remaining part of the message.
144 In the latter, the embedding rate is defined as a percentage between the
145 number of modified pixels and the length of the bit message.
146 This is the classical approach adopted in steganography.
147 Practically, the Canny algorithm generates
148 a set of edge pixels related to a threshold that is decreasing
149 until its cardinality
154 Two methods may further be applied to select bits that
156 The first one randomly chooses the subset of pixels to modify by
157 applying the BBS PRNG again. This method is further denoted as a \emph{sample}.
158 Once this set is selected, a classical LSB replacement is applied to embed the
160 The second method is a direct application of the
161 STC algorithm~\cite{DBLP:journals/tifs/FillerJF11}.
162 It is further referred to as \emph{STC} and is detailed in the next section.
168 % First of all, let us discuss about compexity of edge detetction methods.
169 % Let then $M$ and $N$ be the dimension of the original image.
170 % According to~\cite{Hu:2007:HPE:1282866.1282944},
171 % even if the fuzzy logic based edge detection methods~\cite{Tyan1993}
172 % have promising results, its complexity is in $C_3 \times O(M \times N)$
173 % whereas the complexity on the Canny method~\cite{Canny:1986:CAE:11274.11275}
174 % is in $C_1 \times O(M \times N)$ where $C_1 < C_3$.
175 % \JFC{Verifier ceci...}
176 % In experiments detailled in this article, the Canny method has been retained
177 % but the whole approach can be updated to consider
178 % the fuzzy logic edge detector.
186 \subsection{Minimizing distortion with syndrome-trellis codes}\label{sub:stc}
191 % Edge Based Image Steganography schemes
192 % already studied~\cite{Luo:2010:EAI:1824719.1824720,DBLP:journals/eswa/ChenCL10,DBLP:conf/ih/PevnyFB10} differ
193 % how they select edge pixels, and
194 % how they modify these ones.
196 % First of all, let us discuss about compexity of edge detetction methods.
197 % Let then $M$ and $N$ be the dimension of the original image.
198 % According to~\cite{Hu:2007:HPE:1282866.1282944},
199 % even if the fuzzy logic based edge detection methods~\cite{Tyan1993}
200 % have promising results, its complexity is in $C_3 \times O(M \times N)$
201 % whereas the complexity on the Canny method~\cite{Canny:1986:CAE:11274.11275}
202 % is in $C_1 \times O(M \times N)$ where $C_1 < C_3$.
203 % \JFC{Verifier ceci...}
204 % In experiments detailled in this article, the Canny method has been retained
205 % but the whole approach can be updated to consider
206 % the fuzzy logic edge detector.
208 % Next, following~\cite{Luo:2010:EAI:1824719.1824720}, our scheme automatically
209 % modifies Canny parameters to get a sufficiently large set of edge bits: this
210 % one is practically enlarged untill its size is at least twice as many larger
211 % than the size of embedded message.
215 %%RAPH: paragraphe en double :-)
220 \subsection{Data extraction}\label{sub:extract}
221 The message extraction summarized in Fig.~\ref{fig:sch:ext}
222 follows the data embedding approach
223 since there exists a reverse function for all its steps.
225 More precisely, the same edge detection is applied on the $b$ first bits to
226 produce the sequence $y$ of LSBs.
227 If the STC approach has been selected in embedding, the STC reverse
228 algorithm is directly executed to retrieve the encrypted message.
229 This inverse function takes the $H$ matrix as a parameter.
230 Otherwise, \textit{i.e.}, if the \emph{sample} strategy is retained,
231 the same random bit selection than in the embedding step
232 is executed with the same seed, given as a key.
233 Finally, the Blum-Goldwasser decryption function is executed and the original
234 message is extracted.
237 \subsection{Running example}\label{sub:xpl}
238 In this example, the cover image is Lena,
239 which is a $512\times512$ image with 256 grayscale levels.
240 The message is the poem Ulalume (E. A. Poe), which is constituted by 104 lines, 667
241 words, and 3,754 characters, \textit{i.e.}, 30,032 bits.
242 Lena and the first verses are given in Fig.~\ref{fig:lena}.
246 \begin{minipage}{0.49\linewidth}
248 \includegraphics[scale=0.20]{Lena.eps}
251 \begin{minipage}{0.49\linewidth}
254 The skies they were ashen and sober;\linebreak
255 $~$ The leaves they were crisped and sere—\linebreak
256 $~$ The leaves they were withering and sere;\linebreak
257 It was night in the lonesome October\linebreak
258 $~$ Of my most immemorial year;\linebreak
259 It was hard by the dim lake of Auber,\linebreak
260 $~$ In the misty mid region of Weir—\linebreak
261 It was down by the dank tarn of Auber,\linebreak
262 $~$ In the ghoul-haunted woodland of Weir.
267 \caption{Cover and message examples} \label{fig:lena}
270 The edge detection returns 18,641 and 18,455 pixels when $b$ is
271 respectively 7 and 6. These edges are represented in Figure~\ref{fig:edge}.
276 \subfloat[$b$ is 7.]{
277 \begin{minipage}{0.49\linewidth}
279 %\includegraphics[width=5cm]{emb.pdf}
280 \includegraphics[scale=0.20]{edge7.eps}
285 \subfloat[$b$ is 6.]{
286 \begin{minipage}{0.49\linewidth}
288 %\includegraphics[width=5cm]{rec.pdf}
289 \includegraphics[scale=0.20]{edge6.eps}
295 \caption{Edge detection wrt $b$}
301 Only 9,320 bits (resp. 9,227 bits) are available for embedding
302 in the former configuration where $b$ is 7 (resp. where $b$ is 6).
303 In both cases, about the third part of the poem is hidden into the cover.
304 Results with \emph{adaptive+STC} strategy are presented in
305 Fig.~\ref{fig:lenastego}.
309 \subfloat[$b$ is 7.]{
310 \begin{minipage}{0.49\linewidth}
312 %\includegraphics[width=5cm]{emb.pdf}
313 \includegraphics[scale=0.20]{lena7.eps}
318 \subfloat[$b$ is 6.]{
319 \begin{minipage}{0.49\linewidth}
321 %\includegraphics[width=5cm]{rec.pdf}
322 \includegraphics[scale=0.20]{lena6.eps}
328 \caption{Stego images wrt $b$}
329 \label{fig:lenastego}
333 Finally, differences between the original cover and the stego images
334 are presented in Fig.~\ref{fig:lenadiff}. For each pair of pixel $X_{ij}$ and $Y_{ij}$ ($X$ and $Y$ being the cover and the stego content respectively),
335 the pixel value $V_{ij}$ of the difference is defined with the following map
339 0 & \textrm{if} & X_{ij} = Y_{ij} \\
340 75 & \textrm{if} & \vert X_{ij} - Y_{ij} \vert = 1 \\
341 150 & \textrm{if} & \vert X_{ij} - Y_{ij} \vert = 2 \\
342 225 & \textrm{if} & \vert X_{ij} - Y_{ij} \vert = 3
346 This function allows to emphasize differences between contents.
350 \subfloat[$b$ is 7.]{
351 \begin{minipage}{0.49\linewidth}
353 %\includegraphics[width=5cm]{emb.pdf}
354 \includegraphics[scale=0.20]{diff7.eps}
359 \subfloat[$b$ is 6.]{
360 \begin{minipage}{0.49\linewidth}
362 %\includegraphics[width=5cm]{rec.pdf}
363 \includegraphics[scale=0.20]{diff6.eps}
369 \caption{Differences with Lena's cover wrt $b$}
375 \section{Complexity Analysis}\label{sub:complexity}
376 This section aims at justifying the leightweight attribute of our approach.
377 To be more precise, we compare the complexity of our schemes to the
378 state of the art steganography, namely HUGO~\cite{DBLP:conf/ih/PevnyFB10}.
381 In what folllows, we consider an $n \times n$ square image.
382 First of all, HUGO starts with computing the second order SPAM Features.
383 This steps is in $O(n^2 + 2.343^2)$ due to the calculation
384 of the difference arrays and next of the 686 features (of size 343).
385 Next for each pixel, the distortion measure is calculated by +1/-1 modifying
386 its value and computing again the SPAM
387 features. Pixels are thus selected according to their ability to provide
388 an image whose SPAM features are close to the original one.
389 The algorithm is thus computing a distance between each Feature,
390 which is at least in $O(343)$ and an overall distance between these
391 metrics which is in $O(686)$. Computing the distance is thus in
392 $O(2\time 343^2)$ and this mdification is thus in $O(2\time 343^2 \time n^2)$.
393 Ranking these results may be achieved with a insertion sort which is in $2.n^2 \ln(n)$.
394 The overall complexity of the pixel selection is thus
395 $O(n^2 +2.343^2 + 2\time 343^2 \time n^2 + 2.n^2 \ln(n))$, \textit{i.e}
396 $O(2.n^2(343^2 + \ln(n)))$.
398 Our edge selection is based on a Canny Filter,
399 whose complexity is in $O(2n^2.\ln(n))$ thanks to the convolution step
400 which can be implemented with FFT.
401 The complexity of Hugo is at least $343^2/\ln{n}$ times higher than our scheme.