Deconvolution: A Wavelet Frame Approach: Received: Date / Revised Version: Date
Deconvolution: A Wavelet Frame Approach: Received: Date / Revised Version: Date
Deconvolution: A Wavelet Frame Approach: Received: Date / Revised Version: Date
2
(Z)
.
There are many real life problems which can be modeled by a deconvolution process. For example, mea-
surement devices and signal communication can introduce distortions and add noise to original signals. Invert-
ing the degradation is often modeled by a deconvolution process, i.e. a process of nding a solution in (1.1).
In fact, the deconvolution problem is a critical factor in many applications, especially visual-communication
Anwei Chai
Department of Mathematics, National University of Singapore, 2 Science Drive 2, Singapore 117543, Singapore
E-mail: anwei@stanford.edu
Present address: Institute for Computational and Mathematical Engineering, Stanford University, CA 94305, USA
Zuowei Shen
Department of Mathematics, National University of Singapore, 2 Science Drive 2, Singapore 117543, Singapore
E-mail: matzuows@nus.edu.sg
2 Anwei Chai, Zuowei Shen
related applications including remote sensing, military imaging, surveillance, medical imaging, and high res-
olution image reconstructions.
Solving equation (1.1) is an inverting process, which is often numerically unstable and thus amplies the
noise considerably. Hence, an efcient process of noise removal must be built in numerical algorithms. The
earlier formulation of the problem was proposed in [34] using linear algorithm and in [25] and [33] applying
the regularization idea to solve a system of linear equations the coefcient matrix of which is ill-conditioned.
Since then, there are many papers devoted to this method in the literature. Because this approach is not the
focus of this paper, instead of a detailed count, we simply refer readers to [24] and [28] and the references
there for a complete reference.
The focus of this paper is to use wavelet (more generally, wavelet frame) to solve (1.1). Recently, there are
several papers on solving inverse problems by using wavelet methods, and in particular, deconvolution prob-
lems. One of the main ideas is to construct a wavelet or wavelet inspired basis that can almost diagonalize
the given operator and the underlying solution has a sparse expansion with respect to the chosen basis. The
Wavelet-Vaguelette decomposition proposed in [20], [22] and [23] and the deconvolution in mirror wavelet
bases in [28] and [29] can be both viewed as examples of succuss of this strategy. Another approach is to
apply Galerkin-type methods to inverse problems using an appropriate, but xed wavelet basis (see e.g. [1]
and [16]). The underlying intuition is that if the given operator has a close to sparse representation in wavelets
and the solution has a sparse expansion with respect to the wavelet basis, then the inversion is reduced approx-
imately to the inversion of a truncated operator. The method is adaptive, so that the ner-scale wavelets are
used where lower-scales indicate the presence of singularities. A few new iterative thresholding algorithms
different from the above approaches and developed simultaneously and independently are proposed in [6, 8,
9, 18, 20]. It only requires that the underlying solution has a sparse expansion with respect to a given system
without any attempt to almost diagonalize or sparsely representing the convolution operators.
The main idea of [18, 20] is to expand each iteration with respect to the chosen orthonormal basis for a
given algorithm such as the Landweber method, then a thresholding algorithm is applied to the coefcients of
this expansion. The result is then used to derive the next iteration. The algorithm is shown to converge to the
minimizer of certain cost functional.
In the studies of high resolution image reconstructions, the wavelet based (in fact the frame-based) recon-
struction algorithms are developed in [57], and later [8, 9] through the perfect reconstruction formula of a
bi-frame or tight frame system which has h
0
as its primary low pass lter. The algorithms approximate itera-
tively the coefcients of wavelet frame folded by the given low pass lter. By this approach, many available
techniques developed in the wavelet literatures, such as wavelet-based denoising schemes, can be built in the
iteration. When there are no displacement errors, the high resolution image reconstruction is exactly a de-
convolution problem. Here, we extend the algorithms in the papers mentioned above to solve equation (1.1).
Algorithm 4.3 is used in the papers mentioned above, in particular in [6, 8]. This method has been extended to
algorithms for high resolution image reconstructions with displacement errors in [8] and [9]. Algorithm 4.1
is given in [9] as one of the options which is motivated by the approaches taken by [18, 20]. Algorithms given
in [10, 11] are based on Algorithm 4.2 where high resolution images are constructed from a series of video
clips. The main ideas of all three algorithms are the same. i.e. an iterative process combined with a denoising
scheme applied to each iteration. The differences lay in the different denoising schemes applied to different
algorithms which in turn minimize different cost functionals. Finally, we remark that converting a deblurring
problem of recovering v from c to a problem of inpainting of lost wavelet coefcients can be also found in
[12] with a different approach. The interested reader should consult [12] for details.
The convergence analysis of Algorithm 2.1 (the iteration without built-in denoising scheme) has already
been established in [6] and [9]. However, the convergence of Algorithm 2.2, 2.3, 4.1, 4.2, 4.3 has not been
discussed so far. The current paper aims to build up a complete theory for these algorithms. We will rst give
a solid and complete formulation of reconstructions of a solution to equation (1.1) in terms of multiresolution
analysis and its associated frame system. Then the convergence of all algorithms will be given. A complete
analysis of minimization properties, i.e. in which sense the solution derived from the algorithms attains its
optimal property, will be given. Finally, the stability of algorithms is also discussed, which shows that nu-
merical solution approaches to the exact solution when the noise level decreases to zero. As it has already
been shown in the papers [6, 811], algorithms are numerically efcient, easy to implement and adaptive to
different applications such as high resolution image reconstructions with displacement errors (see e.g. [8] and
[9]). In this paper, a theoretical foundation of the underlying algorithms used in those papers is fully laid out.
The paper is organized as follows: Section 2 is devoted to giving a formulation of the deconvolution prob-
lem in terms of multiresolution analysis and its associated tight wavelet frame. Algorithms are also derived
Deconvolution: A Wavelet Frame Approach 3
from this formulation. Section 3 gives a complete analysis of the algorithms, including the convergence and
minimization properties of the algorithms. Section 4 focuses on the nite dimensional data set, i.e. the data
set has only nitely many entries. Algorithms 2.2 and 2.3 for innite dimensional data set can be converted by
imposing proper boundary conditions. Since any numerical solution of deconvolution ultimately deals with
nite dimensional data sets, such conversion is necessary. As we will see, in many cases, the discussion will
be simpler and we are able to obtain better results. Since the numerical implementations and simulations,
together with comparison of algorithms provided here with other algorithms, e.g. regularization method, are
discussed in details in [6, 8, 9], and since our focus here is to lay the foundation of the algorithms, we omit the
detailed discussions of numerical implementations here. Instead, we give a numerical comparison between
Algorithms 4.1, 4.2 and 4.3 in Section 4. Finally, Several proofs are left over to the appendices.
In the remaining part of this section, we give the notation and collect basic results of tight frame system
that will be used in this paper. Readers whose main interests are the major part of this paper may only need to
briey go through this part to get familiar with the notations.
1.2 Tight Wavelet Frames
We give here a brief introduction of the tight wavelet frame and its quasi-afne counterpart. The decompo-
sition and reconstruction for the afne tight frame system are known (e.g. [19]); however, the analysis of
decomposition and reconstruction of quasi-afne systems is not systematically given. Since these results are
crucial for our analysis, we introduce them here in details and leave the proofs to the appendix. At the same
time, we set the notations used in this paper.
The space L
p
(R) is the set of all the functions f (x) satisfying
|f |
L
p
(R)
:=
_
(
_
R
[ f (x)[
p
dx)
1
p
<, 1 p <;
esssup
xR
[ f (x)[ <, p =;
and
p
(Z) is the set of all sequences dened on Z which satisfy that
|h|
p
(Z)
:=
_
(
kZ
[h[k][
p
)
1
p
<, 1 p <;
sup
kZ
[h[k][ <, p =.
The Fourier transform of a function f L
1
(R) is dened as usual by:
f () :=
_
R
f (x)e
ix
dx, R,
and its inverse is
f (x) =
1
2
_
R
f ()e
ix
d, x R.
They can be extended to more general functions, i.e. the functions in L
2
(R). Similarly, we can dene the
Fourier series for a sequence h
2
(Z) by
h() :=
kZ
h[k]e
ik
, R.
For any function f L
2
(R), the dyadic dilation operator D is dened by Df (x) :=
2f (2x) and the
translation operator T is dened by T
a
f (x) := f (x a) for a R. Given j Z, we have T
a
D
j
= D
j
T
2
j
a
.
Further, a space V is said to be integer-shift invariant if given any function f V, T
j
f V for j Z.
A system X L
2
(R) is called a tight frame of L
2
(R) if
|f |
2
L
2
(R)
=
gX
[ f , g)[
2
,
holds for all f L
2
(R), where , ) is the inner product in L
2
(R) and | |
L
2
(R)
=
_
, ). This is equivalent to
f =
gX
f , g)g, f L
2
(R).
4 Anwei Chai, Zuowei Shen
It is clear that an orthonormal basis is a tight frame.
For given :=
1
, . . . ,
r
L
2
(R), dene the afne system
X() :=
, j,k
: 1 r; j, k Z,
where
, j,k
= D
j
T
k
= 2
j/2
(2
j
k). When X() forms an orthonormal basis of L
2
(R), then
, =
1, . . . , r, are called the orthonormal wavelets. When X() forms a tight frame of L
2
(R), then it is called a
tight wavelet frame and
kZ
h
0
[k](2x k), (1.2)
for some sequence h
0
2
(Z). By the Fourier transform, the renable equation (1.2) can be given as
() =
h
0
(/2)
(/2), a.e. R.
We call the sequence h
0
the renement mask of and
h
0
() the renement symbol of .
For given nitely supported h
0
with
h
0
(0) = 1, the renement equation (1.2) always has distribution
solution which can be written in the Fourier domain as
() =
j=1
h
0
(2
j
), a.e. R.
In this paper, we require h
0
being nitely supported. Then the corresponding renable function satises
that
esssup
R
kZ
[
( +2k)[
2
<, (1.3)
whenever L
2
(R) (see [27]).
For a compactly supported renable function L
2
(R), let V
0
be the closed shift invariant space generated
by ( k) : k Z and V
j
:= f (2
j
) : f V
0
, j Z. It is known that when is compactly supported,
then V
j
jZ
forms a multiresolution analysis. Recall that a multiresolution analysis is a family of closed
subspaces V
j
jZ
of L
2
(R) that satises: (i) V
j
V
j+1
, (ii)
j
V
j
is dense in L
2
(R), and (iii)
j
V
j
= 0
(see [2] and [26]).
For a given MRA of nested spaces V
j
, j Z with the underlying renable function and the renement
mask h
0
, it is well known that (e.g. see [2]) for any V
1
, there exists a 2 periodic function , such that
(2) =
.
Let :=
1
, . . . ,
r
V
1
, then
(2) =
h
, = 1, . . . , r, (1.4)
where
h
1
, . . . ,
h
r
are 2 periodic functions and are called framelet symbols. In the time domain, (1.4) can be
written as
(x) = 2
kZ
h
jZ
. Let (h
1
, . . . , h
r
) be a set of sequences with
_
h
1
, . . . ,
h
r
_
being a set of 2-periodic measurable functions in L
=0
[
()[
2
= 1 and
r
=0
()
( +) = 0 (1.6)
hold for almost all [, ], then the system X() where =
1
, . . . ,
r
dened in (1.5) by (h
1
, . . . , h
r
)
and forms a tight frame in L
2
(R).
Deconvolution: A Wavelet Frame Approach 5
We will use (1.6) in terms of sequences h
0
, . . . , h
r
. The rst condition
r
=0
[
()[
2
= 1 in terms of corre-
sponding sequences is
r
=0
kZ
h
[k]h
[k p] =
0,p
, p Z, (1.7)
where
0,p
= 1 when p = 0 and 0 otherwise. The second condition
r
=0
()
( +) = 0 can be written as
r
=0
kZ
(1)
kp
h
[k]h
[k p] = 0, p Z. (1.8)
With the UEP, the construction of tight framelets become painless. For example, one can construct tight
framelets from spline easily. Next, we give two examples of spline tight framelets.
Example 1.1 Let h
0
= [
1
4
,
1
2
,
1
4
] be the renement mask of the piecewise linear function (x) =max(1[x[, 0).
Dene h
1
= [
1
4
,
1
2
,
1
4
] and h
2
= [
2
4
, 0,
2
4
]. Then
h
0
,
h
1
and
h
2
satisfy (1.6). Hence, the system X()
where =
1
,
2
dened in (1.5) by using h
1
, h
2
and is a tight frame of L
2
(R). This is the rst example
constructed via the UEP in [31].
Example 1.2 Let h
0
= [
1
16
,
1
4
,
3
8
,
1
4
,
1
16
] be the renement mask of . Then is the piecewise cubic B-spline.
Dene h
1
, h
2
, h
3
, h
4
as follows:
h
1
= [
1
16
,
1
4
,
3
8
,
1
4
,
1
16
], h
2
= [
1
8
,
1
4
, 0,
1
4
,
1
8
],
h
3
= [
6
16
, 0,
6
8
, 0,
6
16
], h
4
= [
1
8
,
1
4
, 0,
1
4
,
1
8
].
Then
h
0
,
h
1
,
h
2
,
h
3
,
h
4
satisfy (1.6) and hence the system X() where =
1
,
2
,
3
,
4
dened in (1.5)
by h
1
, h
2
, h
3
, h
4
and is a tight frame of L
2
(R). This is also rst constructed in [31].
The deconvolution process has to be formulated by quasi-afne systems that were rst introduced in [31].
A quasi-afne system from level J is dened as
Denition 1.1 Let =
1
, . . . ,
r
be a set of functions. A quasi-afne system from level J is dened as
X
q
J
() =
q
, j,k
: 1 r; j, k Z,
where
q
, j,k
is dened by
q
, j,k
:=
_
D
j
T
k
, j J;
2
jJ
2
T
2
J
k
D
j
, j < J.
The quasi-afne system is obtained by over sampling the afne system. More precisely, we over sample the
afne system starting from level J 1 and downward to a 2
J
-shift invariant system. Hence, the whole quasi-
afne system is a 2
J
-shift invariant system. The quasi-afne system from level 0 was rst introduced in [31]
to convert a non-shift invariant afne system to a shift invariant system. Further, it was shown in [31, Theorem
5.5] that the afne system X() is a tight frame of L
2
(R) if and only if the quasi-afne counterpart X
q
J
() is
a tight frame of L
2
(R).
In our analysis, we use the quasi-interpolatory operator. Let V
j
, j Z be a given MRA with underlying
renable function and =
1
, . . . ,
r
be the set of corresponding tight framelets derived from the UEP.
The quasi-interpolatory operator in the afne system X() generated by is dened, for f L
2
(R),
P
j
: f
kZ
f ,
j,k
)
j,k
.
It is clear that P
j
f V
j
. As shown in [19, Lemma 2.4], this quasi-interpolatory operator is the same as trun-
cated representation
Q
j
: f
r
=1
j
/
<j,kZ
f ,
, j
/
,k
)
, j
/
,k
.
6 Anwei Chai, Zuowei Shen
Furthermore, a standard framelet decomposition given in [19] says that
P
j+1
f = P
j
f +
r
=1
kZ
f ,
, j,k
)
, j,k
and P
j
f = Q
j
f . (1.9)
When we consider the MRA based quasi-afne system X
q
J
() generated by , the spaces V
j
, j < J in the
MRA for the afne system are replaced by V
q,J
j
, j < J, for the quasi-afne system. Note that the space V
j
is
spanned by functions
j,k
, while the space V
q,J
j
is spanned by functions
q
j,k
, where
q
j,k
is dened by
q
j,k
:=
_
D
j
T
k
, j J;
2
jJ
2
T
2
J
k
D
j
, j < J.
The spaces V
q,J
j
, j <J are 2
J
-shift invariant. We dene the quasi-interpolatory operator P
q,J
j
and the truncated
operator Q
q,J
j
, j Z, for the quasi-afne system by
P
q,J
j
: f
kZ
f ,
q
j,k
)
q
j,k
(1.10)
and
Q
q,J
j
: f
r
=1
j
/
<j,kZ
f ,
q
, j
/
,k
)
q
, j
/
,k
. (1.11)
The quasi-interpolatory operator P
q,J
j
maps f L
2
(R) to V
q,J
j
. From the denition of
q
j,k
, we can see that
P
q,J
j
= P
j
when j J and these two operators are different only when j < J. Moreover, since for an arbitrary
f L
2
(R) and j < J,
P
q,J
j
f =
kZ
f ,
q
j,k
)
q
j,k
= D
J
kZ
D
J
f , 2
jJ0
2
T
k
D
jJ
)2
jJ0
2
T
k
D
jJ
= D
J
P
q,0
jJ
D
J
f ,
one only needs to understand the case J = 0. In this case we simplify our notation by setting
P
q
j
:= P
q,0
j
, Q
q
j
:= Q
q,0
j
(1.12)
for the quasi-interpolatory operators and V
q
j
:=V
q,0
j
, for j Z. From now on, we only give the properties for
P
q
j
and corresponding spaces V
q
j
and the associated quasi-afne system X
q
() := X
q
0
(). The corresponding
results for the over sampling rate of 2
J
Z can be obtained similarly.
For operator P
q
j
, j Z, we also have the decomposition and reconstruction formula similar to (1.9).
Lemma 1.1 Let X(), where the framelets =
1
, . . . ,
r
, be the afne tight frame system obtained from
h
0
and via the UEP and X
q
() be the quasi-afne frame derived from X(). Then we have
P
q
j+1
f = P
q
j
f +
r
=1
kZ
f ,
q
, j,k
)
q
, j,k
, f L
2
(R). (1.13)
More general, it was proven in [19, Lemma 2.4] that the identity P
j
f = Q
j
f holds for all f L
2
(R). Next
result shows that a similar result also holds for the quasi-afne systems.
Proposition 1.1 Let X() with =
1
, . . . ,
r
be the afne tight frame system obtained from h
0
and via
the UEP and X
q
() be the corresponding quasi-afne frame. Then we have P
q
j
f = Q
q
j
f for all f L
2
(R).
We postpone the proofs to Appendix A.
Deconvolution: A Wavelet Frame Approach 7
1.3 Discrete Form
The identity (1.13) essentially gives the decomposition and reconstruction of a function in quasi-afne tight
frame systems. In the implementation, one needs a completely discrete form of the decomposition and recon-
struction and we give such form below.
We introduce the Toeplitz matrix to describe the discrete form of the decomposition and reconstruction
procedure. Given a sequence h
0
= h
0
[k]
kZ
, the Toeplitz matrix generated by h
0
is a matrix satisfying
H
0
= (H
0
[l, k]) = (h
0
[l k]),
where the (l, k)th entry in H
0
is fully determined by the (l k)th entry in h
0
. The Toeplitz matrix is also called
the convolution matrix since it can be viewed as the matrix representation of linear time invariant lter which
can be written as convolution. Hence the convolution of two sequences can be expressed in terms of matrix
vector multiplication, i.e.
h
0
v = H
0
v. (1.14)
In the following, we will denote the Toeplitz matrix generated from h
0
by H
0
= Toeplitz(h
0
). Let H
denote
the innite dimensional Toeplitz matrix Toeplitz(h
0
H
0
+H
1
H
1
+ +H
r
H
r
= I , (1.15)
where I is the identity operator. To write the decomposition and reconstruction in convolution form, the lters
used in decomposition below the 0
th
level need to be dilated. In level j < 0, the dilated lter is denoted by
h
, j
, which is dened by (also see (A.1))
h
, j
[k] =
_
h
[2
j+1
k], k 2
j1
Z;
0, k Z2
j1
Z.
(1.16)
The corresponding Toeplitz matrix is
H
, j
= Toeplitz(h
, j
). (1.17)
By the denition of h
, j
, we have
h
, j
=
h
(2
j1
) and hence [
h
, j
[ 1 a.e. R. Moreover, as a byproduct
in the proof of Lemma 1.1, we have a condition similar to (1.15) for dilated lters h
0, j
, . . . , h
r, j
, j < 0:
H
0, j
H
0, j
+H
1, j
H
1, j
+ +H
r, j
H
r, j
= I . (1.18)
We can see that when j = 1, (1.15) and (1.18) are the same.
The discrete forms of decomposition and reconstruction from level j
1
to level j
2
, where j
1
, j
2
0, are the
same as those in the afne system, which are given in [19]. We only consider the discrete form of decompo-
sition and reconstruction from level j
1
to level j
2
, where j
1
, j
2
< 0. For a function f L
2
(R), we decompose
f in X
q
() and collect the coefcients in each level j < 0 to form an innite column vector
v
, j
:= [. . . , f ,
q
, j,k
), . . .]
t
,
where
q
0
:=
q
and [ ]
t
is the transpose of a row vector. Set the Toeplitz block matrix
H
j
:=
_
H
0, j
, H
1, j
, . . . , H
r, j
t
.
With this, condition (1.18) implies H
j
H
j
= I . The decomposition process (1.13) can be written in the matrix
form as:
v
, j
= H
, j
v
0, j+1
, = 0, . . . , r,
or
_
v
0, j
, . . . , v
r, j
t
=H
j
v
0, j+1
. (1.19)
Because of (1.18), the reconstruction process of Lemma 1.1 can be interpreted in the discrete form as
v
0, j+1
=H
j
H
j
v
0, j+1
= H
0, j
H
0, j
v
0, j+1
+H
1, j
H
1, j
v
0, j+1
+ +H
r, j
H
r, j
v
0, j+1
= H
0, j
v
0, j
+H
1, j
v
1, j
+ +H
r, j
v
r, j
.
(1.20)
8 Anwei Chai, Zuowei Shen
The identities (1.19) and (1.20) together give the equivalent discrete representation of (1.13).
The above discussion essentially is one level decomposition and reconstruction. Next, we introduce the
notation of several to innite levels decomposition and reconstruction. For any sequence v, it is decomposed
by H
1
v rst, then the low frequency component H
0
v is further decomposed by the same procedure. The
same process goes inductively. To describe this discrete process, we dene the decomposition operator A
J
,
J < 0 and A. They are composed of matrix blocks like H
, j
1
j
/
=j
H
0, j
/ where
1
j
/
=j
H
0, j
/ is the composition of
[ j[ Toeplitz matrices H
0, j
/ , j j
/
1, given by (A.1) and (1.17), and acts on any sequence v
2
(Z) in the
following order:
1
j
/
=j
H
0, j
/ v = H
0, j
H
0, j+1
H
0,1
v.
The decomposition operator A
J
is a (rectangular) block matrix dened as:
[
_ 1
j=J
H
0, j
_
;
_
H
1,J
1
j=J+1
H
0, j
_
; . . . ;
_
H
r,J
1
j=J+1
H
0, j
_
; . . . ; H
1,1
; . . . ; H
r,1
]
t
(1.21)
and A is dened as
[. . . ;
_
H
1,J1
1
j=J1
H
0, j
_
; . . . ;
_
H
r,J1
1
j=J1
H
0, j
_
;
_
H
1,J
1
j=J
H
0, j
_
; . . . ;
_
H
r,J
1
j=J
H
0, j
_
;
_
H
1,J+1
1
j=J+1
H
0, j
_
; . . . ;
_
H
r,J+1
1
j=J+1
H
0, j
_
; . . . ; H
1,1
; . . . ; H
r,1
]
t
.
(1.22)
In (1.21) and (1.22), H
,1
= H
, = 0, 1, . . . , r and thus A
1
=H
1
.
As we will see that both A
J
and A are the operators dened on
2
(Z) into the tensor product space
r,[J[
=0, j=1
, j
2
(Z) and
r,
=0, j=1
, j
2
(Z)
respectively, with
, j
2
(Z) =
2
(Z). The reconstruction operators
A
J
= [
_ J
j=1
H
0, j
_
;
_ J+1
j=1
H
0, j
H
1,J
_
; . . . ;
_ J+1
j=1
H
0, j
H
r,J
_
; . . . ; H
1,1
; . . . ; H
r,1
] (1.23)
and
A
= [. . . ;
_ J1
j=1
H
0, j
H
1,J1
_
; . . . ;
_ J1
j=1
H
0, j
H
r,J1
_
;
_ J
j=1
H
0, j
H
1,J
_
; . . . ;
_ J
j=1
H
0, j
H
r,J
_
;
_
J+1
j=1
H
0, j
H
1,J+1
_
; . . . ;
_
J+1
j=1
H
0, j
H
r,J+1
_
; . . . ; H
1,1
; . . . ; H
r,1
]
(1.24)
are the adjoint operators of A
J
and A respectively.
The operators A
J
and A are closely related to P
0
and Q
q
0
. By Lemma 1.1 we have the identity
P
0
f = P
q
J
f +
r
=1
1
j=J
kZ
f ,
q
, j,k
), J < 0.
The corresponding coefcients on the right hand side is A
J
v
0,0
with v
0,0
= f ,
0,k
). Similarly, the coef-
cients on the right hand side of the identity used in analysis
P
0
f = Q
q
0
f
can be obtained by Av
0,0
. Furthermore, the next proposition shows that the decomposition and reconstruction
process is perfect, i.e. A
J
A
J
= I and A
J
A
J
= I and A
2
(Z)
. To simplify our notation, we use | | := | |
2
(Z)
.
Our approach starts with the renable function generated by the low pass lter h
0
. There are many suf-
cient conditions on the low pass lter h
0
with
h
0
(0) = 1 under which is in L
2
(R). Here we assume that h
0
satises the following condition
[
h
0
()[
2
+[
h
0
( +)[
2
1, a.e. R. (2.2)
As we will show in Appendix A, the corresponding renable function is in L
2
(R) under assumption (2.2).
We further remark that this is not a strong assumption. For example, all renement masks of B-splines, the
renable functions whose shifts form an orthonormal system derived in [17], the base functions of interpola-
tory functions, and more general, pseudo-splines introduced by [19] and [21] satisfy this assumption. In fact,
many low pass lters used in practical problems satisfy (2.2). For example, the low pass lters used in high
resolution image reconstructions satisfy (2.2). Furthermore, with this assumption, we can construct a tight
frame system via unitary extension principle of [31] which is used in our algorithm.
To make our ideas work here, the crucial step is to construct a tight frame or a bi-frame system via a
multiresolution analysis with underlying renement mask being the given low pass lter. The assumption
(2.2) is a necessary and sufcient condition to construct a tight frame system associated with the given low
pass lter. However, assumption (2.2) is not crucial for our idea to work. For example, when the underlying
renable function is in L
2
(R), whose renement mask is the given low pass lter in (2.1), together with
some additional minor conditions, we can always obtain a bi-frame system via the mixed unitary extension
principle of [32] and more generally the mixed oblique extension principle of [15] and [19]. For example, let
h
0
(z) :=
kZ
h
0
[k]z
k
.
Then (2.2) can be replaced by the condition that h
0
(z) and h
0
(z) have no common zeros in complex domain.
With this, one can construct a bi-frame system by using the mixed unitary extension principle. This is essen-
tially the approach taken by [6]. Our analysis can be carried out for this case with some efforts. To simplify
our discussion here, we only use the tight frame system, hence assume (2.2).
Finally, since our approach is based on denoising schemes with threshold of framelet coefcients, we
implicitly assume that the underlying function of the data set has a sparse representation in the tight frame
system and the errors are relatively small and spread out uniformly in the frame transform domain.
2.1 Formulation in MRA
This section is to formulate the problem of solving
h
0
v = b+ = c (2.3)
via the multiresolution analysis framework. As we will see, the approach here reduces solving equation (2.3)
to the problem of lling the missing framelet coefcients. This approach was rst taken by [6], however, we
give a complete analysis and formulation here.
As we mentioned before, by using
P
J
f = D
J
P
0
D
J
f and P
q,J
J1
f = D
J
P
q
1
D
J
f ,
we may assume that data set is given on Z (i.e. J = 0) without loss of generality. In fact, when the data set is
given on 2
J
Z, we consider function f (2
J
) instead of f . The approximation power of a function f in space
V
J
is the same as that of the function f (2
J
) in space V
0
.
10 Anwei Chai, Zuowei Shen
Let L
2
(R) be the renable function with renement mask h
0
and h
1
, . . . , h
r
be high pass lters ob-
tained via the UEP which are the framelet masks of
1
, . . . ,
r
. First we suppose that the given data set
contains no error, i.e. = 0. The convolution equation h
0
v = b implies that b is obtained by passing the
original sequence v through a low pass lter h
0
. Assume that b = S,
q
1,k
), where S L
2
(R) is the under-
lying function from which the data set b is obtained. Then we are given
P
q
1
S =
kZ
S,
q
1,k
)
q
1,k
=
kZ
b[k]
q
1,k
. (2.4)
Let v
S
= S,
0,k
), then
P
0
S =
kZ
S,
0,k
)
0,k
=
kZ
v
S
[k]
0,k
. (2.5)
Applying the framelet decomposition algorithm (1.13), one obtains that h
0
v
S
= b. This implies that solving
equation (2.3) is equivalent to reconstructing the quasi-interpolation P
0
S V
0
from the quasi-interpolation
P
q
1
S V
q
1
. Since
P
0
S = P
q
1
S+
r
=1
kZ
S,
q
,1,k
)
q
,1,k
,
to recover v
S
= S,
0,k
) from given b, we need the framelet coefcients S,
q
,1,k
). This leads to an
algorithm that restores v
S
from data b iteratively by updating the framelet coefcients S,
q
,1,k
) in each
iteration. All these have been given in [6] and consequent papers [8, 9] in their reconstructions of high resolu-
tion images. In fact, it motivates the algorithms developed in [6, 8, 9].
By this approach, we not only give a solution of (2.3), but also give an interpretation in terms of the
underlying function S where we view the data b = S,
q
1,k
) as the given sample of S. Under this setting,
we are given P
q
1
S V
q
1
, and the solution of (2.3) leads to P
0
S V
0
, which is a higher resolution subspace
in the multiresolution analysis. Although there are more than one function whose quasi-interpolations are
P
q
1
S and P
0
S given as (2.4) and (2.5), we never get the underlying function S. One can only expect to obtain
a better approximation P
0
S of S from P
q
1
S. The approximation power of P
0
S and P
q
1
S and their difference
can be established for smooth functions by applying the corresponding results in [19] which depend on the
properties of the underlying renable function; more general for piecewise smooth functions, it can be studied
by applying results and ideas from [3] and [4] which depend on the properties of the framelets. We omit the
detailed discussion here.
Roughly speaking, the idea of solving equation (2.3) here can be understood as for a given coarse level ap-
proximation P
q
1
S to nd a ner level approximation P
0
S is reduced to nding the coefcients v
S
=S,
0,k
).
The derivation of v
S
is an iterative process which recovers P
0
S from P
q
1
S as discussed before and detailed
in the algorithms given in the next section. Then h
0
v
S
= b by the decomposition algorithm (1.13) and we
conclude that v
S
is a solution of (2.3).
However, the given data may contain errors, i.e. instead of b, the data set is given in the form of c = b+.
Furthermore, the given data set b may not be necessary of the form of S,
q
1,k
), for some S L
2
(R). In
both cases, the exact
2
(Z) solution of h
0
v = c may not exist or it may not be desirable or possible to get
the exact solution.
Nevertheless, there is a need to have
s =
r
=1
j<0,kZ
s
, j,k
q
, j,k
V
0
to approximate the underlying function where the sample data set c comes from. Let
s = s
, j,k
and s =A
s, (2.6)
where A
is the reconstruction operator given in (1.24). For the vector s being a candidate of the solution of
(2.3), it requires h
0
s within the ball of c and the function s has some smoothness. The smoothness of the
function is reected by the decay of the framelet coefcients which is measured by the
p
norm of s. Given
any sequence v determined by three indices (, j, k) with = 1, . . . , r, j < 0 and k Z, we say v is in space
p
, for a given p, if
r
=1
j<0,kZ
[ v
, j,k
[
p
<.
Deconvolution: A Wavelet Frame Approach 11
Assuming that there exists a function S such that s
, j,k
= S,
q
, j,k
), then function s = Q
q
0
S. For a given p,
1 p 2, we say that the pair (s, s) dened in (2.6) is the solution of (2.3) (and s is an approximation of the
underlying function of the data set) if for an arbitrary g L
2
(R), the pair (g, g) where g = g,
q
, j,k
) and
g =A
=1
j<0,kZ
j
[g,
q
, j,k
)[
p
|h
0
s c|
2
+
r
=1
j<0,kZ
j
[S,
q
, j,k
)[
p
. (2.7)
Here
j
/
, j Z, where 0 <
/
, are parameters which will be determined by the error level.
The function s is considered as an approximation of the underlying function whose sample is given by c.
The rst term measures the residue of the solution s and the given data set c. The second term is a penalization
term using a weighted (with weights
j
)
p
-norm of the coefcients of framelets. Since the framelet coef-
cients are closely related to the smoothness of the underlying function (see [3, 4]), minimization problem (2.7)
balances the tness of the solution and the smoothness of the solution function s.
The minimization condition (2.7) can be stated as following: for a xed p, 1 p 2, the pair (s, s) dened
in (2.6) is a solution of (2.3) (the function s is an approximation of the underlying function of the data) if for
an arbitrary L
2
(R) satisfying ,
q
, j,k
) =
, j,k
=
p
, the pair (, ), where =A
, satises
the following inequality
|h
0
(s +) c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
+
, j,k
[
p
|h
0
s c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
. (2.8)
However, as we will see that the sequence s is uniquely determined by algorithms, it may not be of the
form S,
q
, j,k
) for any S L
2
(R), since
q
, j,k
j<0
is redundant which implies that the representation of s
is not unique. Nevertheless, the pair (s, s) can still be considered as a solution of equation (2.3) if (2.8) holds
for an arbitrary pair (, ), where =
, j,k
= ,
q
, j,k
)
p
, and =A
with L
2
(R). Here, we
note that since = ,
q
, j,k
), =A
s, is the solution
of (2.3) if for an arbitrary pair (, ) satisfying =A
p
, the following inequality holds:
|h
0
(s +) c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
+
, j,k
[
p
|h
0
s c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
. (2.9)
We note that (2.8) and (2.9) look similar, but they are derived in different settings. For example, sequences
in (2.8) are derived from the analysis sequences of functions under the given wavelet frame system, while
sequences (2.9) are obtained in a purely discrete sense via lters of the given wavelet frame system. We
should also remark here the condition s =A
, since A
= A
A = by A
A = I . However,
the condition s = A
= I does not hold for any redundant system. The reasons for imposing the different
conditions are due to that (s, s) is obtained by the algorithm which only satises s =A
0
H
0
+
r
=1
H
= I . (2.10)
12 Anwei Chai, Zuowei Shen
Let v
n
be the solution for the n
th
iteration, then
H
0
H
0
v
n
+
r
=1
H
v
n
= v
n
. (2.11)
First, we consider the case that b = S,
q
1,k
), where S is the underlying function and b is the given data
as a set of the samples of S, and = 0. Then by h
0
v
S
= b with v
S
= S,
0,k
), we have v
S
is a solution
to equation (2.3). In each iteration, we can replace H
0
v
n
by the known data b to improve the approximation.
This can be also viewed as that we use the framelet coefcients of the n
th
iteration to approximate the framelet
coefcients of the underlying function S. We summarize the algorithm as follows:
Algorithm 2.1
(i) Choose an initial approximation v
0
(e.g. v
0
= b);
(ii) Iterate on n until convergence:
v
n+1
= H
0
b+
r
=1
H
v
n
. (2.12)
As we will see in the next section, Algorithm 2.1 converges, but it converges slowly. We need to adjust the
iteration in Algorithm 2.1 to quicken the convergence, which motivates us to introduce the acceleration factor
0 < < 1 into the above algorithm. The new iteration with is given below:
v
n+1
= (H
0
c +
r
=1
H
v
n
) = H
0
c +
r
=1
H
v
n
. (2.13)
This scheme can be viewed as the traditional regularization method used in noise removal, the solution of
which satises the matrix equation
_
H
0
H
0
+(1)
r
=1
H
_
v = H
0
c.
Here is a regularization parameter. The solution of the original convolution equation (2.3) is v = v
/ with
v
the solution to the above matrix equation. The solution v minimizes the following functional:
|H
0
vc|
2
+
1
|v|
2
.
This is the standard regularization form with a special regularization operator, which was more or less the [6,
Algorithm 2] given to us. The parameter has to be carefully chosen to balance the error and smoothness of
the solution. It plays a role in both convergence acceleration and error removal. However, when a different
penalty functional instead of
2
norm of the solution (e.g. the one given in the formulation), which is desirable
in many applications, is used, we need a different approach. In our new algorithms, the acceleration factor is
mainly used to accelerate the convergence and leave the regularization part to a threshold process. Finally,
we remark that, as will see in 4, in the numerical implementation, when proper boundary conditions (e.g.
periodic boundary condition with some modications) are used, the matrix H
0
becomes a nonsingular nite
order matrix. The iteration in Algorithm 2.1 converges with rate 1 , where is the minimum eigenvalue
of H
0
H
0
. Hence, we do not need to introduce the acceleration factor .
Next, we introduce the following denoising operators to the iteration (2.13).
Denoising Operator When data are contaminated with errors, we need to remove the errors from each iter-
ation before putting it into the next iteration. The denoising scheme is needed to prevent the limit of iteration
(2.13) from following the noise residing in c. For any vector v and given p, 1 p 2, let threshold operator
be
D
p
(v) :=
_
t
p
(v[0]), t
p
(v[1]), . . .
t
, (2.14)
where t
p
(x) := t
1
(x) := x +
p
2
sgn(x)[x[
p1
. (2.15)
Deconvolution: A Wavelet Frame Approach 13
Function F
p
(x) is a one-to-one differentiable function with unique inverse. For 1 < p 2, the explicit formula
of the inverse of function F
p
is not always available. Numerical method may be needed to calculate the value
of t
p
(x) := (F
p
)
1
(x). As we will see, the difference of the threshold operators D
p
according to different p is
that the limit of the algorithm has different minimization properties.
When a signal v is given, the normal procedure is rst transforming v to the framelet domain via the
decomposition operator A to decorrelate the signal, and then applying the threshold operator D
p
j
with the
threshold parameter
j
depending on the decomposition level j. For a given sequence v
2
(Z), the denoising
operator T
p
which applies the threshold operator D
p
j
on Av with the threshold parameters
j
is dened
as:
T
p
A(v) = [D
p
j
(H
, j
1
j
/
=j+1
H
0, j
/ v)]
t
, j
, (2.16)
where 1 p 2, = 1, 2, . . . , r, j < 0. This noise removal scheme will then be applied at each iteration
before applying the next iteration in Algorithm 2.1.
Algorithm 2.2 is given in [9] which was motivated by [18]. At the n
th
step, the threshold operator is applied
to the framelet decomposition of H
0
c +
r
=1
H
v
n
. The parameters
j
are xed during the iteration.
Algorithm 2.2
(i) Choose an initial approximation v
0
(e.g. v
0
= c);
(ii) Iterate on n until convergence:
v
n+1
=A
T
p
A(H
0
c +
r
=1
H
v
n
); (2.17)
(iii) Suppose the limit of step (ii) is v
= v
/.
We will prove that the pair (s
, s
) where s
=
1
T
p
A(H
0
c +
r
=1
H
0
c +
r
=1
H
(A
T
p
A)(H
v
n
); (2.18)
(iii) Let v
be the nal iterative solution from (ii). Then the solution to the algorithm is
s
= v
/.
For better denoising effect, we may apply the denoising scheme to the nal result s
T
p
A(s
)
to further remove the error effect arose by c, which is used in [6, 8, 9].
3 Analysis of Algorithms
This section focuses on the analysis of the algorithms given in 2.2. We rst show that all algorithms converge.
Secondly, we prove that the solutions of Algorithm 2.2 and 2.3 satisfy some minimization property.
14 Anwei Chai, Zuowei Shen
3.1 Convergence
In this section, we will show the convergence of Algorithm 2.1, 2.2 and 2.3. The proof of the convergence of
Algorithm 2.1 was given in [6] and [9]. We include the proof here for the sake of the self completeness of the
paper. However, the convergence of Algorithm 2.2 and 2.3 is new. This is important, since both algorithms
are the ones used in practice.
Proposition 3.1 Let h
1
, . . . , h
r
be the high pass lters of a tight frame system derived by the UEP with nitely
supported h
0
being the given low pass lter which satises (2.2). Suppose there exists a function S such that
c =S,
q
1,k
). Then for arbitrary v
0
2
(Z), the sequence v
n
dened by (2.12) converges to v =S,
q
0,k
).
Especially, h
0
v = c.
Proof The proof was given in [6]. Writing (2.12) in frequency domain, one obtains
v
n+1
=
h
0
c +
r
=1
v
n
.
Let v = S,
0,k
). Since c = S,
q
1,k
), v is the solution to (2.3). Using the UEP condition, we have
v =
h
0
c +
r
=1
v.
For arbitrary v
0
2
(Z), applying the iteration n times, we have
v
n
v =
_
r
=1
_
n
(v
0
v).
From (2.2), we have 0 [
h
0
()[ 1 a.e. R and [
h
0
()[ = 0 only holds on a zero measure set since
h
0
() is a polynomial the zero points of which are nite. Because h
1
, . . . , h
r
satisfy (1.6), it follows that
r
=1
[
()[
2
1, a.e. R
and the equality only holds on a zero measure set. Thus we have [v
n
v[ [v
0
v[ and v
n
v 0 a.e. R
as n . Then by Dominated Convergence Theorem, |v
n
v|
2
(Z)
=
1
2
|v
n
v|
L
2
[,]
0, i.e. v
n
converges to v as n . .
Since [
r
=1
h
[ = 1 at , the convergence of the algorithm is slow. That is the reason why we introduce
the acceleration factor into iteration. The convergence of iteration (2.13) can be proved similarly. Next we
show the convergence of the iterations in Algorithm 2.2 and Algorithm 2.3. The following lemma is needed,
the proof of which is given in [18, Lemma 2.2].
Proposition 3.2 The denoising operator D
p
(v
1
) D
p
(v
2
)| |v
1
v
2
|.
Furthermore, let T
p
be the denoising operator dened by (2.16), it also satises that
|T
p
A(v
1
) T
p
A(v
2
)| |v
1
v
2
|.
In particular, T
p
A is continuous and |T
p
Av
1
| |v
1
|.
Now we are ready to show the convergence of Algorithm 2.2.
Theorem 3.1 Let h
1
, . . . , h
r
be the high pass lters of a tight frame system derived by the UEP with h
0
being the given low pass lter which satises (2.2). Then the sequence v
n
dened by (2.17) in Algorithm 2.2
converges for arbitrary initial seed v
0
2
(Z) to v
which satises
v
=A
T
p
A(H
0
c +
r
=1
H
). (3.1)
Deconvolution: A Wavelet Frame Approach 15
Proof The idea of the proof is to show that the sequence v
n
is a Cauchy sequence. We rst note that
|A
| 1. Let
v
n
=A
T
p
A(H
0
c +
r
=1
H
v
n1
)
and for m > 0
v
n+m
=A
T
p
A(H
0
c +
r
=1
H
v
n+m1
).
For convenience, denote u = H
0
c +
r
=1
H
v
n1
and u
/
= H
0
c +
r
=1
H
v
n+m1
. Then using
Proposition 3.2 we have:
|v
n+m
v
n
| = |A
(T
p
Au
/
T
p
Au)| |T
p
Au
/
T
p
Au| |u
/
u| |v
n+m1
v
n1
|.
Inductively, we nally obtain that
|v
n+m
v
n
|
n
|v
m
v
0
|. (3.2)
Then sequence v
n
is a Cauchy sequence if v
n
is bounded. Since 0 < < 1, indeed due to Proposition 3.2
we have
|v
n
| = |A
T
p
Au| |T
p
Au| |u| |c|+|v
n1
|
1
|c|+|v
0
|. (3.3)
Hence the limit of the iteration (2.17) exists. The limit v
satisfying v
=A
T
p
A(H
0
c+
r
=1
H
)
follows the continuity of T
p
A. .
Here we note that the limit v
=T
p
A(H
0
c+
r
=1
H
),
then the pair (v
, v
) satises v
=A
, s
) with
s
=
1
and s
=
1
T
p
A(H
0
c +
r
=1
H
)
also satises s
= A
, s
which satises
v
= H
0
c +
r
=1
H
T
p
A(H
). (3.4)
3.2 Minimization Property of Algorithm 2.2
In this section, we discuss to what extend that the solution s
=
1
and s
=
1
,
where
v
=T
p
A(H
0
c +
r
=1
H
) and v
=A
(3.5)
are obtained from the limit of iteration (2.17). First, if s
,
p
, then for any pair (, ) with = A
p
,
the values of both sides in (2.9) are innite and the inequality holds. For the case s
p
, what we will prove
is a slightly weaker result than (2.9) for the pair (s
, s
) as stated below.
16 Anwei Chai, Zuowei Shen
For the given constant C > 0 and arbitrary > 0, the pair (s
, s
+) c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
+
, j,k
[
p
|h
0
s
c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
, (3.6)
for any pair (, ) satisfying =A
p
with || C, as long as the acceleration factor is close enough
to 1.
As we will see in the next section, when certain boundary conditions are imposed in numerical implemen-
tations, the solution will satisfy (2.9).
We rst prove the following statement: for given > 0 and C > 0, the pair (v
, v
+) c|
2
+
r
=1
j<0,kZ
2p
j
[ v
, j,k
+
, j,k
[
p
+(1)
2
r
=1
|H
(v
+)|
2
|H
0
v
c|
2
+
r
=1
j<0,kZ
2p
j
[ v
, j,k
[
p
+(1)
2
r
=1
|H
|
2
,
(3.7)
whenever the pair (, ) satisfying =A
p
with || C and the acceleration factor is close enough
to 1. Note that the threshold parameters
2p
j
are smaller than those in (3.6). It is reasonable because the
use of acceleration factor helps to damp out the noise residing in c.
To show (3.7), we introduce the following functionals. For a given pair of sequences (v, v) satisfying
v =A
=1
j<0,kZ
2p
j
[ v
, j,k
[
p
+(1)
2
r
=1
|H
v|
2
(3.8)
and
(v; a) := |H
0
vc|
2
+
r
=1
j<0,kZ
2p
j
[ v
, j,k
[
p
+
r
=1
|H
(va)|
2
. (3.9)
It is clear that when a = v, we have
(v; v) = (v). Furthermore, the following result on
(v; a) holds.
Proposition 3.3 Suppose c =Ac is in
p
. Let h
1
, . . . , h
r
be the high pass lters obtained from h
0
by the UEP
and H
0
, H
1
, . . . , H
r
be the corresponding matrix counterparts of these lters as dened in (1.14). Given a pair
(a, a) satisfying a =A
a and a
p
, let
v
=T
p
A(H
0
c +
r
=1
H
a) =T
p
A(a(H
0
(c H
0
a))) (3.10)
and v
=A
, v
(v
+; a)
(v
; a) +||
2
. (3.11)
A similar proposition is proved in [18], where the underlying system used in denoising is orthonormal
basis. The proof depends on the fact A
A = AA
A = I , while AA
,= I . This adds difculties to the proof and it also leads to the conditions on the pairs
(v
, v
, v
), we need that v
j
> 0, j < 0.
This condition is natural in applications. Indeed, this assumption requires to discard the framelet coefcients
when [ j[ is sufciently large, because for a given signal, when [ j[ is large enough, the coefcients of the low
frequency subband are very small and can be discarded anyway. We rst prove the following lemma:
Deconvolution: A Wavelet Frame Approach 17
Lemma 3.1 Let h
1
, . . . , h
r
be the high pass lters of a tight frame system derived by the UEP with h
0
being
the given low pass lter. Suppose the threshold parameters > 0, then there exists a constant 0 < < 1 such
that for any sequence v
2
(Z)
|D
p
(v)| |v|,
where D
p
j
> 0, we have
|T
p
A(v)| |v|, 0 < < 1.
Proof By (2.14), we have
|D
p
(v)|
2
=
kZ
[t
p
(v[k])[
2
.
When p =1, it is the soft-threshold function: t
(v[k])
v[k]
1
2[v[k][
1
2|v|
.
Since v
2
(Z), we have = sup
kZ
(v[k])
v[k]
1
2|v|
< 1.
Next, when 1 < p 2, by (2.15), we have t
p
(x) = (F
p
)
1
(x) where F
p
(x) = x +
p
2
sgn(x)[x[
p1
. Given
v[k] for a xed k Z, if (F
p
)
1
(v[k]) ,= 0, then let y = (F
p
)
1
(v[k]) and we have
(F
p
)
1
(v[k])
v[k]
y
y +
p
2
sgn(y)[y[
p1
=
1
1+
p
2
[y[
p2
1
1+
p
2
|v|
p2
< 1.
When (F
p
)
1
(v[k]) = 0, it is clear that
[(F
p
)
1
(v[k])[
1
1+
p
2
|v|
p2
[v[k][.
Thus when 1 < p 2, we take
= sup
kZ
(F
p
j
)
1
(v[k])
v[k]
1
1+
p
2
|v|
p2
< 1.
Thus threshold operator D
p
satises
|D
p
(v)| |v|, 1 p 2.
For the denoising operator, by (2.16),
|T
p
A(v)|
2
=
r
=1
1
j=
|D
p
j
(H
, j
1
j
/
=j+1
H
0, j
/ v)|
2
,
then for each sequence H
, j
1
j
/
=j+1
H
0, j
/ v, there exists
, j
such that
|D
p
j
(H
, j
1
j
/
=j+1
H
0, j
/ v)|
, j
|H
, j
1
j
/
=j+1
H
0, j
/ v|.
Since |H
, j
1
j
/
=j+1
H
0, j
/ v| |v| and inf
j
j
> 0, we can take
= sup
, j
, j
_
1
2|v|
< 1, when p = 1;
1
1+
p
2
|v|
p2
< 1, when 1 < p 2.
18 Anwei Chai, Zuowei Shen
Hence,
|T
p
A(v)|
2
=1
1
j=
2
, j
|H
, j
1
j
/
=j+1
H
0, j
/ v|
2
2
|v|
2
,
which completes our proof. .
Note that since D
p
be the limit of iteration (2.17) for 0 < < 1. Assume that the threshold
parameters
j
, j < 0 are independent of iteration and and inf
j
j
> 0. Then there exists C > 0, such
that |v
n
be the sequence obtained by
iteration (2.17) in Algorithm 2.2. Applying Lemma 3.1 and the argument used in (3.3) lead to
|v
n
| (|c|+|v
n1
|)
1
|c|+|v
0
|.
Let C =
1
|c|+|v
0
|, then |v
n
| C. Hence, the limit v
to v
n
also satises that |v
| C.
With this, a consequence of Proposition 3.3 is the minimization property of v
.
Proposition 3.5 Suppose c = Ac
p
. For given > 0 and C > sup
|v
, v
) of iteration (2.17) in
Algorithm 2.2 satises the inequality (3.7) for an arbitrary pair (, ) with =A
p
and || C.
Proof Note inequality (3.7) for an arbitrary is equivalent to
(v
+) (v
)
for all (, ), satisfying =A
p
and || C.
Applying Proposition 3.3 by letting a = v
, we have inequality
(v
+; v
)
(v
; v
) +||
2
(3.12)
for any pair (, ) satisfying = A. Since limit v
satises v
= A
T
p
A(H
0
c +
r
=1
H
) by
Theorem 3.1 and since v
satises v
=A
T
p
A(H
0
c +
r
=1
H
= v
. Hence,
(v
; v
) =
(v
; v
) = (v
).
By the denition of
(v; a), one obtains that
(v
+; v
) = |H
0
(v
+) c|
2
+
r
=1
j<0,kZ
2p
j
[ v
, j,k
+
, j,k
[
p
+
r
=1
|(1)H
(v
+) +H
|
2
.
Since
|(1)H
(v
+) +H
|
2
(1)
2
|H
(v
+)|
2
+
2
|H
|
2
+2(1)|H
||H
(v
+)|,
Deconvolution: A Wavelet Frame Approach 19
this leads to
(v
+; v
) |H
0
(v
+) c|
2
+
r
=1
j<0,kZ
2p
j
[ v
, j,k
+
, j,k
[
p
+(1)
2
r
=1
|H
(v
+)|
2
+
2
r
=1
|H
|
2
+2(1)
r
=1
|H
||H
(v
+)|.
Note that
(v
+) = |H
0
(v
+) c|
2
+
r
=1
j<0,kZ
2p
j
[ v
, j,k
+
, j,k
[
p
+(1)
2
r
=1
|H
(v
+)|
2
.
So we have
(v
+) +
2
r
=1
|H
|
2
+2(1)
r
=1
|H
||H
(v
+)|
(v
+; v
) (v
) +||
2
.
This leads to the following equality
(v
+) (v
) +||
2
2
r
=1
|H
|
2
2(1)
r
=1
|H
||H
(v
+)|.
Using
||
2
= |H
0
|
2
+
r
=1
|H
|
2
,
one obtains
||
2
2
r
=1
|H
|
2
2(1)
r
=1
|H
||H
(v
+)|
= |H
0
|
2
+(1)
r
=1
|H
|
_
(1+)|H
|2|H
(v
+)|
_
|H
0
|
2
+2(1)
r
=1
|H
|
_
|H
||H
||H
|
_
= |H
0
|
2
2(1)
r
=1
|H
||H
|.
This leads to
(v
+) (v
) +|H
0
|
2
2(1)
r
=1
|H
||H
|. (3.13)
Because v
||H
| is
bounded by rC
2
. So given arbitrary > 0, we can take
2rC
2
and then for any (1, 1),
(v
+) (v
) +|H
0
|
2
(v
) , (3.14)
which completes the proof. .
Based on the minimization of v
|v
, s
, v
)
satises
(v
+
1
) (v
)
8
, (3.15)
as long as (
1
,
1
) satises
1
= A
1
p
and |
1
| C. From Algorithm 2.2, we have s
=
v
and
s
=
v
. Dividing
2
on both sides of (3.15), we have
|H
0
(s
+) c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
+
(
1
)
, j,k
[
p
+
(1)
2
2
r
=1
|H
(v
+
1
)|
2
|H
0
s
c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
+
(1)
2
2
r
=1
|H
|
2
8
2
.
(3.16)
Because v
and
1
are bounded, for any given > 0, we can take
2
1
5C
_
r
and then any (1
2
, 1) satises (1 )
2
r
=1
(|H
(v
+
1
)|
2
|H
|
2
) <
8
. Taking = min(
1
,
2
,
1
2
) and combining
with (3.16), the pair (s
, s
+) c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
+
, j,k
[
p
|H
0
s
c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
4
2
|H
0
s
c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
,
as long as the pair (, ) satises =A
p
and || C. .
Remark 3.1 We note that since in each iteration solution pair (v
n
, v
n
) satisfy v
n
2
(Z) and v
n
2
, and
(v
, v
2
(Z) and v
2
, and furthermore s
2
(Z) and
s
2
. The minimization property (3.6) holds with nite value on both sides whenever p = 2. For 1 p < 2,
as we have already proved that when s
p
, the solution satises the minimization inequality (3.6). In fact,
the values on the both sides of inequality (3.6) are nite.
In the proof of Theorem 3.3 (See (3.16)), when is chosen to be small (say smaller than 1/2) instead of
closing to 1, we have
|H
0
(s
+) c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
+
, j,k
[
p
+
r
=1
|H
(s
+)|
2
|H
0
s
c|
2
+
r
=1
j<0,kZ
j
[ s
, j,k
[
p
+
r
=1
|H
|
2
.
In this case, in addition to penalize the functional in (2.9) we also penalize
r
=1
|H
|
2
, (3.17)
the high frequency information of the solution. However, as we discussed in the formulation, since the de-
convolution process is essentially to recover the term
r
=1
H
0
H
0
nonsingular and hence the iteration (2.17) will converges without the acceleration factor . In
such a case, we can directly prove inequality (2.9).
3.3 Minimization Property of Algorithm 2.3
In this section, we discuss the minimization property of the solution s
which satises
v
= H
0
c +
r
=1
H
T
p
A(H
). (3.18)
Dene
v
=T
p
A(H
) and v
=A
, = 1, . . . , r. (3.19)
If we further denote c by v
0
, the limit of iteration (2.18) satises v
=A
1
v
r
=0
where A
1
is given by
(1.23). We denote the quantities that determine the limit v
by the (r +1)-tuple be (v
, v
1
, . . . , v
r
).
The solution of Algorithm 2.3 is given by another (r +1)-tuple (s
, s
1
, . . . , s
r
) with
s
=
v
and s
=A
, = 1, . . . , r. (3.20)
Since v
=A
1
s
r
=0
= H
0
c +
r
=1
H
, (3.21)
where s
0
:= c and v
, v
, = 1, . . . , r, by ( s
/
, j,k
where
/
= 1, . . . , r, j < 0 and k Z.
The solution of Algorithm 2.3 has different minimization property from the solution of Algorithm 2.2.
Given any > 0 and C > 0, the (r +1)-tuple (s
, s
1
, . . . , s
r
) satises the following inequality
|h
0
(s
+) c|
2
+
r
=1
r
/
=1
j<0,kZ
j
[( s
/
, j,k
+(
/
, j,k
[
p
|h
0
s
c|
2
+
r
=1
r
/
=1
j<0,kZ
j
[( s
/
, j,k
[
p
.
(3.22)
for any (r +1)-tuple (,
1
, . . . ,
r
) satisfying || C and
=A(H
) where
1
, . . . ,
r
p
.
Note that
= A(H
) implies that H
= A
A(H
) = A
)
is the set of coefcients of framelet packet in canonical form (see [13] and [30]). From the penalty terms in
22 Anwei Chai, Zuowei Shen
(3.22), we can also see that the terms s
=A
, = 1, . . . r, which also
reect certain smoothness of the underlying functions. It is nature to penalize the
p
-norm of framelet packet
coefcients of each high frequency component H
, s
1
, . . . , s
r
). Since the proof is similar to that of Theorem 3.3, and since we will give a full proof of
this result for the nite data set, we omit it here.
Theorem 3.4 For given > 0 and C > sup
|v
, s
1
, . . . , s
r
) of iteration (2.18) in Algorithm 2.3
satises inequality (3.22) for any (r +1)-tuple (,
1
, . . . ,
r
) satisfying
= A(H
),
1
, . . . ,
r
p
and
|| C.
4 Deconvolution of Finite Data Set
In the previous sections, our algorithms and analysis are given for the innite data set which is of theoret-
ical interests, provide the understanding and insight of algorithms, and connect to multiresolution analysis.
However, in application, given data sets are always nite, e.g. a vector in R
N
0
. Thus it is necessary to adjust
our approach for these cases. This is achieved by extrapolating the data out of the boundary. The numerical
simulation shows that the algorithms work well under different boundary conditions as shown in [6, 8, 9].
4.1 Algorithms for Finite Data
In this section, we convert the algorithms given in previous sections to the ones which deal with the nite data
set. The convolution equation becomes
h
0
v = b+ = c
with the nite given data set c and ||
2
= <. Since our data are no longer innite, the boundary conditions
are needed to extend the data beyond their original domain. Basically, there are three types of boundary
conditions: zero-padding, periodic and symmetric. Since the zero-padding boundary condition simply adds
zeros out of the original domain, it is more or less reduced to the case discussed in the previous section and it
normally gives boundary artifacts, we omit discussion on this case. We focus on more detailed discussion on
periodic boundary condition and the discussion of symmetric boundary condition can be carried out similarly.
When the given data set is extended using the periodic boundary condition, i.e.
[n] = [nmodN
0
], n Z,
where N
0
is the length of data , the convolution of data with given lter h
0
then becomes a special kind
of convolution, circular convolution. We denote such circular convolution by
h
0
.
The circular convolution can also be written as a matrix-vector multiplication where the matrix is a circulant
matrix, a special kind of Toeplitz matrix, i.e. the entries of matrix H
0
generated from h
0
are
H
0
[l, k] = h
0
[(l k)modN
0
], 0 l, k < N
0
. (4.1)
Using periodic boundary condition to extend data implies that the matrices H
0
, H
1
, . . . , H
r
used in con-
volution are now circulant matrices of nite order generated from the lters h
0
, h
1
, . . . , h
r
. Further, we have
dilated lters h
0, j
, . . . , h
r, j
for the jth level decomposition, where h
, j
is obtained by inserting 2
j1
1 zeros
Deconvolution: A Wavelet Frame Approach 23
between every two entries in h
J
analog to (1.21) and (1.23) by
A
J
= [
_ 1
j=J
H
0, j
_
;
_
H
1,J
1
j=J+1
H
0, j
_
; . . . ;
_
H
r,J
1
j=J+1
H
0, j
_
; . . . ; H
1
; . . . ; H
r
]
t
(4.2)
and
A
J
= [
_ J
j=1
H
0, j
_
;
_ J+1
j=1
H
0, j
H
1,J
_
; . . . ;
_ J+1
j=1
H
0, j
H
r,J
_
; . . . ; H
1
; . . . ; H
r
]. (4.3)
Each block in A
J
and A
J
is the product of a series of circulant matrices H
, j
generated from lter h
, j
. Similar
to Proposition 1.2, it can be proved that A
J
is the adjoint of A
J
and A
J
A
J
= I, where I is the identity matrix
and this identity is due to the condition similar to (1.18).
For the nite data set, we decompose to a nite level to denoise. Hence in the iteration, operators A
J
and
A
J
are used instead of A and A
j=J
H
0, j
v); D
p
J
(H
1,J
1
j=J+1
H
0, j
v); . . . ; D
p
J
(H
r,J
1
j=J+1
H
0, j
v); . . . ; D
p
1
(H
1
v); . . . ; D
p
1
(H
r
v)]
t
,
where the threshold operator D
p
is given in (2.14). With these notations, we can convert Algorithm 2.2 and
2.3 to the nite data set. The rst one is given in [9]
Algorithm 4.1 (Algorithm 2.2 for nite data)
(i) Choose an initial approximation v
0
(e.g. v
0
= c);
(ii) Iterate on n until convergence:
v
n+1
= A
J
T
p
A
J
(H
0
c +
r
=1
H
v
n
). (4.4)
The following algorithm is the basic algorithm from which the algorithms in [10, 11] are based on.
Algorithm 4.2 (Algorithm 2.3 for nite data)
(i) Choose an initial approximation v
0
(e.g. v
0
= c);
(ii) Iterate on n until convergence:
v
n+1
= H
0
c +
r
=1
H
(A
J
T
p
A
J
)(H
v
n
). (4.5)
Since c contains noise, it was suggested by the numerical simulations in [6, 8, 9], one needs to take an addi-
tional step of denoising from the nal iteration:
(iii) v = A
J
T
p
A
J
(v
n
0
).
As we can see, the difference between Algorithm 4.1 and 4.2 is the different denoising schemes used in each
iteration. Next, we discuss the algorithm which is mainly used in numerical implementation of high resolu-
tion image reconstructions in [6, 8, 9]. The algorithm applies a different decomposition operator in denoising
scheme. The decomposition operator used is dened as:
B
J
= [H
[J[
0
; H
1
H
[J[1
0
; . . . ; H
r
H
[J[1
0
; . . . ; H
1
; . . . ; H
r
]
t
(4.6)
and the reconstruction operator is its adjoint operator
B
J
= [(H
[J[
0
)
; (H
[J[1
0
)
1
; . . . ; (H
[J[1
0
)
r
; . . . ; H
1
; . . . ; H
r
]. (4.7)
It can be easily seen that the difference between A
J
and B
J
is in the blocks. In A
J
, each block is of form
H
, j
1
j
/
=j+1
H
0, j
/ ,
24 Anwei Chai, Zuowei Shen
which is a product of matrices generated from up sampled lters h
, j
, = 0, . . . , r; while in B
J
, each block is
of form
H
H
[ j[
0
,
which is a product of matrices generated from lters h
J
B
J
= I still hold. In fact, one can prove this identity easily by modifying the proof of Proposition 1.2. The
denoising scheme is formed by applying the threshold operator T
p
to B
J
v, i.e.
[H
[J[
0
v; D
p
J
(H
1
H
[J[1
0
v); . . . ; D
p
J
(H
r
H
[J[1
0
v); . . . ; D
p
1
(H
1
v); . . . ; D
p
1
(H
r
v)]
t
.
The algorithm used in high resolution image reconstructions (see [6, 8, 9]) is essentially the same as Algo-
rithm 4.2 except the A
J
and A
J
are replaced by B
J
and B
J
, where B
J
and B
J
are dened in (4.6) and (4.7), i.e.
step (ii) in above algorithm is replaced by
Algorithm 4.3 Iterate on n until convergence:
v
n+1
= H
0
c +
r
=1
H
(B
J
T
p
B
J
)(H
v
n
). (4.8)
Here we remark that in most numerical implementation of [6, 811], the hard threshold is used; though, here
we use soft threshold. We also note that in the nite dimensional data case, the acceleration factor is removed
from the iterations in both Algorithm 2.2 and 2.3. From the proof of the convergence of Algorithm 2.2 and
2.3, to remove the acceleration factor, we need that the largest eigenvalues of the matrix
r
=1
H
=1
H
= I H
0
H
0
,
the convergence of the iteration depends on the nonsingularity of matrix H
0
H
0
. As we will prove in Propo-
sition 4.1, the matrix H
0
H
0
can be nonsingular by a proper extension of the data before imposing periodic
boundary condition.
The underlying framelet analysis for all three algorithms can be carried out by using the framelets on
intervals, e.g. periodic framelets when the periodic boundary conditions are imposed. We omit the discussion
here. On the other hand, the above algorithms can be also viewed as algorithms to solve the equation:
H
0
v = b+ = c, (4.9)
where H
0
is the matrix depends on the boundary condition imposed, e.g. H
0
is a circulant matrix generated by
h
0
when the periodic boundary conditions are imposed. Since H
0
is nonsingular in this case by Proposition 4.1,
the linear system always has a unique solution.
4.2 Convergence and Minimization Properties
In this section, we discuss the convergence of Algorithm 4.1, 4.2 and 4.3 and the minimization properties of
their limits.
The analysis is based on the nonsingularity of matrix H
0
. We consider the nite data sets with periodic
boundary condition. The eigenvalues of the circulant matrix H
0
generated from h
0
can be given out explicitly
as follows:
p
[H
0
] =
K1
k=0
h
0
[k] exp
_
i2kp
N
0
_
, p = 0, 1, , N
0
1, (4.10)
where N
0
is the length of given data and K is the length of lter h
0
. Here we assume, without loss of generality,
K < N
0
. The eigenvalues of the matrix H
0
are the values of polynomial
h
0
() at =
2p
N
0
, p = 0, . . . , N
0
1.
The matrix H
0
is nonsingular, whenever
h
0
(
2p
N
0
) is not equal to zero for each p = 0, . . . , N
0
1. Since
h
0
only
Deconvolution: A Wavelet Frame Approach 25
has nitely many zeros, we can extend the data set to increase the length of the data from N
0
to N
1
(before
making a periodical extension of the data) to avoid zero eigenvalues of H
0
. This observation is summarized
in the following result, the proof of which is given in Appendix C.
Proposition 4.1 Let h
0
be the given low pass lter with length K and the given data be of length N
0
> K.
Then the data set can always be extended to have the length N
1
> N
0
such that the corresponding circulant
matrix H
0
generated from h
0
with the data set of length N
1
is nonsingular. Consequently, the matrices H
0
and
H
0
H
0
are nonsingular.
As shown in the proof, the number N
1
can be very close to N
0
. In fact, for many cases, e.g. h
0
is a
renement mask of splines, pseudo-splines or those used in high resolution image reconstructions (e.g. see
[6, 8, 9]), whenever the length of the data is odd, the corresponding circulant matrix H
0
is nonsingular. This
means that N
1
N
0
is at most 1 for those cases. In the following, we assume that the length of data is N
1
such
that the corresponding circulant matrix is nonsingular. As we will see, the nonsingularity of H
0
ensures the
convergence of iterations without using the acceleration factor . Furthermore, the threshold parameters
j
no longer need to satisfy the additional condition inf
j
j
> 0 imposed in the last section.
The convergence of iteration (4.4) in Algorithm 4.1 and iteration (4.5) in Algorithm 4.2 can be proved
based on the nonsingularity of the circulant matrix H
0
. We give the proof for the second iteration and only list
the relevant result of the rst iteration.
Theorem 4.1 Let h
0
be the low pass lter in the convolution equation and h
1
, . . . , h
r
be the high pass lters
generated from h
0
via the UEP. The corresponding circulant matrices are H
0
, . . . , H
r
with H
0
H
0
being non-
singular (by applying Proposition 4.1). Then iteration (4.4) in Algorithm 4.1 converges for any initial seed v
0
and the limit satises
s = A
J
T
p
A
J
(H
0
c +
r
=1
H
s). (4.11)
Similarly, iteration (4.5) in Algorithm 4.2 converges for any initial seed v
0
and the limit satises
s = H
0
c +
r
=1
H
J
T
p
A
J
(H
s). (4.12)
Proof We only give the proof of the convergence of iteration (4.5). The convergence of iteration (4.4) can
be proved similarly. Because H
0
is a nonsingular circulant matrix and I = H
0
H
0
+
r
=1
H
, there exists a
constant < 1 such that |
r
=1
H
|
2
= |I H
0
H
0
|
2
. Let H = [H
1
, . . . , H
r
]
t
, then we have
|H|
2
2
= max
|u|
2
=1
|Hu|
2
2
= max
|u|
2
=1
u
=1
H
u = |
r
=1
H
|
2
.
Denote g
(v,v
/
)
= [(T
p
A
J
H
1
v T
p
A
J
H
1
v
/
), . . . , (T
p
A
J
H
r
v T
p
A
J
H
r
v
/
)]
t
for any two vectors v and v
/
.
Following the proof of Theorem 3.1, given any v
0
, for any positive integers m and n,
|v
n+m
v
n
|
2
= |
r
=1
H
J
(T
p
A
J
H
v
n+m1
T
p
A
J
H
v
n1
)|
2
= |H
J
g
(v
n+m1
,v
n1
)
|
2
|H
|
2
|g
(v
n+m1
,v
n1
)
|
2
|H
|
2
|H|
2
|v
n+m1
v
n1
|
2
|v
n+m1
v
n1
|
2
.
Similarly, one can prove by using (4.5)
|v
n
|
2
|c|
2
+|v
n1
|
2
1
1
|c|
2
+|v
0
|
2
. (4.13)
Thus the iteration sequence v
n
is a Cauchy sequence and the limit exists and satises (4.12). .
Note that solution s to Algorithm 4.1 satises (4.11). Let s = T
p
A
J
(H
0
c +
r
=1
H
J
s. As we discussed in the formulation, s can be viewed as coefcients of certain
representation of the underlying function in a frame system. The following theorem states that pair (s, s) is a
solution of (2.3) with nite data set and satises minimization property (2.9) given by formulation.
26 Anwei Chai, Zuowei Shen
Theorem 4.2 Let h
1
, . . . , h
r
be the high pass lters obtained from h
0
by the UEP and H
0
, H
1
, . . . , H
r
be the
corresponding circulant matrices of these lters. Then for xed p, 1 p 2, the solution pair (s, s) satises
|H
0
(s +) c|
2
2
+
r
=1
1
j=J
N
1
1
k=0
j
[ s
, j,k
+
, j,k
[
p
|H
0
s c|
2
2
+
r
=1
1
j=J
N
1
1
k=0
j
[ s
, j,k
[
p
,
for any pair (, ) with = A
J
, where A
J
is given in (4.2).
The counterpart of this theorem has been proved in 3.2. In fact, the current case is simpler because we do
not have the acceleration factor in the iteration. Instead of giving the proof of this, we prove Theorem 4.3
which states the minimization property of the limit to iteration (4.5). Let
s
=T
p
A
J
(H
s), = 1, . . . , r.
Since solution s of iteration (4.5) satises (4.12), we have the (r +1)-tuple (s, s
1
, . . . , s
r
) satises s = H
0
c +
r
=1
H
J
s
. We will prove this tuple of nite data sets satises a similar property to its innite counterpart
and hence s is a solution to (2.3).
Theorem 4.3 Let h
1
, . . . , h
r
be the high pass lters obtained from h
0
by the UEP and H
0
, H
1
, . . . , H
r
be the
corresponding circulant matrices of these lters. Then for xed p, 1 p 2, the (r +1)-tuple (s, s
1
, . . . , s
r
)
satises the following inequality
|H
0
(s +) c|
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
[( s
/
, j,k
+(
/
, j,k
[
p
|H
0
s c|
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
[( s
/
, j,k
[
p
,
(4.14)
for any (r +1)-tuple (,
1
, . . . ,
r
) satisfying
= A
J
(H
) for = 1, . . . , r, where A
J
is given in (4.2).
Proof We prove this theorem by proving a more general inequality. For a given sequence a, we dene v
=
T
p
A
J
(H
a) and v
= A
J
( v
0
and dene v
by
v
= A
1
v
r
=0
= H
0
c +
r
=1
H
J
T
p
A
J
(H
a).
With this set up, we show that the inequality
|H
0
(v
+) c|
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
[( v
/
, j,k
+(
/
, j,k
[
p
+
r
=1
|H
(v
+) H
a|
2
2
|H
0
v
c|
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
[( v
/
, j,k
[
p
+
r
=1
|H
a|
2
2
+||
2
2
,
(4.15)
holds for any (r+1)-tuple (,
1
, . . . ,
r
) with
=A
J
(H
= H
0
c +
r
=1
H
J
T
p
A
J
(H
s) = s, and inequality
(4.14) can be easily deduced from (4.15). In the following we give the proof of (4.15), which is similar to that
of Proposition 3.3.
Given the (r +1)-tuple (,
1
, . . . ,
r
), we expand the left hand side of (4.15) as follows:
|H
0
(v
+) c|
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
[( v
/
, j,k
+(
/
, j,k
[
p
+
r
=1
|H
(v
+) H
a|
2
2
= |H
0
v
c|
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
[( v
/
, j,k
[
p
+
r
=1
|H
a|
2
2
+||
2
2
+
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
_
[( v
/
, j,k
+(
/
, j,k
[
p
[( v
/
, j,k
[
p
_
+2, v
0
c
r
=1
H
a).
(4.16)
Deconvolution: A Wavelet Frame Approach 27
Compare with (4.15), we only need to show
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
_
[( v
/
, j,k
+(
/
, j,k
[
p
[( v
/
, j,k
[
p
_
+2, v
0
c
r
=1
H
a) 0. (4.17)
Denote a
= A
J
(H
and A
J
A
J
= I, we can simplify the inner product
in (4.16) as
, v
0
c
r
=1
H
a) = ,
r
=1
H
(A
J
T
p
A
J
(H
a) H
a))
=
r
=1
A
J
(H
), T
p
A
J
(H
a) A
J
(H
a))
=
r
=1
r
/
=1
1
j=J
N
1
1
k=0
(
/
, j,k
(( v
/
, j,k
( a
/
, j,k
).
(4.18)
The last equality holds because the denoising operator T
p
does not apply to the low frequency component of
the vector. With this, (4.17) becomes
r
=1
r
/
=1
1
j=J
N
1
1
k=0
j
_
[( v
/
, j,k
+(
/
, j,k
[
p
[( v
/
, j,k
[
p
_
+2
r
=1
r
/
=1
1
j=J
N
1
1
k=0
(
/
, j,k
(( v
/
, j,k
( a
/
, j,k
) 0,
which will be proven by showing each summand is nonnegative, i.e.
j
_
[( v
/
, j,k
+(
/
, j,k
[
p
[( v
/
, j,k
[
p
_
+2(
/
, j,k
(( v
/
, j,k
( a
/
, j,k
) 0.
The rest of the proof follows from the exact the same discussion in the proof of Proposition 3.3. .
In the proof of Theorem 4.3, we do not need to consider the convergence of the sum (4.15) since unlike
the proof of Proposition 3.3, there are only nite terms in the sum.
Remark 4.1 Roughly speaking, Algorithm 4.3 is obtained by replacing decomposition and reconstruction
operators A
J
and A
J
by B
J
and B
J
in the iteration (4.5). The convergence of iteration (4.8) and the minimization
property of the corresponding limit can be obtained by simply changing those results for Algorithm 4.2 and
the proofs could be carried out with little effort.
Remark 4.2 When boundary conditions other than periodic ones are used, if the corresponding matrix H
0
H
0
is nonsingular, we still have the convergence of Algorithm 4.1, 4.2 and 4.3. If the matrix is singular, we can
solve it by introducing the acceleration factor and hence reduce the problem to the cases discussed in 3.
We need to choose close to 1 in order for better data recovery as discussed in 3.2. We omit the detailed
discussion here.
Remark 4.3 Finally, we remark that the algorithms can be applied to higher dimensions, especially the two
dimensional images. This is achieved by using tensor product lters and its underlying wavelets. The details
can found in [6, 8, 9].
4.3 Stability Analysis
In this section, we discuss the stability of the algorithms given in 4.1. An algorithmof solving H
0
v =b+ =c
is stable if the result of the algorithm approaches to the exact solution of the equation H
0
v = b, as ||
2
=
0. We give the stability analysis of Algorithm 4.1, and the analysis of Algorithms 4.2 and 4.3 is similar.
Let the threshold
j
= C
j
for some constant C
j
, J j < 0. Let C = max
Jj<0
C
j
and without loss of
generality, we take C = 1 below. For a given pair (v, v) with v = A
J
v, let
[ v[
p
:=
r
=1
1
j=J
N
1
1
k=0
C
j
[ v
, j,k
[
p
.
Let the pair (s
, s
) be the limit of iteration (4.4) associated with the error bound . Let be the exact solution
to linear system H
0
v = b, then |H
0
c|
2
= ||
2
= . Here, we note that the existence of follows from
the nonsingularity of H
0
.
28 Anwei Chai, Zuowei Shen
Proposition 4.2 Let s
|
2
= 0.
Proof We only need to consider the case when 1. Based on a similar proof of (4.13) in Theorem 4.1,
one can show that s
=T
p
A
J
(H
0
c+
r
=1
H
), we have | s
2
|c|
2
+|s
|
2
, i.e. its
2
norm of s
is bounded independent
of by Proposition 3.2. This leads to [ s
[
p
B with B dependent of neither nor p, 1 p 2, because s
is
a nite dimensional vector. From the boundedness of s
and s
)
with
= A
J
such that |s
|
2
< and [ s
[
p
B
/
. The pair (
) depends on ; however,
the constant B
/
can be chosen to be independent of , since bounds of s
, s
c|
2
2
|H
0
s
c|
2
2
+[ s
[
p
|H
0
(s
) c|
2
2
+[ s
[
p
2|H
0
(s
) H
0
|
2
2
+2|H
0
c|
2
2
+[ s
[
p
< 4
2
+B
/
.
Thus, |H
0
s
c|
2
<
4
2
+B
/
. Since matrix H
0
is nonsingular, the conclusion of this proposition follows
from |H
0
(s
)|
2
0 as 0. For arbitrary , we have
|H
0
(s
)|
2
|H
0
s
c|
2
+|H
0
c|
2
<
_
4
2
+B
/
+.
Then the stability holds by letting 0. .
4.4 Comparison of Algorithm 4.1, 4.2 and 4.3
As we mentioned before that the methods given here have been implemented successfully in high resolution
image reconstruction and comparison with some other numerical methods has been given (see [6, 811]). The
focus of this paper is to build up the theory of the methods used in [6, 811]. Here, we give some numerical
results for 1D signal to illustrate the different performances of Algorithm 4.1, 4.2 and 4.3. The methods are
evaluated by the relative error (RE), signal-to-noise ratio (SNR) and the peak signal-to-noise ratio (PSNR).
They are dened by
RE =
|v
n
v|
2
|v|
2
, SNR = 10log
10
|v
n
|
2
2
|v
n
v|
2
2
, and PSNR = 10log
10
N
0
max
kZ
(v
n
)
2
|v
n
v|
2
2
,
where v
n
is the iterative solution, v is the original data and N
0
stands for the length of signal.
We choose the piecewise smooth signals in the WaveLab Toolbox developed by Donohos research group.
Each signal rst passes the low pass lter from the cubic spline, then it is contaminated by white noise at
SNR = 25. Applying Algorithm 4.1, 4.2 and 4.3 to recover the original signal where the soft threshold is
used. The periodic boundary conditions with proper pre-extension of the data to ensure the convergence of
the iterations (4.4), (4.5) and (4.8) are used. The numerical results after 12 iterations with periodic boundary
conditions are listed in the following table.
Type of Algorithm 4.1 Algorithm 4.2 Algorithm 4.3
Signal Rel. Err. PSNR SNR Rel. Err. PSNR SNR Rel. Err. PSNR SNR
HeaviSine 0.028460 33.217871 30.915279 0.028670 33.156742 30.851414 0.069625 25.455757 23.144756
Bumps 0.069254 39.577640 23.191044 0.075530 38.491668 22.437622 0.618064 17.970758 4.179333
Blocks 0.062193 30.895391 24.125186 0.067464 30.187947 23.418590 0.258222 19.087661 11.760150
Doppler 0.049995 30.580579 26.021500 0.048172 30.905828 26.344157 0.277670 15.688367 11.129433
Ramp 0.031575 32.332058 30.013089 0.039259 30.610982 28.121127 0.157946 17.146609 16.029819
Cusp 0.024914 36.003742 32.070983 0.024833 36.054969 32.099584 0.031825 33.847170 29.944680
Sing 0.089109 46.888308 21.001552 0.089637 46.860294 20.950295 0.805165 22.317274 1.882298
Piece-Polynomial 0.067718 32.882431 23.385962 0.070475 32.443317 23.039271 0.270130 20.086519 11.368543
Piece-Regular 0.048856 34.113632 26.221678 0.049799 33.855047 26.055584 0.221711 20.171143 13.084247
The results show that the performances of Algorithm 4.1 and 4.2 are close, while Algorithm 4.3 seems
keeping the smoothness of the signal better as shown in the gures for Piece-Regular, but it does not perform
as good as other two in terms of relative error, PSNR and SNR.
Deconvolution: A Wavelet Frame Approach 29
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
30
20
10
0
10
20
30
40
50
(a)
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
30
20
10
0
10
20
30
40
50
(b)
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
30
20
10
0
10
20
30
40
50
(c)
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
30
20
10
0
10
20
30
40
50
(d)
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
30
20
10
0
10
20
30
40
50
(e)
Fig. 4.1 (a) Original signal; (b) Signal blurred by lter in Example 1.2 and contaminated by white noise at SNR=25; (c) Recon-
structed signal by Algorithm 4.1; (d) Reconstructed signal by Algorithm 4.2; (e) Reconstructed signal by Algorithm 4.3.
References
1. Beylkin G., Coifman R., Rokhlin V.: Fast Wavelet Transforms and Numerical Algorithms I. Comm. Pure Appl. Math. 44,
141-183 (1991)
2. de Boor C., DeVore R., Ron A.: On the Construction of Multivariate (Pre)wavelet. Constr. Approx. 9, 123-166 (1993)
3. Borup, L., Grivonbal R., Nielsen M.: Tight Wavelet Frames in Lebesgue and Sobolev Spaces. J. Funct. Spaces Appl. 2(3),
(2004)
4. Borup, L., Grivonbal R., Nielsen M.: Bi-framelet Systems with Few Vanishing Moments Characterize Besov Spaces. Appl.
Comput. Harmon. Anal. 17, 3-28 (2004)
5. Chan R., Chan T., Shen L., Shen Z.: A Wavelet Method for High-resolution Image Reconstruction with Displacement Errors.
In: IEEE Signal Processing Society (ed.) Proceedings of the 2001 International Symposium of Intelligent Multimedia, Video
and Speech Processing, Hong Kong, 2001, pp. 24-27. IEEE, U.S. (2001)
6. Chan R., Chan T., Shen L., Shen Z.: Wavelet Algorithms for High-resolution Image Reconstruction. SIAM J. Sci. Comput.
24(4), 1408-1432 (2003)
7. Chan R., Chan T., Shen L., Shen Z.: Wavelet Deblurring Algorithms for Sparially Varying Blur from High-resolution Image
Reconstruction. Linear Algebra Appl. 366, 139-155 (2003)
8. Chan R., Riemenschneider S., Shen L., Shen Z.: Tight Frame: The Efcient Way for High-resolution Image Reconstruction.
Appl. Comput. Harmon. Anal. 17, 91-115 (2004)
9. Chan R., Riemenschneider S., Shen L., Shen Z.: High-resolution Image Reconstruction with Displacement Errors: A Frame
Approach. Int. J. Imaging Syst. Technol. 14, 91-104 (2004)
10. Chan R., Shen Z., Xia T.: Resolution Enhancement for Video Clips : Tight frame approach. Proceedings of IEEE Interna-
tional Conference on Advanced Video and Signal-Based Surveillance, Italy, 2005, pp. 406-410.
11. Chan R., Shen Z., Xia T.: A framelet algorithm for enchancing video stills. To appear in Appl. Comput. Harmon. Anal.
12. Chan T., Shen J, Zhou H.: Total variation wavelet inpainting. J. Math. Imaging Vision 25(1), 107-125 (2006)
13. Chen D.: On the Splitting Trick and Wavelet Frame Packets. SIAM J. Math. Anal. 31(4), 726-739 (2000)
14. Chui C., He W.: Compactly Supported Tight Frames Associated With Renable Functions. Appl. Comput. Harmon. Anal.
8(3), 293-319 (2000)
15. Chui C., He W., St ockler J.: Compact Supported Tight and Sibling Frames with Maximum Vanishing Moments. Appl.
Comput. Harmon. Anal. 13, 224-262 (2002)
16. Cohen A., Hoffmann M., Reiss M.: Adaptive Wavelet Galerkin Methods for Linear Inverse Probelms. SIAM J. Numer.
Anal. 42(4), 14791501 (2004)
17. Daubechies I.: Ten Lectures on Wavelets. CBMS Conference Series in Applied Mathematics 61, SIAM, Philadelphia, PA
(1992)
18. Daubechies I., Defrise M., De Mol C.: An Iterative Thresholding Algorithm for Linear Inverse Problems with a Sparsity
Constraint. Comm. Pure Appl. Math. 57, 1413-1457 (2004)
19. Daubechies I., Han B., Ron A., Shen Z.: Framelets: MRA-based Constructions of Wavelet Frames. Appl. Comput. Harmon.
Anal. 14, 1-46 (2003)
20. De Mol C., Defrise M.: A Note on Wavelet-based Inversion Algorithms. In: Nashed M., Scherzer O. (ed.) Inverse problems,
image analysis, and medical imaging. New Orleans, LA, 2001, pp. 8596. Contemp. Math. 313, Amer. Math. Soc., Providence,
RI (2002)
21. Dong B., Shen Z.: Pseudo-spline, Wavelets and Framelets. Appl. Comput. Harmon. Anal. 22 (1), 78-104 (2007)
22. Donoho D.: Nonlinear Solution of Linear Inverse Problems by Wavelet-Vaguelette Decomposition. Appl. Comput. Harmon.
Anal. 2, 101-126 (1995)
23. Donoho D., Raimondo M.: Translation Invariant Deconvolution in a Periodic Setting. Int. J. Wavelets Multiresolut. Inf.
Process. 2(4), 415-431 (2004)
24. Engl H., Hanke M., Neubauer A.: Regularization of Inverse Problems. Kluwer Academic Publishers, Dordrecht, Boston
(1996)
25. Foster M.: An Application of the Wiener-Kolmogorov Smoothing Theory to Matrix Inversion. J. SIAM9(3), 387-392 (1961)
26. Jia R., Shen Z.: Multiresolution and Wavelets. Proc. Edinburgh Math. Soc. 37, 271-300 (1994)
27. Jia R., Micchelli C.: Using the renement equations for the construction of pre-wavelets. II. Powers of two. In: Laurent P.,
Mehaute A., Schumaker L. (ed.) First International Conference on Curves and Surfaces, Chamonix-Mont-Blanc, 1990, pp.
209246. Curves and Surfaces, Academic Press, Boston, MA (1991)
28. Kalifa J., Mallat S., Roug e B.: Deconvolution by Thresholding in Mirror Wavelet Bases. IEEE Trans. Image Process. 12(4),
446457 (2003)
29. Kalifa J., Mallat S.: Thresholding Estimators for Linear Inverse Problems and Deconvolutions. Ann. Statist. 31(1), 58-109
(2003)
30. Long R., Chen W.: Wavelet Basis Packets and Wavelet Frame Packets. J. Fourier Anal. Appl. 3(3), 239-256 (1997)
30 Anwei Chai, Zuowei Shen
31. Ron A., Shen Z.: Afne Systems in L
2
(R
d
) : the Analysis of the Analysis Operator. J. Funct. Anal. 148, 408-447 (1997)
32. Ron A., Shen Z.: Afne Systems in L
2
(R
d
) II: Dual Systems. J. Fourier Anal. Appl. 3, 617-637 (1997)
33. A. N. Tikhonov: On the Solution of Incorrectly put Problems and the Regularization Method. Soviet Math. Doklady 4,
1035-1038 (1963)
34. N. Wiener: Extrapolation, Interpolation and Smoothing of Stationary Time Series. Wiley, New York (1949)
Appendix
A Proofs in 1 and 2
We give proofs of Lemma 1.1, Proposition 1.1 and Proposition 1.2.
Proof (Proof of Lemma 1.1) When j 0, we have
q
j,k
= D
j
T
k
=
j,k
and
q
j,k
= D
j
T
k
=
j,k
, which imply that
P
j
f =
kZ
f ,
j,k
)
j,k
=
kZ
f ,
q
j,k
)
q
j,k
= P
q
j
f , and
r
=1
kZ
f ,
q
, j,k
) =
r
=1
kZ
f ,
, j,k
).
Since in [19, Lemma 2.4], it has already been proved that P
j+1
f = P
j
f +
r
=1
kZ
f ,
, j,k
)
, j,k
, we have
P
q
j+1
f = P
j+1
f = P
j
f +
r
=1
kZ
f ,
, j,k
)
, j,k
= P
q
j
f +
r
=1
kZ
f ,
q
, j,k
)
q
, j,k
,
i.e. the identity (1.13) holds when j 0. Next we show (1.13) also holds for j < 0. We rst denote as
0
.
By the denitions of renable equation (1.2) and framelet (1.5), one obtains that for = 0, 1, . . . , r,
(x) = 2
kZ
h
[k](2x k).
This leads to
q
, j,k
= 2
j
T
k
(2
j
) = 2
j+1
T
k
(
k
/
Z
h
[k
/
]
0
(2
j+1
k
/
))
=
k
/
Z
h
[k
/
]2
j+1
0
(2
j+1
( k 2
j1
k
/
))
=
k
/
2
j1
Z
h
[2
j+1
k
/
]2
j+1
0
(2
j+1
( k k
/
)).
We dene the dilated sequence h
, j
by
h
, j
[k] =
_
h
[2
j+1
k], k 2
j1
Z;
0, k / 2
j1
Z.
(A.1)
The sequence h
, j
is obtained inductively by inserting 0 between every two entries in h
, j+1
with h
,1
= h
=0
kZ
f ,
q
, j,k
)
q
, j,k
=
r
=0
kZ
_
k
/
Z
h
, j
[k
/
] f ,
q
0, j+1,k
/
+k
)
__
k
//
Z
h
, j
[k
//
]
q
0, j+1,k
//
+k
_
=
k
/
Z
k
//
Z
_
r
=0
kZ
h
, j
[k]h
, j
[k +k
//
k
/
]
_
f ,
q
0, j+1,k
/
)
q
0, j+1,k
//
.
Next, we check that
r
=0
kZ
h
, j
[k]h
, j
[k +k
//
k
/
] =
0,k
/
k
// . When kk
//
2
j1
Z, there exists p Zsuch that k
/
k
//
=
2
j1
p and we have
r
=0
kZ
h
, j
[k]h
, j
[k +k
//
k
/
] =
r
=0
kZ
h
, j
[k]h
, j
[k 2
j1
p]
=
r
=0
k2
j1
Z
h
, j
[k]h
, j
[k 2
j1
p]
=
r
=0
kZ
h
[k]h
[k p] =
0,p
.
Deconvolution: A Wavelet Frame Approach 31
The last identity follows by (1.7). The sum is nonzero if and only if p = 0, which is exactly k
/
= k
//
. When k
/
k
//
, 2
j1
Z,
there exist p
1
, p
2
Z and p
2
, 2
j1
Z such that k
/
k
//
= 2
j1
p
1
+ p
2
. Then we have
r
=0
kZ
h
, j
[k]h
, j
[k +k
//
k
/
] =
r
=0
kZ
h
, j
[k]h
, j
[k 2
j1
p
1
p
2
] =
r
=0
k2
j1
Z
h
, j
[k]h
, j
[k 2
j1
p
1
p
2
].
Since k2
j1
p
1
p
2
,2
j1
Z when k 2
j1
Z, we have h
, j
[k2
j1
p
1
p
2
] =0 for any k 2
j1
Z and the last identity
is equal to 0. In conclusion, for the dilated lters h
0, j
, h
1, j
, . . . , h
r, j
, we still have a similar result as (1.7)
r
=0
kZ
h
, j
[k]h
, j
[k p] =
0,p
, p Z. (A.2)
Thus we have
r
=0
kZ
f ,
q
, j,k
)
q
, j,k
=
kZ
f ,
q
0, j+1,k
)
q
0, j+1,k
= P
q
j+1
f .
This is the identity we need to prove when j < 0. In all, identity (1.13) holds for any j Z. .
Remark A.1 We note here that in the proof of identity (1.9) for the afne system, one needs both conditions in (1.6); while in
the proof of identity (1.13), when the quasi-afne system is used, one only needs condition (1.7).
Proof (Proof of Proposition 1.1) First we consider the case j 0. In this case, since
q
j,k
= D
j
T
k
=
j,k
, we have
P
j
f =
kZ
f ,
j,k
)
j,k
=
kZ
f ,
q
j,k
)
q
j,k
= P
q
j
f .
Next, we show that Q
j
f = Q
q
j
f when j 0. Since X() is a tight frame, X
q
() is also a tight frame by [31, Theorem 5.5]. On
the other hand, j 0 implies
q
, j,k
= D
j
T
k
=
, j,k
. Thus we have
r
=1
j<0
kZ
f ,
, j,k
)
, j,k
= f
r
=1
j0
kZ
f ,
, j,k
)
, j,k
=
r
=1
j<0
kZ
f ,
q
, j,k
)
q
, j,k
.
Hence, when j 0,
Q
q
j
f =
r
=1
j
/
<0,kZ
f ,
q
, j
/
,k
)
q
, j
/
,k
+
r
=1
j
j
/
=0
kZ
f ,
, j
/
,k
)
, j
/
,k
=
r
=1
j
/
<0,kZ
f ,
, j
/
,k
)
, j
/
,k
+
r
=1
j
j
/
=0
kZ
f ,
, j
/
,k
)
, j
/
,k
= Q
j
f .
Since P
j
f = Q
j
f by [19, Lemma 2.4], we have P
q
j
f = Q
q
j
f for j 0.
Next we show that P
q
j
f = Q
q
j
f holds when j < 0. Applying Lemma 1.1 inductively for any f L
2
(R) and j < 0, we have
P
q
j
f = P
q
j
//
f +
r
=1
j
j
/
=j
//
kZ
f ,
q
, j
/
,k
)
q
, j
/
,k
. (A.3)
Thus the proof of P
q
j
f = Q
q
j
f is transferred to the proof of P
q
j
//
f 0 as j
//
. The proof below is essentially the same as
that of [26, Theorem 2.2].
Since h
0
is nitely supported, the renable function derived from h
0
satises (1.3), which implies that the integer shifts
of
q
j
//
,0
is a Bessel sequence. Because
P
q
j
//
f =
kZ
f ,
q
j
//
,k
)
q
j
//
,k
,
the norm of P
q
j
//
f satises
|P
q
j
//
f |
2
L
2
(R)
C
kZ
[ f ,
q
j
//
,k
)[
2
, (A.4)
where the constant C is independent of j
//
. Based on the result in approximation theory, we only need to check the value of
|P
q
j
//
f |
L
2
(R)
when f is supported on an interval [R, R] for an arbitrary given R > 0. Applying the Cauchy-Schwartz inequality
to (A.4), we have for j
//
< 0 and [ j
//
[ sufciently large,
|P
q
j
//
f |
2
L
2
(R)
C|f |
2
L
2
_
E
j
//
[(x)[
2
dx, (A.5)
32 Anwei Chai, Zuowei Shen
where E
j
// =
kZ
_
k +2
j
//
[R, R]
_
. Now P
q
j
//
f 0 follows by letting j
//
in (A.5). Then (A.3) becomes
P
q
j
f =
r
=1
j
/
<j
kZ
f ,
q
, j
/
,k
)
q
, j
/
,k
= Q
q
j
f .
Thus we complete our proof of P
q
j
f = Q
q
j
f for any j Z. .
Proof (Proof of Proposition 1.2) The result on A
J
can be proved by induction. When J = 1, this follows from (1.15). For
arbitrary J < 0, we start from the denition of A
J
. By (1.21), we have
A
J
A
J
=
_
J
j=1
H
0, j
__
1
j=J
H
0, j
_
+
_
J+1
j=1
H
0, j
H
1,J
__
H
1,J
1
j=J+1
H
0, j
_
+ +
_
J+1
j=1
H
0, j
H
r,J
__
H
r,J
1
j=J+1
H
0, j
_
+
1
j=J+1
r
=1
_
j+1
j
/
=1
H
0, j
/
H
, j
__
H
, j
1
j
/
=j+1
H
0, j
/
_
=
_
J+1
j=1
H
0, j
__
1
j=J+1
H
0, j
_
+
1
j=J+1
r
=1
_
j+1
j
/
=1
H
0, j
/
H
, j
__
H
, j
1
j
/
=j+1
H
0, j
/
_
=A
J+1
A
J+1
.
In the above,
1
j
/
=j
H
0, j
/ = H
0, j
H
0, j+1
H
0,1
and
j
j
/
=1
H
0, j
/
= H
0,1
H
0,2
H
0, j
. The last equality is obtained by applying
(1.18), which reects the reconstruction process from Jth level to (J +1)th level. Hence, A
J
A
J
= I holds for J <0 by induction.
For operator A, we note that proving A
2
(Z). We next note that
v, v) = A
J
v, A
J
v) =
_
v
j=1
H
0, j
__
1
j=J
H
0, j
v
_
+
1
j=J
r
=1
_
v
j+1
j
/
=1
H
0, j
/
H
, j
__
H
, j
1
j
/
=j+1
H
0, j
/ v
_
(A.6)
and
Av, Av) =
1
j=
r
=1
_
v
j+1
j
/
=1
H
0, j
/
H
, j
__
H
, j
1
j
/
=j+1
H
0, j
/ v
_
. (A.7)
Thus to show Av, Av) = v, v) we only need to prove that
1
j=J
H
0, j
v approaches 0 as J .
Since the matrix H
0, j
are Toeplitz matrices generated by lters h
0, j
, we have
j=J
H
0, j
v =
1
j=J
h
0, j
v.
Since [
h
0, j
[ 1,
j=J
H
0, j
v
[v[, a.e. R.
Note that the compactly supported renable function obtained from the nite-length low pass lter h
0
can be written as
() =
j=0
h
0
(2
j1
). Since L
2
(R) is compactly supported, we have L
1
(R) and
,= 0 a.e. R satisfying that
h
0, j
() =
h
0
(2
j1
), we have
j=J
H
0, j
v =
1
j=J
h
0, j
v =
1
j=J
h
0
(2
j1
)v =
1
j=0
h
0
(2
j1
)
j=J
h
0
(2
j1
)v =
1
(2
J
)v.
Thus
lim
J
1
j=J
h
0, j
v =
v
lim
J
(2
J
) = 0.
So for any R, [
1
j=J
h
0, j
[[v[ 0 a.e. as J . Applying the Dominated Convergence Theorem, we obtain
|
1
j=J
H
0, j
v|
2
(Z)
=
1
2
|
1
j=J
h
0, j
v|
L
2
[,]
0, J .
Let J in (A.6), we have v, v) = lim
J
A
J
v, A
J
v) = Av, Av), which completes our proof. .
Deconvolution: A Wavelet Frame Approach 33
Next we give the following result which demonstrates that once the low pass lter h
0
satises assumption (2.2), the renable
function with renable mask h
0
as given in (1.2) exists and hence a tight frame system can be constructed from h
0
via the
UEP.
Proposition A.1 Suppose h
0
is nitely supported and satises the following condition:
_
[
h
0
()[
2
+[
h
0
( +)[
2
1, a.e. R;
h
0
(0) = 1.
(A.8)
The solution of the renement equation
(x) = 2
kZ
h
0
[k](2x k)
is in L
2
(R).
Proof Since h
0
is nitely supported and
h
0
(0) = 1, the compactly supported renable function exists in the sense of distribu-
tion with the Fourier transform of given by
() =
j=1
h
0
(
2
j
), (A.9)
satisfying
(0) = 1. Further, the distribution solution is unique. In the following, we will prove L
2
(R) whenever h
0
satises (A.8).
Our proof uses the cascade algorithm dened by
n
() =
h
0
(
2
)
n1
(
2
) =
n
j=1
h
0
(
2
j
)
0
(
2
n
), n > 0, (A.10)
with initial function
0
satisfying
0
() =
[,]
(). It is known that the cascade algorithm always converges to as a
distribution. Since
0
() satises
kZ
[
0
( +2k)[
2
= 1, a.e. R,
it can be proven inductively that for any
n
, n > 0,
kZ
[
n
( +2k)[
2
1, a.e. R.
Thus we have
|
n
|
2
L
2
(R)
=
_
R
[
n
()[
2
d =
_
kZ
[
n
( +2k)[
2
d 2.
Since the sequence |
n
|
L
2
(R)
is bounded for each n, there exists a subsequence
n
j
which converges weakly to some
function g L
2
(R). As shown in [17], when h
0
is nitely supported,
n
in (A.10) converges absolutely and uniformly on
compact sets. Thus the function
is uniformly continuous on compact sets. Since
(0) = 1, in a neighborhood of 0, we have
,= 0. Thus each
n
j
,= 0 in such a neighborhood. It leads to the weak limit g ,= 0 in this neighborhood. On the other hand,
because the sequence
n
converges to the function in the sense of distribution, which is weaker than the weak convergence,
we have = g L
2
(R). .
Remark A.2 It was shown in [14] that if h
0
satises (A.8) and if the corresponding renable function is in L
2
(R), then there is
constructive way to derive a set of tight framelets. Further, if is symmetric, the framelets are symmetric or antisymmetric. Con-
structions of tight frames when the renement mask h
0
satises (A.8) are also given in [19] in their construction of tight frames
from pseudo-splines (also available in [21]). The above proposition shows that condition (A.8) on h
0
implies the corresponding
renable function L
2
(R).
B Proofs of Minimization Properties
Here, we give a proof of Proposition 3.3.
Proof (Proof of Proposition 3.3) The (, j, k)th entries of sequences v
)
, j,k
and
, j,k
respec-
tively.
34 Anwei Chai, Zuowei Shen
From the denition of
(v; a) by (3.9), we have
(v
+; a) = |H
0
(v
+) c|
2
+|v
+ a|
2
|H
0
(v
+) H
0
a|
2
+
r
=1
j<0,kZ
2p
j
[( v
)
, j,k
+
, j,k
[
p
= |H
0
v
c|
2
+2H
0
, H
0
v
c) +|H
0
|
2
+|v
a|
2
+||
2
+2, v
a)
|H
0
v
H
0
a|
2
|H
0
|
2
2H
0
, H
0
v
H
0
a) +
r
=1
j<0,kZ
2p
j
[( v
)
, j,k
+
, j,k
[
p
= |H
0
v
c|
2
+|v
a|
2
|H
0
v
H
0
a|
2
+
r
=1
j<0,kZ
2p
j
[( v
)
, j,k
[
p
+||
2
+2, H
0
(H
0
v
c)) +2, v
a) 2, H
0
(H
0
v
H
0
a))
+
r
=1
j<0,kZ
2p
j
([( v
)
, j,k
+
, j,k
[
p
[( v
)
, j,k
[
p
)
=
(v
; a) +||
2
+2, v
aH
0
c +H
0
H
0
a) +
r
=1
j<0,kZ
2p
j
([( v
)
, j,k
+
, j,k
[
p
[( v
)
, j,k
[
p
).
(B.1)
Since A
aH
0
c +H
0
H
0
a) = , A
aA
AH
0
c +A
AH
0
H
0
a)
= A, v
aA(H
0
(c H
0
a))).
(B.2)
Next we denote the (, j, k)th entry of a and A(H
0
(cH
0
a)) by a
, j,k
and w
, j,k
. Together with the simplied notation
, j,k
:=
r
=1
j<0
kZ
and
j
:=
2p
j
,
(v
+; a) becomes:
(v
+; a) =
(v
; a) +||
2
+
, j,k
j
([( v
)
, j,k
+
, j,k
[
p
[( v
)
, j,k
[
p
) +
, j,k
2
, j,k
(( v
)
, j,k
a
, j,k
w
, j,k
).
(B.3)
Next we prove the inequality
(v
+; a)
(v
; a) +||
2
for p, 1 p 2. We only need to show
, j,k
j
([( v
)
, j,k
+
, j,k
[
p
[( v
)
, j,k
[
p
) +
, j,k
2
, j,k
(( v
)
, j,k
a
, j,k
w
, j,k
) 0. (B.4)
Since a and c are in
p
, if further assuming v
p
, by applying the Minkowskis and Youngs inequalities as well as the
nonexpansive property of the threshold function t
p
(x), we have ( v
)
, j,k
a
, j,k
w
, j,k
p
. Because
p
, we have
q
for q =
p
p1
p, and then by H older inequality, we can derive that
, j,k
(( v
)
, j,k
a
, j,k
w
, j,k
)
1
(Z).
Thus the sequences in (B.4) are absolutely convergent, hence we only need to prove (B.4) term by term, i.e.
j
[( v
)
, j,k
+
, j,k
[
p
j
[( v
)
, j,k
[
p
+2
, j,k
(( v
)
, j,k
a
, j,k
w
, j,k
) 0. (B.5)
First we consider the case p = 1. The threshold function is the soft-threshold function and the (, j, k)th entry of v
satises that
( v
)
, j,k
=t
j
(a
, j,k
+ w
, j,k
). We show (B.5) case by case.
1. ( v
)
, j,k
= 0, then [ a
, j,k
+ w
, j,k
[
j
/2.
j
[
, j,k
[ +2
, j,k
( a
, j,k
w
, j,k
)
j
([
, j,k
[
, j,k
) 0;
2. ( v
)
, j,k
> 0, then ( v
)
, j,k
= a
, j,k
+ w
, j,k
j
/2.
j
[( v
)
, j,k
+
, j,k
[
j
( v
)
, j,k
+2
, j,k
(( v
)
, j,k
a
, j,k
w
, j,k
)
=
j
[( v
)
, j,k
+
, j,k
[
j
( v
)
, j,k
+2
, j,k
(
j
/2) =
j
_
[( v
)
, j,k
+
, j,k
[ (( v
)
, j,k
+
, j,k
)
_
0;
Deconvolution: A Wavelet Frame Approach 35
3. ( v
)
, j,k
< 0, then ( v
)
, j,k
= a
, j,k
+ w
, j,k
+
j
/2.
j
[( v
)
, j,k
+
, j,k
[ +
j
( v
)
, j,k
+2
, j,k
(( v
)
, j,k
a
, j,k
w
, j,k
)
=
j
[( v
)
, j,k
+
, j,k
[ +
j
( v
)
, j,k
+2
, j,k
(
j
/2) =
j
_
[( v
)
, j,k
+
, j,k
[ +(( v
)
, j,k
+
, j,k
)
_
0.
Thus when p = 1 the sum in (B.3) is nonnegative and hence the inequality holds.
Next we consider the case of the xed p, 1 < p 2. When 1 < p 2, the value of v
is given by ( v
)
, j,k
= (F
p
j
)
1
( a
, j,k
+
w
, j,k
) and we have
j
[( v
)
, j,k
+
, j,k
[
p
j
[( v
)
, j,k
[
p
+2
, j,k
(( v
)
, j,k
F
p
j
(( v
)
, j,k
))
=
j
[( v
)
, j,k
+
, j,k
[
p
j
[( v
)
, j,k
[
p
, j,k
p
j
sgn(( v
)
, j,k
)[( v
)
, j,k
[
p1
.
If ( v
)
, j,k
= 0, then (B.5) holds clearly. If ( v
)
, j,k
,= 0, we check it using function (t) = [t[
p
where p > 1. The second order
derivative is
//
(t) = p(p1)[t[
p2
, which is nonnegative for any value of t except 0. By Taylor expansion,
j
[( v
)
, j,k
+
, j,k
[
p
j
[( v
)
, j,k
[
p
2
, j,k
(( v
)
, j,k
F
p
j
( v
, j,k
)) =
1
2
j
p(p1)[[
p2
2
, j,k
0,
where is between ( v
)
, j,k
and ( v
)
, j,k
+
, j,k
. Thus (B.5) still holds when 1 < p 2.
In conclusion, when 1 p 2, we always have (B.5) and hence (B.4). Therefore, the inequality
(v
+; a)
(v
; a) +
||
2
holds for 1 p 2. .
C Proof in 4
This is to give the proof of Proposition 4.1.
Proof (Proof of Proposition 4.1) We start the proof fromthe explicit formof the eigenvalues of the circulant matrix H
0
generated
from lter h
0
with the data of length N. The eigenvalues of the N-by-N circulant matrix H
0
are:
p
[H
0
] =
K1
k=0
h
0
[k] exp
_
i2kp
N
_
=
h
0
(
2p
N
), p = 0, 1, , N1.
If
h
0
(2) ,= 0, for Q with Q the set of rational numbers, then
p
[H
0
] ,= 0 for p = 0, 1, . . . , N 1. Since h
0
is nitely
supported, the polynomial
h
0
() has nitely many zeros. Suppose those zeros of
h
0
in terms of rational multiples of 2 are
_
2
q
i
p
i
: i = 1, 2, , n
_
, (C.1)
where for each i, gcd(q
i
, p
i
) = 1. Because
h
0
is 2-periodic, we can take the rationales being proper fractions, i.e. q
i
< p
i
. It is
not necessary to consider the case p
i
= q
i
since
h
0
(2) =
h
0
(0) = 1. To make the matrix H
0
nonsingular, the value of N should
satisfy
p
N
/
_
q
i
p
i
: i = 1, 2, , n
_
, p = 0, 1, , N1. (C.2)
One sufcient condition on N such that (C.2) holds is
p
i
N, i = 1, 2, , n. (C.3)
This is because, assuming (C.2) is not true, i.e. there exist p
i
0
and q
i
0
in the set given in (C.2) such that
p
N
=
q
i
0
p
i
0
, for some
0 < p < N, then pp
i
0
= Nq
i
0
. Since gcd(p
i
0
, q
i
0
) = 1, it leads to p
i
0
[N, which is a contradiction of p
i
0
N. Hence, for a given
lter h
0
, there are innitely many N such that as long as the data length N satises (C.3), the corresponding circulant matrix
H
0
is nonsingular. For a given data with length N
0
, if N
0
dose not satises (C.2), we just simply extend the data to the length
N
1
> N
0
satisfying (C.2), for example, we can take N
1
prime to each p
i
. Then the circulant matrix H
0
generated from h
0
with
respect to the extended data of length N
1
is nonsingular. Since det(H
0
) = det(H
0
) and det(H
0
H
0
) = det(H
0
)
2
, the matrices H
0
and H
0
H
0
are nonsingular once H
0
is. .
36 Anwei Chai, Zuowei Shen
Remark C.1 In practice, if the data are needed to be extended, small N
1
is appreciated. This processing is constructive once all
the zeros in terms of rational multiples of 2 as those in (C.1) are available. Based on the sufcient condition (C.3), we rst
factorize p
i
, i = 1, , n, in (C.1) as:
_
_
p
1
= m
e
11
1
m
e
12
2
m
e
1l
l
;
p
2
= m
e
21
1
m
e
22
2
m
e
2l
l
;
.
.
.
.
.
.
p
n
= m
e
n1
1
m
e
n2
2
m
e
nl
l
,
where m
i
, i = 1, . . . , l are prime numbers. Then N
1
N
0
satisfying (C.3) means that
m
i
N
1
, i = 1, , l.
Starting from above criterion, we can nd the minimum N
1
by directly computation, e.g. using sieve of Eratosthenes.
After we calculate the value of N
1
, we need to extend the data by N
1
N
0
entries. To make the extension meaningful, a
possible way is to repeat the entries in the original data set. For instance, we can append the rst N
1
N
0
entries to the end of the
data set. If h
0
is a renement mask of a spline, pseudo-spline or one of those used in high resolution image reconstructions, then
N
1
N
0
1, since
h
0
(2) = 0 with Q only when =
1
2
. Thus, as long as N
0
is odd, the corresponding circulant matrix
H
0
is nonsingular. This implies that we can simply append at most the rst entry in the data set to guarantee the nonsingularity
of H
0
.