Uzawa interpretation of Bregman iteration
In this note we describe a solution method for the following problem.
The L1 regularized equation for sparse solution of linear equations:
The Bregman iteration can be found in, for example
The problem is replaced by a regularized problem
While the method discussed here is not the fastest, it is popular, and it is written in terms of iterating a sequence of basic steps, so the method can be written in a few lines.
The Bregman iteration
The iteration is on u,p, begins with both initialized to zero. Then
- uk + 1 = shrink(Atp,1)
- pk + 1 = pk − (b − Auk + 1)
The shrink operator is defined below. The idea is that the iteration converges to a minimizer of the original problem (or one nearby).
The Augmented Lagrangian
First replace the problem by a regularized version
Then consider the Lagrangian
The dual function is obtained by minimizing over x for fixed λ
which upon completing the square gives
Notice that the problem is separable in each coordinate. The minimizer is
- x = shrink(Atλ / (2ε),2ε) = shrink(Atλ,1)
Now for a given x, performing gradient descent method on the dual results in
- λ = λ − (b − Ax)
So the Bregman iteration arises as a descent method on the Augmented Lagrangian, for the dual problem.
Definition of Shrinkage
The shrink operator, which is the coordinate operator
The shrink operator can also be written in terms of the projection operator onto the interval
- skrink(b,δ) = b − proj[ − δ,δ](b)
The shrink operator is the minimizer of the one dimensional problem