What is sPLS method ?
The sPLS method (sparse partial least Square) follows the principle
of PLS but also involves applying a Lasso penalty to certain variables
in order to eliminate them. In the set
,
for each component
,
the variable
is eliminated if
on a. In the set
,
for each component
,
the variable
is eliminated if
on a.
When
,
the minimization problem therefore becomes:
and
are the regularization parameters.
is sometimes replaced by the notation
.
These parameters allow us to nuance the degree of sparsity, that is, the
rate of deleted variables. In particular, the larger
and
are, the more variables are deleted in
and
,
respectively.
The minimization function is biconvex, so it cannot be solved
simultaneously in
and
.
The solution is therefore to proceed variable by variable: for example,
we fix
to minimize in
,
then vice versa.
The function can also be written as below:
with the following relations :
When
,
we replace the expressions
,
,
and
with
,
,
and
,
respectively.
How to solve minimisation problem ?
Shen & Huang lemma
We must therefore solve:
Using the first lemma of Shen & Huang which will be demonstrated
a little later, we arrive at the following result:
In the same way,
Remarks :
in the problem according
,
the sum factor
disappears because of the valid condition
.
It is the same for
.
the term
disappears because neither
nor
is depending to.
and
expressions made disappear
and
respectively.
Shen & Huang lemma is defined by :
The right expression is also called function on
.
Lemma demonstration
Let be :
We want to find
such that
.
,
and
play respectively
,
and
role on the one hand then
,
and
on the other hand.
Convergence algorithm
To solve the minimization problem, we first perform an SVD
decomposition as before (first column of matrices
and
).
The vectors found are not yet the solutions. Therefore, a convergence
algorithm must be applied for each component
.
We first define:
and
.
We must therefore calculate
:
We then normalize weights :
We hence select
and
respectively when
and
.
While
or
:
Special case of PLS1
In the case of PLS1, the minimization problem in
becomes, considering
:
We can now compute (in PLS1 as in PLS2) :
The principle of matrix deflation is the same as for the PLS
method.