Functions

Here we list the available functions, grouped by category. Each function is documented with its exact definition and the necessary parameters for construction. The proximal mapping (and gradient, when defined) of such functions is computed by calling the prox and prox! methods (and gradient, gradient!, when defined). These functions can be modified and/or combined together to make new ones, by means of calculus rules.

Indicators of sets

When function $f$ is the indicator function of a set $S$, that is

\[f(x) = δ_S(x) = \begin{cases} 0 & \text{if}\ x \in S, \\ +∞ & \text{otherwise}, \end{cases}\]

then $\mathrm{prox}_{γf} = Π_S$ is the projection onto $S$. Therefore ProximalOperators includes in particular projections onto commonly used sets, which are here listed.

ProximalOperators.IndAffine — Type

IndAffine(A, b; iterative=false)

If A is a matrix (dense or sparse) and b is a vector, return the indicator function of the affine set

\[S = \{x : Ax = b\}.\]

If A is a vector and b is a scalar, return the indicator function of the set

\[S = \{x : \langle A, x \rangle = b\}.\]

By default, a direct method (QR factorization of matrix A') is used to evaluate prox!. If iterative=true, then prox! is evaluated approximately using an iterative method instead.

ProximalOperators.IndBallLinf — Function

Indicator of a $L_∞$ norm ball

IndBallLinf(r=1.0)

Return the indicator function of the set

\[S = \{ x : \max (|x_i|) \leq r \}.\]

Parameter r must be positive.

ProximalOperators.IndBallL0 — Type

IndBallL0(r=1)

Return the indicator function of the $L_0$ pseudo-norm ball

\[S = \{ x : \mathrm{nnz}(x) \leq r \}.\]

Parameter r must be a positive integer.

ProximalOperators.IndBallL1 — Type

IndBallL1(r=1.0)

Return the indicator function of the $L_1$ norm ball

\[S = \left\{ x : \sum_i |x_i| \leq r \right\}.\]

Parameter r must be positive.

ProximalOperators.IndBallL2 — Type

IndBallL2(r=1.0)

Return the indicator function of the Euclidean ball

\[S = \{ x : \|x\| \leq r \},\]

where $\|\cdot\|$ is the $L_2$ (Euclidean) norm. Parameter r must be positive.

ProximalOperators.IndBallRank — Type

IndBallRank(r=1)

Return the indicator function of the set of matrices of rank at most r:

\[S = \{ X : \mathrm{rank}(X) \leq r \},\]

Parameter r must be a positive integer.

ProximalOperators.IndBinary — Type

IndBinary(low, up)

Return the indicator function of the set

\[S = \{ x : x_i = low_i\ \text{or}\ x_i = up_i \},\]

Parameters low and up can be either scalars or arrays of the same dimension as the space.

ProximalOperators.IndBox — Type

IndBox(low, up)

Return the indicator function of the box

\[S = \{ x : low \leq x \leq up \}.\]

Parameters low and up can be either scalars or arrays of the same dimension as the space: they must satisfy low <= up, and are allowed to take values -Inf and +Inf to indicate unbounded coordinates.

ProximalOperators.IndGraph — Type

IndGraph(A)

For matrix A (dense or sparse) return the indicator function of its graph:

\[G_A = \{(x, y) : Ax = y\}.\]

The evaluation of prox! uses direct methods based on LDLt (LL for dense cases) matrix factorization and backsolve.

The prox! method operates on pairs (x, y) as input/output. So if f = IndGraph(A) is the indicator of the graph $G_A$, while (x, y) and (c, d) are pairs of vectors of the same sizes, then

prox!((c, d), f, (x, y))

writes to (c, d) the projection onto $G_A$ of (x, y).

ProximalOperators.IndHalfspace — Type

IndHalfspace(a, b)

For an array a and a scalar b, return the indicator of half-space

\[S = \{x : \langle a,x \rangle \leq b \}.\]

ProximalOperators.IndHyperslab — Type

IndHyperslab(low, a, upp)

For an array a and scalars low and upp, return the so-called hyperslab

\[S = \{x : low \leq \langle a,x \rangle \leq upp \}.\]

ProximalOperators.IndPoint — Type

IndPoint(p=0)

Return the indicator of the singleton set

\[C = \{p \}.\]

Parameter p can be a scalar, in which case the unique element of S has uniform coefficients.

ProximalOperators.IndPolyhedral — Type

IndPolyhedral([l,] A, [u, xmin, xmax])

Return the indicator function of the polyhedral set:

\[S = \{ x : x_\min \leq x \leq x_\max, l \leq Ax \leq u \}.\]

Matrix A is a mandatory argument; when any of the bounds is not provided, it is assumed to be (plus or minus) infinity.

ProximalOperators.IndSimplex — Type

IndSimplex(a=1.0)

Return the indicator of the simplex

\[S = \left\{ x : x \geq 0, \sum_i x_i = a \right\}.\]

By default a=1, therefore $S$ is the probability simplex.

ProximalOperators.IndSphereL2 — Type

IndSphereL2(r=1)

Return the indicator function of the Euclidean sphere

\[S = \{ x : \|x\| = r \},\]

where $\|\cdot\|$ is the $L_2$ (Euclidean) norm. Parameter r must be positive.

ProximalOperators.IndStiefel — Type

IndStiefel()

Return the indicator of the Stiefel manifold

\[S_{n,p} = \left\{ X \in \mathbb{F}^{n \times p} : X^*X = I \right\}.\]

where $\mathbb{F}$ is the real or complex field, and parameters $n$ and $p$ are inferred from the matrix provided as input.

Indicators of convex cones

An important class of sets in optimization is that of convex cones. These are used in particular for formulating cone programming problems, a family of problems which includes linear programs (LP), quadratic programs (QP), quadratically constrained quadratic programs (QCQP) and semidefinite programs (SDP).

ProximalOperators.IndExpPrimal — Type

IndExpPrimal()

Return the indicator function of the primal exponential cone, that is

\[C = \mathrm{cl} \{ (r,s,t) : s > 0, s⋅e^{r/s} \leq t \} \subset \mathbb{R}^3.\]

ProximalOperators.IndExpDual — Function

IndExpDual()

Return the indicator function of the dual exponential cone, that is

\[C = \mathrm{cl} \{ (u,v,w) : u < 0, -u⋅e^{v/u} \leq w⋅e \} \subset \mathbb{R}^3.\]

ProximalOperators.IndFree — Type

IndFree()

Return the indicator function of the whole space, or "free cone", i.e., a function which is identically zero.

ProximalOperators.IndNonnegative — Type

IndNonnegative()

Return the indicator of the nonnegative orthant

\[C = \{ x : x \geq 0 \}.\]

ProximalOperators.IndNonpositive — Type

IndNonpositive()

Return the indicator of the nonpositive orthant

\[C = \{ x : x \leq 0 \}.\]

ProximalOperators.IndPSD — Type

IndPSD(;scaling=false)

Return the indicator of the positive semi-definite cone

\[C = \{ X : X \succeq 0 \}.\]

The argument to the function can be either a Symmetric, Hermitian, or AbstractMatrix object, or an object of type AbstractVector{Float64} holding a symmetric matrix in (lower triangular) packed storage.

If scaling = true then the vectors y and x in prox!(y::AbstractVector{Float64}, f::IndPSD, x::AbstractVector{Float64}, args...) have the off-diagonal elements multiplied with √2 to preserve inner products, see Vandenberghe 2010: http://www.seas.ucla.edu/~vandenbe/publications/coneprog.pdf .

I.e. when when scaling=true, let X,Y be matrices and

x = (X_{1,1}, √2⋅X_{2,1}, ... ,√2⋅X_{n,1}, X_{2,2}, √2⋅X_{3,2}, ..., X_{n,n}),

y = (Y_{1,1}, √2⋅Y_{2,1}, ... ,√2⋅Y_{n,1}, Y_{2,2}, √2⋅Y_{3,2}, ..., Y_{n,n})

then prox!(Y, f, X) is equivalent to prox!(y, f, x).

ProximalOperators.IndSOC — Type

IndSOC()

Return the indicator of the second-order cone (also known as ice-cream cone or Lorentz cone), that is

\[C = \left\{ (t, x) : \|x\| \leq t \right\}.\]

ProximalOperators.IndRotatedSOC — Type

Indicator of the rotated second-order cone

    IndRotatedSOC()

Return the indicator of the rotated second-order cone (also known as ice-cream cone or Lorentz cone), that is

\[C = \left\{ (p, q, x) : \|x\|^2 \leq 2\cdot pq, p \geq 0, q \geq 0 \right\}.\]

ProximalOperators.IndZero — Type

IndZero()

Return the indicator function of the set containing the origin, the "zero cone".

Norms and regularization functions

ProximalOperators.CubeNormL2 — Type

CubeNormL2(λ=1)

With a nonnegative scalar λ, return the function

\[f(x) = λ\|x\|^3.\]

ProximalOperators.ElasticNet — Type

ElasticNet(μ=1, λ=1)

Return the function

\[f(x) = μ\|x\|_1 + (λ/2)\|x\|^2,\]

for nonnegative parameters μ and λ.

ProximalOperators.NormL0 — Type

NormL0(λ=1)

Return the $L_0$ pseudo-norm function

\[f(x) = λ\cdot\mathrm{nnz}(x)\]

for a nonnegative parameter λ.

ProximalOperators.NormL1 — Type

NormL1(λ=1)

With a nonnegative scalar parameter λ, return the $L_1$ norm

\[f(x) = λ\cdot∑_i|x_i|.\]

With a nonnegative array parameter λ, return the weighted $L_1$ norm

\[f(x) = ∑_i λ_i|x_i|.\]

ProximalOperators.NormL2 — Type

NormL2(λ=1)

With a nonnegative scalar parameter λ, return the $L_2$ norm

\[f(x) = λ\cdot\sqrt{x_1^2 + … + x_n^2}.\]

ProximalOperators.NormL21 — Type

NormL21(λ=1, dim=1)

Return the "sum of $L_2$ norm" function

\[f(X) = λ⋅∑_i\|x_i\|\]

for a nonnegative λ, where $x_i$ is the $i$-th column of $X$ if dim == 1, and the $i$-th row of $X$ if dim == 2. In words, it is the sum of the Euclidean norms of the columns or rows.

ProximalOperators.NormL1plusL2 — Type

NormL1plusL2(λ_1=1, λ_2=1)

With two nonegative scalars λ1 and λ2, return the function

\[f(x) = λ_1 ∑_{i=1}^{n} |x_i| + λ_2 \sqrt{x_1^2 + … + x_n^2}.\]

With nonnegative array λ1 and nonnegative scalar λ2, return the function

\[f(x) = ∑_{i=1}^{n} {λ_1}_i |x_i| + λ_2 \sqrt{x_1^2 + … + x_n^2}.\]

ProximalOperators.NormLinf — Function

NormLinf(λ=1)

Return the $L_∞$ norm

\[f(x) = λ⋅\max\{|x_1|, …, |x_n|\},\]

for a nonnegative parameter λ.

ProximalOperators.NuclearNorm — Type

NuclearNorm(λ=1)

Return the nuclear norm

\[f(X) = \|X\|_* = λ ∑_i σ_i(X),\]

where λ is a positive parameter and $σ_i(X)$ is $i$-th singular value of matrix $X$.

ProximalOperators.SqrNormL2 — Type

SqrNormL2(λ=1)

With a nonnegative scalar λ, return the squared Euclidean norm

\[f(x) = \tfrac{λ}{2}\|x\|^2.\]

With a nonnegative array λ, return the weighted squared Euclidean norm

\[f(x) = \tfrac{1}{2}∑_i λ_i x_i^2.\]

ProximalOperators.TotalVariation1D — Type

TotalVariation1D(λ=1)

With a nonnegative scalar parameter λ, return the 1D total variation

\[f(x) = λ ∑_{i=2}^{n} |x_i - x_{i-1}|.\]

Penalties and other functions

ProximalOperators.CrossEntropy — Type

CrossEntropy(b)

Return the function

\[f(x) = -\frac{1}{N} \sum_{i = 1}^{N} b_i \log (x_i)+(1-b_i) \log (1-x_i),\]

where b is an array of length N such that 0 ≤ b ≤ 1 component-wise.

ProximalOperators.HingeLoss — Function

HingeLoss(y, μ=1)

Return the function

\[f(x) = μ⋅∑_i \max\{0, 1 - y_i ⋅ x_i\},\]

where y is an array and μ is a positive parameter.

ProximalOperators.HuberLoss — Type

HuberLoss(ρ=1, μ=1)

Return the function

\[f(x) = \begin{cases} \tfrac{μ}{2}\|x\|^2 & \text{if}\ \|x\| ⩽ ρ \\ ρμ(\|x\| - \tfrac{ρ}{2}) & \text{otherwise}, \end{cases}\]

where ρ and μ are positive parameters.

ProximalOperators.LeastSquares — Type

LeastSquares(A, b, λ=1.0; iterative=false)

For a matrix A, a vector b and a scalar λ, return the function

\[f(x) = \tfrac{\lambda}{2}\|Ax - b\|^2.\]

By default, a direct method (based on Cholesky factorization) is used to evaluate prox!. If iterative=true, then prox! is evaluated approximately using an iterative method instead.

ProximalOperators.Linear — Type

Linear(c)

Return the linear function

\[f(x) = \langle c, x \rangle.\]

ProximalOperators.LogBarrier — Type

LogBarrier(a=1, b=0, μ=1)

Return the function

\[f(x) = -μ⋅∑_i\log(a⋅x_i+b),\]

for a nonnegative parameter μ.

ProximalOperators.LogisticLoss — Type

LogisticLoss(y, μ=1)

Return the function

\[f(x) = μ⋅∑_i log(1+exp(-y_i⋅x_i))\]

where y is an array and μ is a positive parameter.

ProximalOperators.Maximum — Function

Maximum(λ=1)

For a nonnegative parameter λ ⩾ 0, return the function

\[f(x) = \lambda \cdot \max \{x_i : i = 1,\ldots, n \}.\]

ProximalOperators.Quadratic — Type

Quadratic(Q, q; iterative=false)

For a matrix Q (dense or sparse, symmetric and positive semidefinite) and a vector q, return the quadratic function

\[f(x) = \tfrac{1}{2}\langle Qx, x\rangle + \langle q, x \rangle.\]

By default, a direct method (based on Cholesky factorization) is used to evaluate prox!. If iterative=true, then prox! is evaluated approximately using an iterative method instead.

ProximalOperators.SqrHingeLoss — Type

SqrHingeLoss(y, μ=1)

Return the squared Hinge loss

\[f(x) = μ⋅∑_i \max\{0, 1 - y_i ⋅ x_i\}^2,\]

where y is an array and μ is a positive parameter.

ProximalOperators.SumPositive — Type

SumPositive()

Return the function

\[f(x) = ∑_i \max\{0, x_i\}.\]

Distances from convex sets

When the indicator of a convex set is constructed (see Indicators of sets) the (squared) distance from the set can be constructed using the following:

ProximalOperators.DistL2 — Type

DistL2(ind_S)

Given ind_S the indicator function of a set $S$, and an optional positive parameter λ, return the (weighted) Euclidean distance from $S$, that is function

\[g(x) = λ\mathrm{dist}_S(x) = \min \{ λ\|y - x\| : y \in S \}.\]

ProximalOperators.SqrDistL2 — Function

SqrDistL2(ind_S, λ=1)

Given ind_S the indicator function of a set $S$, and an optional positive parameter λ, return the (weighted) squared Euclidean distance from $S$, that is function

\[g(x) = \tfrac{λ}{2}\mathrm{dist}_S^2(x) = \min \left\{ \tfrac{λ}{2}\|y - x\|^2 : y \in S \right\}.\]