LAGRANGE DUAL PROBLEM

만약 다음과 같은 형태의 optimization problem이 주어졌다고 하자.

$\underset{x}{\text{minimize}} \ f_0(x)$
$\text{subject to} \ f_i(x) \leq 0 , i=1,…,m$
$\qquad\qquad\ g_i(x) = 0 , i=1,…,p$

Lagrange dual funtion

그렇다면 lagrange dual function은 다음과 같이 정의된다.

$g(\lambda,\upsilon) = \inf\limits_{x \in \mathcal{D}}L(x,\lambda,\upsilon)= \inf\limits_{x \in \mathcal{D}}(f_{0}(x)+\sum_{i=1}^{m} \lambda_{i}f_{i}(x) + \sum_{i=1}^{p} \upsilon_{i}g_{i}(x) )$

여기서 $L(x,\lambda,\upsilon)$은 Lagrangian function으로

$L:\mathbb{R}^m \times \mathbb{R}^p \rightarrow \mathbb{R} \ \text{with} \ \mathcal{D} = \lbrace x \mid \bigcap_{i=1}^m \text{dom} \ f_i(x) \cap \bigcap_{i=1}^p \text{dom} \ g_i(x) \rbrace$ $(x,\lambda,\upsilon) \mapsto f_{0}(x)+\sum_{i=1}^{m} \lambda_{i}f_{i}(x) + \sum_{i=1}^{p} \upsilon_{i}g_{i}(x)$

그리고 각각의 constraint에 곱해지는 $\lambda_{i}$와 $\upsilon_{i}$는 “dual” 또는 “lagrange multiplier”라고 부른다.

만약 lagrange multiplier들이 0보다 큰 양수라면 constraint의 조건과 합쳐져 다음과 같은 부등식이 성립된다.

$g(\lambda, \upsilon) \leq L(\widetilde{x},\lambda,\upsilon) \leq f_{0}(\widetilde{x})$ ,where $\widetilde{x}$ is any feasible point in $\mathcal{D}$

위의 부등식이 항상 성립하므로 $f_{0}(x)$의 lower bound는 $g(\lambda, \upsilon)$이다.

Dual problem

$g(\lambda, \upsilon)$는 affine function들의 pointwise infimum이므로 concave이다.

그리고 이말은 $g(\lambda, \upsilon)$가 global optimal point(maximum point)가 존재한다는 말과 동치이다.

이러한 사실로 다음과 같은 optimization problem을 생각해 볼 수 있다.

$\underset{\lambda \geq 0}{\text{maximize}} \ g(\lambda, \upsilon)$

$g(\lambda, \upsilon)$가 최대가 되는 지점은 $\max g(\lambda, \upsilon) = d^{\ast}$가 될 것이고, 이 지점은 $f_{0}$가 최소가 되는 $\min f_{0}(x) = p^{\ast}$가 될 것이다. 따라서 다음과 같은 부등식이 성립한다.

\[\max g(\lambda, \upsilon) = d^{\ast} \leq p^{\ast} = \min f_{0}(x)\]

Example

Standard form LP

다음과 같은 standard form LP(linear programming)을 optimzation하는 문제가 주어졌다고 하자.

$\underset{x}{\text{minimize}} \ c^{T}x$
$\text{subject to} \ Ax = b$
$\qquad\qquad\ x \succeq 0$

이에 대한 lagrange dual function은

$g(\lambda,\upsilon) = \inf\limits_{x \in \mathcal{D}}(c^{T}x+\lambda^{T}(Ax-b) + \upsilon^{T}x )$
$\quad\quad\quad = \inf\limits_{x \in \mathcal{D}}((c^{T}+\lambda^{T}A+\upsilon^{T})x - \lambda^{T}b)$
$\quad\quad\quad = \inf\limits_{x \in \mathcal{D}}((c^{T}+\lambda^{T}A+\upsilon^{T})x) - \lambda^{T}b$

이 형태는 linear function의 pointwise infimum을 찾는 것이므로 concave이다.

그리고 다음과 같은 dual problem으로 치환할 수 있다.

$\underset{\lambda,\upsilon}{\text{maximize}} \ g(\lambda,\upsilon)$

$g(\lambda, \upsilon)$는 concave이므로 global optimal point가 존재하고 이 지점은 $\nabla_{x}g(\lambda,\upsilon) = 0$ 를 만족하는 $x$가 될 것이다.

\[\max g(\lambda,\upsilon) \iff \nabla_{x}g(\lambda,\upsilon) = c^{T}+\lambda^{T}A+\upsilon^{T} = 0\]

따라서 $c^{T}+\lambda^{T}A+\upsilon^{T} = 0$일 때 $g(\lambda,\upsilon)$가 최대 값을 가지는 지점이 된다.

\[\ g(\lambda, \upsilon)= \begin{cases} - \lambda^{T}b ,& \text{if } \ c^{T}+\lambda^{T}A+\upsilon^{T} = 0 \newline \newline -\infty, & \text{otherwise} \end{cases} \\]