Given two disjoint convex sets \(U, V\subseteq \R^{n}\text{,}\) there is a hyperplane \(H=\{\mathbf{x}\in \R^n: \mathbf{h}^{\top}\mathbf{x}=k\} \) for some \(\mathbf{h}\in \R^n, k\in \R\text{,}\) such that \(\mathbf{h}^{\top}U \geq k, \mathbf{h}^{\top}V\lt k\text{.}\)
We prove the case of Theorem 9.4.1 where there are \(u_0\in U, v_0\in V\) that minimize \(\{\|u-v\|: u\in U, v\in V\}\text{.}\) We assume this is true.
We want to show that \(H\) is the separating hyperplane. Suppose that \(U\) was not contained in \(\{\mathbf{x}\in \R^n: \mathbf{h}^\top\mathbf{x}\geq0\}\) what must be true about \(U\text{?}\)
Let \(\mathbf{d} = u'-u_0\) describe geometrically what \(\frac{-u_0^\top \mathbf{d}}{\mathbf{d}^\top \mathbf{d}}\cdot \mathbf{d}+u\) represents. (Think dot product, projections and for \(\frac{-u_0^\top \mathbf{d}}{\mathbf{d}^\top \mathbf{d}}\cdot \mathbf{d}+u\) to start at \(u\text{.}\))
Show that \(w^\top w = u_0^\top u_0 + \alpha \mathbf{d}^\top \mathbf{d}\left(2 \frac{u_0^\top \mathbf{d}}{\mathbf{d}^\top \mathbf{d}}+\alpha\right)\text{,}\) and explain why \(w^\top w \lt u_0^\top u_0\text{.}\)
Let \(S\subseteq \R^n\text{.}\) We call the cone of \(S\text{,}\) denoted \(\cone(S)\) to be the set \(\cone(S):=\left\{\displaystyle \sum_{i=1}^m u_i\mathbf{s}_i: u_i\in \R, u_i\geq 0, \mathbf{s}_i\in S \right\}\text{.}\)
Let’s denote the normal vector of the separating hyperplane by \(\y\) (interesting choice 👀) so that \(\y^\top \vb\lt \y^\top p\) for any \(p\in P\text{.}\) Why must \(\y^\top \vb \lt 0\text{?}\)
Suppose \(A\) had a column \(A_j\) such that \(\y^\top A_j\lt 0\text{,}\) show that there is an \(\alpha>0\) such that \(\y^\top (\alpha A_j)\lt \y^\top \vb \text{.}\) Why is this a contradiction?
Suppose (A) held, and we had that there was a \(\begin{bmatrix} \x \\ \vt\end{bmatrix} \geq 0\) so that \(\begin{bmatrix} A \amp I_n \end{bmatrix} \begin{bmatrix} \x \\ \vt\end{bmatrix} = \vb \text{.}\) How would \(A\x\) compare to \(\vb \text{?}\)
Given a pair of primal max-dual min problems, the primal max problem as an optimal solution \(\x^*\) if and only if the dual min problem has an optimal solution \(\y^*\text{.}\) Moreover, \(f(\x^*)=g(\y^*)\text{.}\)
Suppose that optimal dual solution \(\y^*\) exists. Explain why by Activity 4.2.2 it suffices to show that \(f(\x^*)\geq g(\y^*)\) for some feasible \(\x^*\text{.}\)
Consider the matrix \(M = \begin{bmatrix} -\vc^\top \\ A\end{bmatrix}\text{,}\) and vector \(\mathbf{d} = \begin{bmatrix} -g^* \\ \vb \end{bmatrix}\text{.}\) Apply Corollary 9.4.8 to \(M, \mathbf{d}\text{.}\) What does it mean for (A) to hold?
If Corollary 9.4.8 (A) holds for \(M, \mathbf{d}\text{,}\) then it holds for the pair \(-\vc^\top, \begin{bmatrix} -g^*\end{bmatrix} \text{,}\) as well as the pair \(A, \vb \) for the same \(\x\in \R^m\text{.}\) Why does this show that a feasible optimal solution \(\x^*\) exists and that \(f(\x^*)=g(\y^*)\text{?}\)
Let \(\begin{bmatrix} y_0 \amp \y^\top \end{bmatrix}\) denote the vector produced by Corollary 9.4.8 (B). Suppose \(y_0=0\text{.}\) How would \((\y^*+\y)^\top A\) compare to \((\y^*)^\top A\) and \((\y^*+\y)^\top \vb \) compare to \((\y^*)^\top \vb \text{?}\) Why is this a contradiction?