Camera Model | Yipeng Wang

CV_Camera

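1. Functional Classification

Area Scan / Line Scan
- Area Scan - the most common type of camera
- Line Scan - e.g. the scanner inside a printer
Color / Monochrome
CMOS / CCD
- CMOS - Complementary Metal Oxide Semiconductor
- CCD - Charge-Coupled Device
At present, CMOS generally offers somewhat better overall performance
Global Shutter / Rolling Shutter
- Global Shutter - Capture entire frame at once
- Rolling Shutter - Capture image line by line
Because a rolling shutter captures rows in sequence, the following can happen
1. Relative motion between camera and subject causes wobble and distortion
2. Transient effects such as a flash may not be captured in full, namely when the row-by-row capture is slower than the flash fades
Global shutter cameras are better suited to motion photography, but... they are pricier, so be ready to cough up the coins (XP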
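A rough sketch of the first effect, assuming a hypothetical sensor that reads rows top to bottom at a constant time per row and a subject moving horizontally at constant speed. The function name and all numbers are illustrative and not taken from any particular camera.

```python
def rolling_shutter_offsets(num_rows, line_time_s, speed_px_per_s):
    """Horizontal shift (in pixels) of the subject in each row.

    Row i is read i * line_time_s after row 0, so by then the subject
    has moved speed_px_per_s * i * line_time_s pixels.
    """
    return [speed_px_per_s * row * line_time_s for row in range(num_rows)]


# Hypothetical numbers: 1080 rows, 15 microseconds per row, 2000 px/s.
offsets = rolling_shutter_offsets(1080, 15e-6, 2000.0)
print(f"shift between first and last row: {offsets[-1]:.2f} px")
```

A global shutter corresponds to `line_time_s = 0`, where every row gets the same (zero) shift.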
Resolution
- 240p - QVGA (320 × 240)
- 480p - VGA (640 × 480)
- 720p - HD
- 1080p - FHD
- 1440p - QHD
- 2160p - 4K UHD
- 4320p - 8K UHD
Frame Rate

15Hz / 30Hz / 60Hz...
Focal Length

Fixed / Auto-Focus
Connection Interface

USB 3.0 / GigE ...

2. Color Space

2.1 RGB[A]

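Color images are generally stored in RGB (Red-Green-Blue) format

[Figure: rgb]

Typical value ranges of the R, G, B channels for different image types:

- CV_8U - 8-bit unsigned integer - 0 to 255
- CV_16U - 16-bit unsigned integer - 0 to 65535
- CV_32F - 32-bit floating point - 0 to 1

The Alpha Channel is a fourth channel in addition to RGB, a "non-color channel" that carries transparency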

2.2 Gray / Greyscale

Usually has only one Channel

A common formula for converting RGB to grayscale is

$gray = R \cdot 0.299 + G \cdot 0.587 + B \cdot 0.114$
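A minimal sketch of this weighted sum in plain Python, assuming 8-bit (CV_8U style) values and an image stored as nested lists of (R, G, B) tuples. The helper names are made up for illustration.

```python
def rgb_to_gray(r, g, b):
    """Weighted sum of R, G, B using the weights from the formula above."""
    return r * 0.299 + g * 0.587 + b * 0.114


def image_to_gray(image):
    """Convert rows of (R, G, B) tuples into rows of rounded gray values."""
    return [[round(rgb_to_gray(*pixel)) for pixel in row] for row in image]


# A tiny 2x2 test image: red, green, blue, white.
image = [[(255, 0, 0), (0, 255, 0)],
         [(0, 0, 255), (255, 255, 255)]]
gray = image_to_gray(image)
print(gray)
```

Green contributes the most and blue the least, so pure green ends up much brighter than pure blue in the gray image.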

2.3 HSV

Hue - Saturation - Value

[Figure: hsv]

Hue - Base Pigment
Saturation - Depth of Pigment
Value - Brightness of Pigment

HSV - RGB conversion is mathematically lossless (in exact arithmetic; storing either side as 8-bit integers introduces rounding)
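A small round-trip sketch using Python's standard `colorsys` module, which works on floats in [0, 1]. The wrapper names `rgb8_to_hsv` and `hsv_to_rgb8` are illustrative helpers, not part of any library.

```python
import colorsys


def rgb8_to_hsv(r, g, b):
    """8-bit RGB -> (h, s, v), each a float in [0, 1]."""
    return colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)


def hsv_to_rgb8(h, s, v):
    """(h, s, v) floats in [0, 1] -> 8-bit RGB tuple."""
    r, g, b = colorsys.hsv_to_rgb(h, s, v)
    return tuple(round(c * 255.0) for c in (r, g, b))


color = (200, 100, 50)
hsv = rgb8_to_hsv(*color)
restored = hsv_to_rgb8(*hsv)
print(hsv, restored)
```

Keeping HSV as floats is what makes the round trip come back to the same 8-bit color here.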

3. Rigid Body Motion

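Describing rigid body motion based on Euler angles

3.1 2D Rotation

[Figure: 2d trans]

Point $p$ in frame $\{A\}$ has coordinates $p_A(x_a, y_a)$ and makes an angle $\alpha$ with the $x$ axis

Rotating by $\theta$ gives frame $\{B\}$; the rotated point $p$ has coordinates $p_B(x_b, y_b)$ in $\{B\}$

Numerically $p_A = p_B$, which is useless on its own: this information is no good until it is processed

What are the coordinates of $p_B$ in $\{A\}$ ?

Define

$R_{AB} =$ the rotation of $\{B\}$ relative to $\{A\}$
$x_{B\{A\}} =$ the $x$ axis of $\{B\}$ expressed in $\{A\}$
$y_{B\{A\}} =$ the $y$ axis of $\{B\}$ expressed in $\{A\}$

$R_{AB} = \begin{bmatrix} \cos(\theta) & -\sin(\theta) \\ \sin(\theta) & \cos(\theta) \end{bmatrix} = [x_{B\{A\}}, y_{B\{A\}}] \in \mathbb{R}^{2 \times 2}$

This gives the following relation

Proof? Heh heh heh... Trivial ! (The columns of $R_{AB}$ are the axes of $\{B\}$ expressed in $\{A\}$, so the point is $x_b \cdot x_{B\{A\}} + y_b \cdot y_{B\{A\}}$.)

$p_{B\{A\}} = R_{AB} \cdot p_{B\{B\}}$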

General Form

Consider a single point $p$ expressed in both frames $\{A\}$ and $\{B\}$ (drawn with solid black lines); then we have
- $p_B \longrightarrow$ coordinates of $p$ in $\{B\}$
- $p_A \longrightarrow$ coordinates of $p$ in $\{A\}$
- $R_{AB} =$ rotation of $\{B\}$ relative to $\{A\}$
- $R_{BA} =$ rotation of $\{A\}$ relative to $\{B\}$
$\begin{matrix} p_{A} = R_{AB} \cdot p_{B} \\ p_{B} = R_{BA} \cdot p_{A} \end{matrix}$
Important Properties
1. $\det{(R)} = 1$
2. $RR^T = R^TR = I$
  
  $\longrightarrow R^T = R^{-1}$
  
  $\longrightarrow R_{AB}^T = R_{BA}$
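The properties above are easy to check numerically. A minimal sketch in plain Python, with small hand-written matrix helpers and an arbitrary illustrative angle of 30 degrees:

```python
import math


def rot2d(theta):
    """R_AB for frame {B} rotated by theta (radians) relative to {A}."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]


def transpose(m):
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def matvec(m, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]


theta = math.radians(30)       # illustrative angle
R_AB = rot2d(theta)
R_BA = transpose(R_AB)         # R_AB^T = R_BA

det = R_AB[0][0] * R_AB[1][1] - R_AB[0][1] * R_AB[1][0]
identity = matmul(R_AB, R_BA)  # numerically the identity matrix

p_B = [1.0, 0.0]               # a point on the x axis of {B}
p_A = matvec(R_AB, p_B)        # the same point expressed in {A}
p_B_back = matvec(R_BA, p_A)   # and converted back to {B}
print(det, p_A, p_B_back)
```

The point on the $x$ axis of $\{B\}$ lands on $(\cos\theta, \sin\theta)$ in $\{A\}$, which is exactly the first column of $R_{AB}$.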

3.2 2D Translation

The coordinates of $p$ in $\{A\}$ and $\{B\}$ are related by

$\begin{cases} x_{A} = x_{B} + a \\ y_{A} = y_{B} + b \end{cases}$

where $(a, b)$ is the origin of $\{B\}$ expressed in $\{A\}$

3.3 Homogeneous Coordinates

Homogeneous Coordinates: what on earth are they?

Rotation & Translation Review

Written with the most basic matrices, the procedures in 3.1 and 3.2 look like this

From here on the point is written $q$, and $p$ is reserved for the Translation

2D Rotation

$q_{A} = R_{AB} \cdot q_{B} \longrightarrow \underbrace{\begin{bmatrix} x_{A} \\ y_{A} \end{bmatrix}}_{q_{A}} = \underbrace{\begin{bmatrix} \cos(\theta) & -\sin(\theta) \\ \sin(\theta) & \cos(\theta) \end{bmatrix}}_{R_{AB}} \underbrace{\begin{bmatrix} x_{B} \\ y_{B} \end{bmatrix}}_{q_{B}}$

2D Translation

$q_{A} = p_{AB} + q_{B} \longrightarrow \underbrace{\begin{bmatrix} x_{A} \\ y_{A} \end{bmatrix}}_{q_{A}} = \underbrace{\begin{bmatrix} p_{x(AB)} \\ p_{y(AB)} \end{bmatrix}}_{p_{AB}} + \underbrace{\begin{bmatrix} x_{B} \\ y_{B} \end{bmatrix}}_{q_{B}}$

where
- $R_{AB}\in\mathbb{R}^{2\times2} \longrightarrow$ rotation of $\{B\}$ relative to $\{A\}$
- $p_{AB} \in\mathbb{R}^{2} \longrightarrow$ position of the origin of $\{B\}$ relative to $\{A\}$
- $q_A \in\mathbb{R}^{2} \longrightarrow$ coordinates of $q$ in $\{A\}$
- $q_B \in\mathbb{R}^{2} \longrightarrow$ coordinates of $q$ in $\{B\}$

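Brute Force vs. Elegant Motion Description

Two motions, two formulas; what if we used a single formula to express Rotation + Translation in one go?
$q_{A} = R_{AB} \cdot q_{B} + p_{AB}$
If that still looks fine to you, feel free to chain a few more motions while still using just one formula...
$q_{A} = R_{AB} (R_{BC} \cdot q_{C} + p_{BC}) + p_{AB}$
Nest a few more layers, shuffle the order of the motions, and good luck telling which motion belongs to which step...

Now suppose I tell you, good sir, that there is a way to describe the motion with one unified and concise Matrix Train; how would you respond?

$q_{A} = T_{AB} \cdot T_{BC} \cdot T_{CD} \cdot q_{D}$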

Homogeneous Coordinates

The way to achieve this concise form is to switch to Homogeneous Coordinates

In other words, describe the motion in a space one dimension higher: 2D motion becomes 3D, 3D motion becomes 4D, and so on...

"Add another coordinate to our points"
- Position
  $q = \begin{bmatrix} x \\ y \end{bmatrix} \longrightarrow \begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} q \\ 1 \end{bmatrix}$
- Rotation Matrix
  $R = \begin{bmatrix} \cos(\theta) & -\sin(\theta) \\ \sin(\theta) & \cos(\theta) \end{bmatrix} \longrightarrow \begin{bmatrix} \cos(\theta) & -\sin(\theta) & 0 \\ \sin(\theta) & \cos(\theta) & 0 \\ 0 & 0 & 1 \end{bmatrix} = \begin{bmatrix} R & 0 \\ 0 & 1 \end{bmatrix}$
- Translation Matrix
  $p = \begin{bmatrix} p_{x} \\ p_{y} \end{bmatrix} \longrightarrow \begin{bmatrix} 1 & 0 & p_{x} \\ 0 & 1 & p_{y} \\ 0 & 0 & 1 \end{bmatrix} = \begin{bmatrix} I & p \\ 0 & 1 \end{bmatrix}$
  We want to describe every motion in one unified way, so the new Translation Matrix has the same Dimension as the new Rotation Matrix

Homogeneous Transformation Matrix (HTM)

Homogeneous Transformation Matrix $T$

These matrices can be combined by multiplying them directly, and different multiplication orders give different results. The labels below read each product left to right as successive motions of the frame; when the product is applied to a point, the rightmost matrix acts first
- Translate first, then Rotate
  $\begin{bmatrix} I & p \\ 0 & 1 \end{bmatrix} \begin{bmatrix} R & 0 \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} R & p \\ 0 & 1 \end{bmatrix}$
- Rotate first, then Translate
  $\begin{bmatrix} R & 0 \\ 0 & 1 \end{bmatrix} \begin{bmatrix} I & p \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} R & R p \\ 0 & 1 \end{bmatrix}$
[General Form of HTM]

$R$ and $p$ can be configured freely, as long as you keep in mind the underlying order of the transformations

$T = \begin{bmatrix} R & p \\ 0 & 1 \end{bmatrix}$
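To see that the order really matters, here is a small sketch that multiplies the two matrices both ways. The rotation of 90 degrees and the translation $p = (1, 0)$ are arbitrary illustrative values, and the helper names are made up.

```python
import math


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def rotation_h(theta):
    """Homogeneous 2D rotation [[R, 0], [0, 1]]."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def translation_h(px, py):
    """Homogeneous 2D translation [[I, p], [0, 1]]."""
    return [[1.0, 0.0, px], [0.0, 1.0, py], [0.0, 0.0, 1.0]]


R = rotation_h(math.pi / 2)    # 90 degrees
P = translation_h(1.0, 0.0)    # p = (1, 0)

PR = matmul(P, R)              # [[R, p], [0, 1]]
RP = matmul(R, P)              # [[R, R p], [0, 1]]

last_col_PR = [row[2] for row in PR]   # p itself
last_col_RP = [row[2] for row in RP]   # R p, i.e. p rotated by 90 degrees
print(last_col_PR, last_col_RP)
```

Both products share the same rotation block; only the last column differs, which is exactly the $p$ versus $Rp$ difference in the two formulas.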

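Usage Notes
1. Homogeneous transformation of a point $q$ from $\{B\}$ to $\{A\}$
  $\begin{bmatrix} q_{A} \\ 1 \end{bmatrix} = \begin{bmatrix} R_{AB} & p_{AB} \\ 0 & 1 \end{bmatrix} \begin{bmatrix} q_{B} \\ 1 \end{bmatrix}$
  which expands to
  $q_{A} = R_{AB} \cdot q_{B} + p_{AB}$
2. Homogeneous transformation of a point $q$ across multiple Frames (with each $q$ written in homogeneous form)
  $q_{A} = T_{AB} \cdot T_{BC} \cdot T_{CD} \cdot q_{D}$
3. Important Properties
  - $\det{(T)} = \det{(R)} = 1$
  - $T^{-1} = \begin{bmatrix} R^T & -R^T p \\ 0 & 1 \end{bmatrix}$
    
    $\longrightarrow$ unlike $R$, $T^T \neq T^{-1}$ whenever $p \neq 0$
    
    $\longrightarrow T_{AB}^{-1} = T_{BA}$
  - Homogeneous coordinates allow two parallel lines to intersect, at a point at infinity (see 3.4.2)
  In use it behaves much like a Rotation Matrix (it composes by multiplication and its inverse swaps the frames), but it is not orthogonal: the translation part has to be inverted as $-R^T p$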
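The usage notes above can be sketched in a few lines of plain Python: build two transforms, compare the matrix train against the nested formula from earlier, and undo a transform with the closed-form inverse. All angles and offsets are illustrative, and `htm` / `htm_inverse` are made-up helper names.

```python
import math


def htm(theta, px, py):
    """T = [[R, p], [0, 1]] for a 2D rotation theta and translation (px, py)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, px], [s, c, py], [0.0, 0.0, 1.0]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def matvec(m, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]


def htm_inverse(T):
    """Closed form [[R^T, -R^T p], [0, 1]]; the translation needs more than a transpose."""
    (r00, r01, px), (r10, r11, py) = T[0], T[1]
    return [[r00, r10, -(r00 * px + r10 * py)],
            [r01, r11, -(r01 * px + r11 * py)],
            [0.0, 0.0, 1.0]]


def rotate(T, v):
    """Apply only the 2x2 rotation block of T to a 2D vector."""
    return [T[0][0] * v[0] + T[0][1] * v[1], T[1][0] * v[0] + T[1][1] * v[1]]


T_AB = htm(math.radians(30), 1.0, 2.0)
T_BC = htm(math.radians(-45), 0.5, -1.0)
q_C = [2.0, 3.0, 1.0]                          # homogeneous point in {C}

# Matrix train: q_A = T_AB T_BC q_C
q_A = matvec(matmul(T_AB, T_BC), q_C)

# Nested form: q_A = R_AB (R_BC q_C + p_BC) + p_AB
inner = [a + b for a, b in zip(rotate(T_BC, q_C[:2]), (T_BC[0][2], T_BC[1][2]))]
q_A_nested = [a + b for a, b in zip(rotate(T_AB, inner), (T_AB[0][2], T_AB[1][2]))]

T_BA = htm_inverse(T_AB)
product = matmul(T_AB, T_BA)                   # numerically the identity
print(q_A, q_A_nested)
```

The two routes give the same point; the matrix train just keeps the bookkeeping in one place.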

3.4 Translation in Homogeneous Coordinates

Translation Visualized

A 2D Translation is actually a shear transformation in 3D, as the following example shows

Given coordinates in the Initial Frame and Homogeneous Transformation Matrix (HTM)

$\text{Initial Coordinates: } p_{i} = \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} \qquad \text{HTM: } H = \begin{bmatrix} 1 & 0 & 0.4 \\ 0 & 1 & 0.5 \\ 0 & 0 & 1 \end{bmatrix}$

$p$ after translation is

$H \cdot p_{i} = \begin{bmatrix} 1 & 0 & 0.4 \\ 0 & 1 & 0.5 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = \begin{bmatrix} 0.4 \\ 0.5 \\ 1 \end{bmatrix}$

If we visualize this process in 3D space...
- White point $\longrightarrow$ position of point $p$
- Yellow point $\longrightarrow$ its projection onto the 2D plane at $z = 0$
- Blue cube region $\longrightarrow$ think of this cube as the set of all Coordinates, i.e. all points $p$
  
  $p = [p_x, p_y, p_z]^T$, where $p_z$ need not equal 1
- Blue plane containing the white point $\longrightarrow$ think of this plane as all Coordinates that qualify for 2D translation, i.e. the set of points $p$ with $p_z = 1$
  
  $p = [p_x, p_y, 1]^T$

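How do we explain the deformation in the figure?

The deformation in the figure can be read as a kind of perspective projection (see 5)

The whole space is deformed; multiplying any $p_i$ by $H$ verifies this

In the figure above, the white dashed line shows the points of the $z$ axis after the deformation

Writing $\omega$ for the $z$ coordinate, the points on the $z$ axis can be expressed as
$p_{i} = \begin{bmatrix} 0 \\ 0 \\ \omega \end{bmatrix}$
and the deformation of this line can be expressed as
$H \cdot p_{i} = \begin{bmatrix} 1 & 0 & 0.4 \\ 0 & 1 & 0.5 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} 0 \\ 0 \\ \omega \end{bmatrix} = \begin{bmatrix} 0.4 \omega \\ 0.5 \omega \\ \omega \end{bmatrix} = \omega \cdot \begin{bmatrix} 0.4 \\ 0.5 \\ 1 \end{bmatrix}$
The plane described by $\omega = 1$ is called the normalized plane, i.e. the first blue plane from the bottom in the figure above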
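A quick numerical check of this shear, using the same $H$ and a few illustrative values of $\omega$. The `normalize` helper is a made-up name for dividing by the last coordinate.

```python
H = [[1.0, 0.0, 0.4],
     [0.0, 1.0, 0.5],
     [0.0, 0.0, 1.0]]


def matvec(m, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]


def normalize(p):
    """Scale a homogeneous point so that its last coordinate is 1."""
    return [c / p[-1] for c in p]


# Points [0, 0, w] on the z axis, pushed through H.
sheared = {w: matvec(H, [0.0, 0.0, w]) for w in (0.5, 1.0, 2.0)}

# Bringing each result back to the normalized plane (last coordinate 1).
on_plane = {w: normalize(p) for w, p in sheared.items()}
print(sheared)
print(on_plane)
```

Every point of the sheared line normalizes to the same 2D position, which is the translated origin.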

4. Pinhole Camera Model

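$W$ is the world frame and $C$ is the camera frame

[Figure: cam model scene]

Some of the elements in the figure above

$\longrightarrow$ the object point in world coordinates (the white point)

$\longrightarrow$ Image Plane, the camera's imaging plane

$\longrightarrow$ projection of the white point onto the camera's image plane

Describing this projection requires solving the following problems

1. $W \longrightarrow C$
  
  Solution: the Extrinsic Matrix $E$
2. $C \longrightarrow \text{Image Plane}$
  
  Solution: the Intrinsic Matrix $K$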

4.1 Extrinsic Matrix

The Extrinsic Matrix converts an object's coordinates from the Global Frame to the Camera Frame; it is a coordinate transformation in 3D space

Take item 1 at the end of 3.3 and simply replace $q_A, q_B, R_{AB}, p_{AB}$ with their 3D versions!

$R_{AB}\in\mathbb{R}^{3\times3} \longrightarrow$ rotation of $\{B\}$ relative to $\{A\}$
$p_{AB} \in\mathbb{R}^{3} \longrightarrow$ position of the origin of $\{B\}$ relative to $\{A\}$
$q_A \in\mathbb{R}^{3} \longrightarrow$ coordinates of $q$ in $\{A\}$
$q_B \in\mathbb{R}^{3} \longrightarrow$ coordinates of $q$ in $\{B\}$

$\begin{bmatrix} q_{A} \\ 1 \end{bmatrix} = \underbrace{\begin{bmatrix} R_{AB} & p_{AB} \\ 0 & 1 \end{bmatrix}}_{E} \begin{bmatrix} q_{B} \\ 1 \end{bmatrix}$

$E$ is the Extrinsic Matrix

$\text{Extrinsic Matrix: } E = \begin{bmatrix} R & p \\ 0 & 1 \end{bmatrix} \in \mathbb{R}^{4 \times 4}$

Degrees of Freedom

Rotation happens about the three axes $(x,y,z)$, so $R_{AB}\in SO(3)$ has only 3 DOF even though it contains 9 entries
$E \longrightarrow 3\ \text{DOF}\ (R_{AB}) + 3\ \text{DOF}\ (p_{AB}) = 6\ \text{DOF}$
The Extrinsic Matrix has 6 degrees of freedom in total
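To make the 6 DOF concrete, here is a sketch that builds a 4x4 $E$ from exactly six numbers. The Euler convention $R = R_z(\text{yaw}) R_y(\text{pitch}) R_x(\text{roll})$ is an assumption chosen for illustration; other conventions are equally valid, and the function names are made up.

```python
import math


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def matvec(m, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]


def rot_x(a):
    c, s = math.cos(a), math.sin(a)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]


def rot_y(a):
    c, s = math.cos(a), math.sin(a)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]


def rot_z(a):
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]


def extrinsic(roll, pitch, yaw, tx, ty, tz):
    """4x4 E = [[R, p], [0, 1]] from 3 rotation + 3 translation parameters.

    Assumed convention: R = Rz(yaw) * Ry(pitch) * Rx(roll).
    """
    R = matmul(rot_z(yaw), matmul(rot_y(pitch), rot_x(roll)))
    p = (tx, ty, tz)
    return [R[i] + [p[i]] for i in range(3)] + [[0.0, 0.0, 0.0, 1.0]]


E = extrinsic(0.0, 0.0, math.pi / 2, 1.0, 2.0, 3.0)
q_B = [1.0, 0.0, 0.0, 1.0]
q_A = matvec(E, q_B)   # rotate (1, 0, 0) by 90 degrees about z, then add (1, 2, 3)
print(q_A)
```

The nine entries of the rotation block are fully determined by the three angles, which is where the 3 DOF for $R$ comes from.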

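4.2 Intrinsic Matrix

Problem Statement

With the $E$ from 4.1, we can convert an object's position from world coordinates to camera coordinates

The Intrinsic Matrix handles the projection from camera coordinates onto the Image Plane
- Red point $p(x, y, z) \longrightarrow$ position of the object in the camera frame
- Blue point $p'(x', y', f) \longrightarrow$ projection of the object onto the blue plane, i.e. the Image Plane
- Blue plane $\longrightarrow$ Image Plane
- $f \longrightarrow$ focal length
- $z \longrightarrow$ Optical Axis, which meets the Image Plane at the Principal Point $(c_x, c_y)$
Image Plane Problems

All of the problems below can be blamed on the manufacturing process
1. Ideally: $(c_x, c_y) = (0, 0)$
  
  In practice: there is an Offset
2. Ideally: the $x$ and $y$ axes are scaled equally
  
  In practice: there is Scaling
3. Ideally: the $x$ and $y$ axes are perfectly orthogonal
  
  In practice: there is Skew
Offset and Scaling can be handled while computing the projected point coordinates (see 4.2.1); Skew is more involved (see 4.2.2)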

4.2.1 How to Obtain the Projected Point Coordinates

By similar triangles between $p$ and $p'$ in $\{C\}$ (for example in the $(x, z)$ plane), $x'$ and $y'$ are

$\frac{x'}{x} = \frac{y'}{y} = \frac{f}{z} \Rightarrow \begin{cases} x' = f \cdot \frac{x}{z} \\ y' = f \cdot \frac{y}{z} \end{cases}$

Taking the principal point offset into account, the Image Plane coordinates $(u, v)$ are

$\begin{cases} u = f \cdot \frac{x}{z} + c_{x} \\ v = f \cdot \frac{y}{z} + c_{y} \end{cases}$

Multiply both sides by $z$ and write the right-hand side in matrix form

$z \cdot \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \begin{bmatrix} f & 0 & c_{x} \\ 0 & f & c_{y} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix}$

Divide both sides by $z$ to normalize over depth

The Dimension Loss from 3D to 2D happens right here

$\begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \begin{bmatrix} f & 0 & c_{x} \\ 0 & f & c_{y} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} \cdot \frac{1}{z}$

which gives

$\begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \begin{bmatrix} f & 0 & c_{x} \\ 0 & f & c_{y} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x / z \\ y / z \\ 1 \end{bmatrix}$

As a last step, the captured image has to be digitized (Digitization / Pixelization), which also takes care of image-plane Scaling

$\rho_w$ and $\rho_h$ are the width and height of each pixel

$\begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \begin{bmatrix} f / \rho_{w} & 0 & c_{x} \\ 0 & f / \rho_{h} & c_{y} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x / z \\ y / z \\ 1 \end{bmatrix} \quad \text{with} \quad \begin{cases} f_{x} = f / \rho_{w} \\ f_{y} = f / \rho_{h} \end{cases}$
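A minimal sketch of this projection (no skew, no distortion) in plain Python. The focal length, pixel size and principal point are hypothetical values picked to give round numbers, and `project_to_pixel` is a made-up name.

```python
def project_to_pixel(point_c, f, rho_w, rho_h, cx, cy):
    """Project a camera-frame point (x, y, z) to pixel coordinates (u, v).

    f, rho_w and rho_h share one length unit (e.g. metres);
    cx and cy are given in pixels.
    """
    x, y, z = point_c
    if z <= 0:
        raise ValueError("point must be in front of the camera (z > 0)")
    fx, fy = f / rho_w, f / rho_h      # focal length in pixel units
    return fx * x / z + cx, fy * y / z + cy


# Hypothetical camera: 4 mm lens, 2 micrometre square pixels, 1280x720 image.
u, v = project_to_pixel((0.1, -0.05, 2.0), 0.004, 2e-6, 2e-6, 640.0, 360.0)

# A point twice as far along the same ray lands on the same pixel:
# this is the 3D -> 2D dimension loss.
u2, v2 = project_to_pixel((0.2, -0.1, 4.0), 0.004, 2e-6, 2e-6, 640.0, 360.0)
print((u, v), (u2, v2))
```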

4.2.2 Skewed Image Plane

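[Figure: skew]

The image projected from the lens onto the Image Plane looks like the left side of the figure, but for the Image Plane to record it correctly, we need to transform it into the form on the right

[Figure: skew coord]

The figure above gives the following coordinate transformation

$\begin{cases} x_{skew} = \hat{x} - \hat{y} \cot(\theta) \\ y_{skew} = \hat{y} / \sin(\theta) \end{cases}$

Rewrite the result of 4.2.1 from Matrix Form into Equation Form, then substitute the expressions above (with $\hat{x} = x$ and $\hat{y} = y$)

$\begin{aligned} & \begin{cases} u = f_{x} \cdot \frac{x_{skew}}{z} + c_{x} \\ v = f_{y} \cdot \frac{y_{skew}}{z} + c_{y} \end{cases} \\ \longrightarrow & \begin{cases} u = f_{x} \cdot \left(\frac{x}{z} - \frac{y}{z} \cot(\theta)\right) + c_{x} \\ v = f_{y} \cdot \frac{y}{z} \cdot \frac{1}{\sin(\theta)} + c_{y} \end{cases} \\ \longrightarrow & \begin{cases} u = f_{x} \cdot \frac{x}{z} - f_{x} \cot(\theta) \cdot \frac{y}{z} + c_{x} \\ v = \frac{f_{y}}{\sin(\theta)} \cdot \frac{y}{z} + c_{y} \end{cases} \end{aligned}$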

4.2.3 Summary

Pixelized Matrix Form
$z \cdot \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \underbrace{\begin{bmatrix} f_{x} & -f_{x} \cot(\theta) & c_{x} \\ 0 & f_{y} / \sin(\theta) & c_{y} \\ 0 & 0 & 1 \end{bmatrix}}_{K} \begin{bmatrix} x \\ y \\ z \end{bmatrix}$
Pixelized Matrix Form, Normalized
$\begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \underbrace{\begin{bmatrix} f_{x} & -f_{x} \cot(\theta) & c_{x} \\ 0 & f_{y} / \sin(\theta) & c_{y} \\ 0 & 0 & 1 \end{bmatrix}}_{K} \begin{bmatrix} x / z \\ y / z \\ 1 \end{bmatrix}$
Pixelized Equation Form
$\begin{cases} u = f_{x} \cdot \frac{x_{skew}}{z} + c_{x} \\ v = f_{y} \cdot \frac{y_{skew}}{z} + c_{y} \end{cases} \quad \text{with} \quad \begin{cases} x_{skew} = \hat{x} - \hat{y} \cot(\theta) \\ y_{skew} = \hat{y} / \sin(\theta) \end{cases}$

$K$ is the Intrinsic Matrix

$\text{Intrinsic Matrix: } K = \begin{bmatrix} f_{x} & -f_{x} \cot(\theta) & c_{x} \\ 0 & f_{y} / \sin(\theta) & c_{y} \\ 0 & 0 & 1 \end{bmatrix} \in \mathbb{R}^{3 \times 3}$

Degrees of Freedom

With 5 parameters ($f_x$, $f_y$, $\theta$, $c_x$, $c_y$), the Intrinsic Matrix has 5 degrees of freedom in total
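A sketch of $K$ built from these five parameters, following the derivation in 4.2.2 (so the skew entry carries a minus sign: $-f_x \cot\theta$). The numeric values and helper names are illustrative only.

```python
import math


def intrinsic_matrix(fx, fy, cx, cy, theta=math.pi / 2):
    """K for a skew angle theta between the image axes (pi/2 means no skew)."""
    cot = math.cos(theta) / math.sin(theta)
    return [[fx, -fx * cot, cx],
            [0.0, fy / math.sin(theta), cy],
            [0.0, 0.0, 1.0]]


def project(K, point_c):
    """Apply K to the depth-normalized camera-frame point [x/z, y/z, 1]."""
    x, y, z = point_c
    n = [x / z, y / z, 1.0]
    u, v, w = (sum(k * c for k, c in zip(row, n)) for row in K)
    return u / w, v / w


point = (0.2, 0.1, 2.0)                       # illustrative camera-frame point

K_ideal = intrinsic_matrix(800.0, 800.0, 320.0, 240.0)
K_skewed = intrinsic_matrix(800.0, 800.0, 320.0, 240.0, theta=math.radians(80))

uv_ideal = project(K_ideal, point)
uv_skewed = project(K_skewed, point)
print(uv_ideal, uv_skewed)
```

With $\theta = 90°$ the skew term vanishes (up to floating-point noise) and $K$ reduces to the matrix from 4.2.1.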

4.3 Lens Distortion

A camera is certainly more than a pinhole (the aperture); it also has a lens, and the lens usually deforms the image... and then your coordinates are wrong

The good news: we only need to add one Anti-Distortion step before computing the projected point coordinates

The bad news: it is no ordinary hassle...

Lens Distortion is mostly Radial Distortion, with a smaller share of Tangential Distortion

[Figure: lens distort]

The solution OpenCV provides is a set of truly outrageous-looking... coefficients...

$\begin{cases} x'' = \frac{1 + k_{1} r^{2} + k_{2} r^{4} + k_{3} r^{6}}{1 + k_{4} r^{2} + k_{5} r^{4} + k_{6} r^{6}} \cdot x' + 2 p_{1} \cdot x' y' + p_{2} (r^{2} + 2 x'^{2}) \\ y'' = \frac{1 + k_{1} r^{2} + k_{2} r^{4} + k_{3} r^{6}}{1 + k_{4} r^{2} + k_{5} r^{4} + k_{6} r^{6}} \cdot y' + 2 p_{2} \cdot x' y' + p_{1} (r^{2} + 2 y'^{2}) \end{cases}$

where

$r^2 = x'^2 + y'^2$
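$k_i$ are Radial Distortion Coefficients and $p_1, p_2$ are Tangential Distortion Coefficients

Higher-Order Coefficients are not considered in OpenCV

Typically, for the forward model written above (ideal $\longrightarrow$ distorted coordinates)
- $k_1<0 \longrightarrow$ Barrel Distortion (points are pulled toward the center as $r$ grows)
- $k_1>0 \longrightarrow$ Pincushion Distortion
Some references state the opposite signs because they model the inverse mapping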

The projected point coordinates can then be computed as follows

(1) Compute the homogeneous (normalized) coordinates
(2) Apply the Anti-Distortion step (4.2.1 does not have this step)
(3) Compute the projected point coordinates

Here $x', y'$ are the normalized coordinates, and $f_x, f_y$ are moved to the last step

$\begin{aligned} (1) \quad & \begin{cases} x' = x / z \\ y' = y / z \end{cases} \\ (2) \quad & \begin{cases} x'' = \frac{1 + k_{1} r^{2} + k_{2} r^{4} + k_{3} r^{6}}{1 + k_{4} r^{2} + k_{5} r^{4} + k_{6} r^{6}} \cdot x' + 2 p_{1} \cdot x' y' + p_{2} (r^{2} + 2 x'^{2}) \\ y'' = \frac{1 + k_{1} r^{2} + k_{2} r^{4} + k_{3} r^{6}}{1 + k_{4} r^{2} + k_{5} r^{4} + k_{6} r^{6}} \cdot y' + 2 p_{2} \cdot x' y' + p_{1} (r^{2} + 2 y'^{2}) \end{cases} \\ (3) \quad & \begin{cases} u = f_{x} \cdot x'' + c_{x} \\ v = f_{y} \cdot y'' + c_{y} \end{cases} \\ \Longrightarrow \quad & (u, v) \end{aligned}$
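The three steps can be sketched directly from the formulas above in plain Python. The coefficient values are made up for illustration, and these helper functions are not OpenCV API, just a transcription of the equations.

```python
def distort(xp, yp, k=(0.0,) * 6, p=(0.0, 0.0)):
    """Step (2): map normalized (x', y') to distorted (x'', y'')."""
    k1, k2, k3, k4, k5, k6 = k
    p1, p2 = p
    r2 = xp * xp + yp * yp
    radial = ((1 + k1 * r2 + k2 * r2 ** 2 + k3 * r2 ** 3)
              / (1 + k4 * r2 + k5 * r2 ** 2 + k6 * r2 ** 3))
    xpp = radial * xp + 2 * p1 * xp * yp + p2 * (r2 + 2 * xp * xp)
    ypp = radial * yp + 2 * p2 * xp * yp + p1 * (r2 + 2 * yp * yp)
    return xpp, ypp


def project_with_distortion(point_c, fx, fy, cx, cy, k=(0.0,) * 6, p=(0.0, 0.0)):
    x, y, z = point_c
    xp, yp = x / z, y / z                   # (1) normalized coordinates
    xpp, ypp = distort(xp, yp, k, p)        # (2) distortion model
    return fx * xpp + cx, fy * ypp + cy     # (3) pixel coordinates


# With all coefficients zero the model is the plain pinhole projection.
uv_plain = project_with_distortion((1.0, 0.0, 2.0), 800.0, 800.0, 320.0, 240.0)

# Illustrative radial coefficient k1 = -0.2: the point moves toward the center.
k = (-0.2, 0.0, 0.0, 0.0, 0.0, 0.0)
uv_radial = project_with_distortion((1.0, 0.0, 2.0), 800.0, 800.0, 320.0, 240.0, k=k)
print(uv_plain, uv_radial)
```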

4.4 Model Summary

General Form

The procedure for taking a point $p(x, y, z)$ in $\{W\}$ to its projected coordinates through $\{C\}$ is
1. $\{W\} \longrightarrow \{C\}$
  $\begin{bmatrix} x_{C} \\ y_{C} \\ z_{C} \\ 1 \end{bmatrix} = \underbrace{\begin{bmatrix} R_{CW} & p_{CW} \\ 0 & 1 \end{bmatrix}}_{\text{Extrinsics } E} \begin{bmatrix} x_{W} \\ y_{W} \\ z_{W} \\ 1 \end{bmatrix}$
2. $\{C\} \longrightarrow \text{Image Plane}$
  $z_{C} \cdot \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \underbrace{\begin{bmatrix} f_{x} & -f_{x} \cot(\theta) & c_{x} \\ 0 & f_{y} / \sin(\theta) & c_{y} \\ 0 & 0 & 1 \end{bmatrix}}_{\text{Intrinsics } K} \begin{bmatrix} x_{C} \\ y_{C} \\ z_{C} \end{bmatrix}$
Divide by $z_C$ to normalize, and $(u, v)$ is done
Dimension-Matched Form

The dimensions in the form above do not line up directly, so here the last row of the Extrinsic Matrix is dropped, leaving a Euclidean Rigid Transformation
$E = \begin{bmatrix} R_{CW} & p_{CW} \end{bmatrix} \in \mathbb{R}^{3 \times 4}$
which gives a slightly uglier but Dimension-Matched version
$z_{C} \cdot \begin{bmatrix} u \\ v \\ 1 \end{bmatrix} = \overbrace{\underbrace{\begin{bmatrix} f_{x} & -f_{x} \cot(\theta) & c_{x} \\ 0 & f_{y} / \sin(\theta) & c_{y} \\ 0 & 0 & 1 \end{bmatrix}}_{\text{Intrinsics } K} \underbrace{\begin{bmatrix} r_{11} & r_{12} & r_{13} & p_{1} \\ r_{21} & r_{22} & r_{23} & p_{2} \\ r_{31} & r_{32} & r_{33} & p_{3} \end{bmatrix}}_{\text{Extrinsics } E}}^{\text{Camera Matrix } M} \begin{bmatrix} x_{W} \\ y_{W} \\ z_{W} \\ 1 \end{bmatrix}$

Degrees of Freedom
$M \longrightarrow 5\ \text{DOF}\ (K) + 6\ \text{DOF}\ (E) = 11\ \text{DOF}$
The Camera Matrix has 11 degrees of freedom in total
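Putting the pieces together, a sketch of the dimension-matched form $M = K E$ with a 3x3 $K$ and a 3x4 $E$. The numbers are illustrative: zero skew, and a camera whose frame is shifted 5 units along $z$ relative to the world frame.

```python
def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def matvec(m, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]


K = [[800.0, 0.0, 320.0],          # illustrative intrinsics, no skew
     [0.0, 800.0, 240.0],
     [0.0, 0.0, 1.0]]

E = [[1.0, 0.0, 0.0, 0.0],         # R_CW = I, p_CW = (0, 0, 5)
     [0.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 1.0, 5.0]]

M = matmul(K, E)                   # 3x4 camera matrix


def project_world(M, point_w):
    """World point (x, y, z) -> pixel (u, v) via z_C * [u, v, 1]^T = M * [x, y, z, 1]^T."""
    xw, yw, zw = point_w
    su, sv, s = matvec(M, [xw, yw, zw, 1.0])   # s equals z_C
    return su / s, sv / s


u, v = project_world(M, (1.0, 0.5, 0.0))
print(u, v)
```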

5. Camera Calibration

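Checkerboard Calibration, whose goal is to find the Intrinsic Matrix

The Perspective-n-Point Problem, whose goal is to find the Extrinsic Matrix

5.1 Checkerboard Calibration

Camera Intrinsics are generally static, so calibration only needs to be done once

Unless the camera goes through long-distance shipping or takes a heavy impact, they will not change easily

A checkerboard is a very good reference for judging image deformation, and checkerboard calibration is a very widely used calibration method

[Figure: checkerboard]

If you know how to use fiducial markers such as AprilTag, you can also use an AprilTag board like the one below; either way, there are plenty of Libraries available

[Figure: apriltag calib]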

5.2 PnP Problem

PnP = Perspective-n-Point

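[Figure: pnp]

Given

$K \longrightarrow$ camera intrinsics

$X_i \in \mathbb{R}^3 \longrightarrow$ $n$ points in 3D space

$x_i \in \mathbb{R}^2 \longrightarrow$ coordinates of $X_i$ projected onto the camera's Image Plane

At least 3 point correspondences are generally required
Find

$E = [R \;| \;p] \in \mathbb{R}^{3\times 4} \longrightarrow$ camera extrinsics, i.e. the camera pose

with $R\in SO(3)$ and $p \in \mathbb{R}^3$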
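One way to read the problem statement: a candidate pose is good when projecting each $X_i$ with it reproduces the observed $x_i$. The sketch below only evaluates that agreement for a given pose (a mean reprojection error); it is not a PnP solver, and in practice a library routine such as OpenCV's `solvePnP` would do the actual search. All numbers and function names are illustrative.

```python
import math


def project(K, R, p, X):
    """Project a 3D point X with intrinsics K and pose (R, p)."""
    xc = [sum(R[i][j] * X[j] for j in range(3)) + p[i] for i in range(3)]
    xn, yn = xc[0] / xc[2], xc[1] / xc[2]
    return K[0][0] * xn + K[0][1] * yn + K[0][2], K[1][1] * yn + K[1][2]


def mean_reprojection_error(K, R, p, points_3d, points_2d):
    """Average pixel distance between projected X_i and observed x_i."""
    total = 0.0
    for X, x in zip(points_3d, points_2d):
        u, v = project(K, R, p, X)
        total += math.hypot(u - x[0], v - x[1])
    return total / len(points_3d)


K = [[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
true_p = [0.0, 0.0, 5.0]

points_3d = [(1.0, 0.5, 0.0), (-1.0, 0.5, 0.0), (0.0, -1.0, 0.0), (0.5, 0.5, 1.0)]
# Synthetic observations generated from the "true" pose, for illustration only.
points_2d = [project(K, identity, true_p, X) for X in points_3d]

err_true = mean_reprojection_error(K, identity, true_p, points_3d, points_2d)
err_wrong = mean_reprojection_error(K, identity, [0.1, 0.0, 5.0], points_3d, points_2d)
print(err_true, err_wrong)
```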
Applications
- Augmented Reality (AR)
- Structure from Motion (SfM)
- ... ...