You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: lectures/cross_product_trick.md
+41-57
Original file line number
Diff line number
Diff line change
@@ -11,73 +11,64 @@ kernelspec:
11
11
name: python3
12
12
---
13
13
14
-
# Eliminating Cross Products
14
+
# 消除交叉项
15
15
16
-
## Overview
16
+
## 概述
17
17
18
-
This lecture describes formulas for eliminating
18
+
本讲座描述了消除以下内容的公式:
19
19
20
-
*cross products between states and control in linear-quadratic dynamic programming problems
20
+
*线性二次动态规划问题中状态和控制之间的交叉项
21
21
22
-
*covariances between state and measurement noises in Kalman filtering problems
22
+
*卡尔曼滤波问题中状态噪声和测量噪声之间的协方差
23
23
24
+
对于线性二次动态规划问题,主要思路包括以下步骤:
24
25
25
-
For a linear-quadratic dynamic programming problem, the idea involves these steps
26
-
27
-
* transform states and controls in a way that leads to an equivalent problem with no cross-products between transformed states and controls
28
-
* solve the transformed problem using standard formulas for problems with no cross-products between states and controls presented in this lecture {doc}`Linear Control: Foundations <lqcontrol>`
29
-
* transform the optimal decision rule for the altered problem into the optimal decision rule for the original problem with cross-products between states and controls
That the omitted matrix $H=0$ indicates that there are no cross products between states and controls
78
-
in the equivalent problem.
69
+
省略的矩阵 $H=0$ 表示在等价问题中没有状态和控制之间的交叉项。
79
70
80
-
The matrices $(A^*, B, R^*, Q) $ defining the equivalent problem and the value function, policy function matrices $P, F^*$ that solve it are related to the matrices $(A, B, R, Q, H)$ defining the original problem and the value function, policy function matrices $P, F$ that solve the original problem by
The **duality** that prevails between a linear-quadratic optimal control and a Kalman filtering problem means that there is an analogous transformation that allows us to transform a Kalman filtering problem
95
-
with non-zero covariance matrix between between shocks to states and shocks to measurements to an equivalent Kalman filtering problem with zero covariance between shocks to states and measurments.
Thus, $x_t$ is $m \times 1$ and $z_t$ is $k \times 1$.
114
-
115
-
The Kalman filtering formulas are
101
+
因此,$x_t$ 是 $m \times 1$ 且 $z_t$ 是 $k \times 1$。
116
102
103
+
卡尔曼滤波公式为:
117
104
118
105
\begin{align*}
119
106
K(\Sigma_t) & = (A \Sigma_t D' + BF')(D \Sigma_t D' + FF')^{-1}, \\
120
107
\Sigma_{t+1}& = A \Sigma_t A' + BB' - (A \Sigma_t D' + BF')(D \Sigma_t D' + FF')^{-1} (D \Sigma_t A' + FB').
121
108
\end{align*} (eq:Kalman102)
122
-
123
109
124
-
Define tranformed matrices
110
+
定义转换后的矩阵:
125
111
126
112
\begin{align*}
127
113
A^* & = A - BF' (FF')^{-1} D, \\
128
114
B^* {B^*}' & = BB' - BF' (FF')^{-1} FB'.
129
115
\end{align*}
130
116
131
-
### Algorithm
117
+
### 算法
132
118
133
-
A consequence of formulas {eq}`eq:Kalman102} is that we can use the following algorithm to solve Kalman filtering problems that involve non zero covariances between state and signal noises.
The Kalman gain for the original problem **with non-zero covariance** between shocks to states and measurements is then
130
+
原始问题(具有**非零协方差**的状态和测量噪声)的卡尔曼增益为:
146
131
147
132
$$
148
133
K = K^* + BF' (FF')^{-1},
149
134
$$
150
135
151
-
The state reconstruction covariance matrix $\Sigma$ for the original problem equals the state reconstrution covariance matrix for the transformed problem.
136
+
原始问题的状态重构协方差矩阵 $\Sigma$ 等于转换后问题的状态重构协方差矩阵。
152
137
153
138
+++
154
139
155
-
## Duality table
156
-
157
-
Here is a handy table to remember how the Kalman filter and dynamic program are related.
0 commit comments