CL9 - Linear Regression
Lesson 9: Linear Least Squares
Linear Least Squares and Normal Equations
Consider a set of $m$ data points $\{(t_1,y_1),(t_2,y_2), ...,(t_m,y_m)\}$. Suppose we want to find a straight line that best fits these data points. Mathematically, we want to find the slope and intercept ($x_1$ and $x_2$ respectively) such that

$$y_i = x_1 t_i + x_2, \qquad i \in \{1,2,...,m\} $$

We can write this more explicitly as ${\bf y} = {\bf t}\cdot x_1 + 1\cdot x_2$, which lets us separate the coefficients we are trying to solve for ($x_1$ and $x_2$) from the input data that we have (${\bf y}$ and ${\bf t}$). Now we can put these values into a matrix format:

$$\begin{bmatrix}y_1 \\ y_2 \\ \vdots \\ y_m\end{bmatrix} = \begin{bmatrix}t_1 & 1 \\ t_2 & 1 \\ \vdots & \vdots \\ t_m & 1\end{bmatrix} \begin{bmatrix}x_1 \\ x_2\end{bmatrix} \qquad \Leftrightarrow \qquad {\bf y} = {\bf A}{\bf x}$$

where ${\bf A}$ is the design matrix that is a function of the $t_i$ data, ${\bf y}$ is the vector with the $y_i$ data, and ${\bf x}$ is the vector with the coefficients $x_i$. Notice how the values that multiply our coefficients $x_i$ are the columns of ${\bf A}$.
Generally, if we have a linear system where ${\bf A}$ is an $m \times n$ matrix and $m > n$, we call this system overdetermined. For these systems, the equality is usually not exactly satisfiable, as ${\bf y}$ may not lie in the column space of ${\bf A}$. Therefore, an overdetermined system is better written as
$${\bf A}{\bf x} \cong {\bf y} $$
For an overdetermined system ${\bf A}{\bf x} \cong {\bf y}$, we are typically looking for a solution ${\bf x}$ that minimizes the squared norm of the residual vector ${\bf r} = {\bf y}-{\bf A x}$,

$$ \min_{\bf x} ||{\bf r}||^2 = \min_{\bf x} ||{\bf y} - {\bf A x}||^2$$

This problem is called a linear least-squares problem, and the solution ${\bf x}$ is called the least-squares solution. You learned during lecture that the solution of the least squares problem is also the solution of the system of normal equations:
$${\bf A}^T {\bf A}{\bf x} = {\bf A}^T {\bf y} $$
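As a quick numerical illustration (a minimal sketch with a made-up three-point data set, not part of the assignment), the normal equations can be formed and solved directly in NumPy:

import numpy as np

# Hypothetical data set, purely for illustration
t = np.array([0.0, 1.0, 2.0])
y = np.array([1.1, 1.9, 3.2])

# Design matrix for the line y = x1*t + x2
A = np.array([t, np.ones_like(t)]).T

# Solve the normal equations A^T A x = A^T y
x = np.linalg.solve(A.T @ A, A.T @ y)
print(x)  # approximately [1.05, 1.02]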
Example 1 - fitting a line
In [1]:
import numpy as np
import numpy.linalg as la
import scipy.linalg as sla
import matplotlib.pyplot as plt
Consider the data set given by pts:

In [2]:
pts = np.array([[2, 4], [3, 2], [5, 1], [6, 0]])
t_line = pts[:, 0]
y_line = pts[:, 1]
plt.plot(t_line, y_line, 'o')
print(t_line)

[2 3 5 6]

Check your answers:

Define the matrix ${\bf A}$ that can be used to solve for the coefficients $x_1$ and $x_2$. Store the value of it in variable A_line.

Hint: Try to define the design matrix programmatically (i.e. not hard-coded). Rather than iterating through a loop, you can directly construct ${\bf A}^T$ as a NumPy array whose first entry is t_line and whose second entry is a vector of 1s, then apply the transpose with .T to obtain ${\bf A}$ as desired.

Hint: To obtain the vector of 1s with the appropriate length, you may use t_line**0 or np.ones_like(t_line). Recalling the syntax for power in Python and broadcasting, can you reason why the first option works?
In [19]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
# Define A_line here
A_line = np.ones(pts.shape)
width, length = pts.shape
for i in range(width):
    A_line[i][0] = pts[i][0]
    A_line[i][1] = 1
print(A_line)

[[2. 1.]
 [3. 1.]
 [5. 1.]
 [6. 1.]]
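Following the hints above, the same matrix can also be built without a loop. A minimal sketch (equivalent to the loop version after converting to floats, since t_line here holds integers):

# Stack the rows of A^T, then transpose to get the m x 2 design matrix
A_line = np.array([t_line, t_line**0]).T.astype(float)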
Check your answers:

Use the normal equations to find the coefficients in ${\bf x}$. Store the value of it in variable x_line.

In [4]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
x_line = np.linalg.solve((A_line.T @ A_line), (A_line.T @ y_line))
# x_line = (A_line.T @ y_line) @ (np.linalg.inv(A_line.T @ A_line))

Plot the points and the fit curve:

In [5]:
plt.plot(t_line, y_line, 'o')
plt.plot(t_line, t_line * x_line[0] + x_line[1])

Out[5]: [<matplotlib.lines.Line2D at 0x7f3b849b5d60>]
Linear Least Squares and QR

While easy to understand and remember, using the normal equations can be prone to numerical instabilities (you will learn more about this if you take a Numerical Methods or Analysis class! Hint hint!).

Instead of solving the normal equations using the design matrix ${\bf A}$, we will use its QR decomposition

$$ {\bf A} = {\bf QR} $$

where ${\bf Q}$ is an orthogonal matrix and ${\bf R}$ is an upper triangular matrix. Substituting ${\bf A} = {\bf QR}$ into the normal equations and using ${\bf Q}^T{\bf Q} = {\bf I}$ gives ${\bf R}^T{\bf R}{\bf x} = {\bf R}^T{\bf Q}^T{\bf y}$; cancelling the invertible factor ${\bf R}^T$, the least squares solution ${\bf x}$ can be obtained as:

$$ {\bf A}^T{\bf Ax} = {\bf A}^T{\bf y} \quad \Leftrightarrow \quad {\bf Rx} = {\bf Q}^T{\bf y}$$
Check your answers:
Write the least_sq function that uses QR decomposition and finds the least squares solution ${\bf x}$ given matrix ${\bf A}$ and vector ${\bf y}$. Do not use the numpy.linalg.inv function.

Hint: Try using the functions numpy.linalg.qr and scipy.linalg.solve_triangular.
Let's confirm that we get the same answer as we did above with the normal equations.

In [6]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
def least_sq(A, y):
    # Find the least squares solution to Ax = y
    # by solving Rx = Q^T y for x, where A = QR
    Q, R = np.linalg.qr(A)
    # R is upper triangular, so use a triangular solve (per the hint)
    x = sla.solve_triangular(R, Q.T @ y)
    return x

In [7]:
xqr = least_sq(A_line, y_line)

In [8]:
plt.plot(t_line, y_line, 'o')
plt.plot(t_line, t_line * xqr[0] + xqr[1])
plt.plot(t_line, t_line * x_line[0] + x_line[1], 'x')

Out[8]: [<matplotlib.lines.Line2D at 0x7f3b84930640>]

Example 2: quadratic curve

Let's look at another set of points:

In [9]:
n_quad = 50
t_quad = np.linspace(-1, 1, n_quad)
y_quad = t_quad**2 + np.random.randn(n_quad) * 0.05 + 5
plt.plot(t_quad, y_quad, 'o')

Out[9]: [<matplotlib.lines.Line2D at 0x7f3b84910dc0>]
This looks a lot like a quadratic function! Let's try to fit a curve to this:
$$y = x_1 \,t^2 + x_2\,t + x_3$$
Check your answer:

We want to find the coefficients $x_1$, $x_2$, and $x_3$ that solve the least squares problem ${\bf A x} \cong {\bf y}$; first, we need to construct the design matrix. Taking ${\bf x} = [x_1, x_2, x_3]^T$, define the design matrix ${\bf A}$ corresponding to the model written above. Store the value of it in variable A_quad.

Hint: As with Example 1, try defining the matrix ${\bf A}^T$ and then applying the transpose to obtain ${\bf A}$ in a single line of code. This time, what will the first entry (row of the transpose/column of the design matrix) be?
In [10]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
A_quad = np.ones((50, 3))
for i in range(50):
    A_quad[i][0] = t_quad[i]**2
    A_quad[i][1] = t_quad[i]
    A_quad[i][2] = 1
print(A_quad)

[[ 1.00000000e+00 -1.00000000e+00 1.00000000e+00]
 [ 9.20033319e-01 -9.59183673e-01 1.00000000e+00]
 [ 8.43398584e-01 -9.18367347e-01 1.00000000e+00]
 [ 7.70095793e-01 -8.77551020e-01 1.00000000e+00]
 [ 7.00124948e-01 -8.36734694e-01 1.00000000e+00]
 [ 6.33486047e-01 -7.95918367e-01 1.00000000e+00]
 [ 5.70179092e-01 -7.55102041e-01 1.00000000e+00]
 [ 5.10204082e-01 -7.14285714e-01 1.00000000e+00]
 [ 4.53561016e-01 -6.73469388e-01 1.00000000e+00]
 [ 4.00249896e-01 -6.32653061e-01 1.00000000e+00]
 [ 3.50270721e-01 -5.91836735e-01 1.00000000e+00]
 [ 3.03623490e-01 -5.51020408e-01 1.00000000e+00]
 [ 2.60308205e-01 -5.10204082e-01 1.00000000e+00]
 [ 2.20324865e-01 -4.69387755e-01 1.00000000e+00]
 [ 1.83673469e-01 -4.28571429e-01 1.00000000e+00]
 [ 1.50354019e-01 -3.87755102e-01 1.00000000e+00]
 [ 1.20366514e-01 -3.46938776e-01 1.00000000e+00]
 [ 9.37109538e-02 -3.06122449e-01 1.00000000e+00]
 [ 7.03873386e-02 -2.65306122e-01 1.00000000e+00]
 [ 5.03956685e-02 -2.24489796e-01 1.00000000e+00]
 [ 3.37359434e-02 -1.83673469e-01 1.00000000e+00]
 [ 2.04081633e-02 -1.42857143e-01 1.00000000e+00]
 [ 1.04123282e-02 -1.02040816e-01 1.00000000e+00]
 [ 3.74843815e-03 -6.12244898e-02 1.00000000e+00]
 [ 4.16493128e-04 -2.04081633e-02 1.00000000e+00]
 [ 4.16493128e-04 2.04081633e-02 1.00000000e+00]
 [ 3.74843815e-03 6.12244898e-02 1.00000000e+00]
 [ 1.04123282e-02 1.02040816e-01 1.00000000e+00]
 [ 2.04081633e-02 1.42857143e-01 1.00000000e+00]
 [ 3.37359434e-02 1.83673469e-01 1.00000000e+00]
 [ 5.03956685e-02 2.24489796e-01 1.00000000e+00]
 [ 7.03873386e-02 2.65306122e-01 1.00000000e+00]
 [ 9.37109538e-02 3.06122449e-01 1.00000000e+00]
 [ 1.20366514e-01 3.46938776e-01 1.00000000e+00]
 [ 1.50354019e-01 3.87755102e-01 1.00000000e+00]
 [ 1.83673469e-01 4.28571429e-01 1.00000000e+00]
 [ 2.20324865e-01 4.69387755e-01 1.00000000e+00]
 [ 2.60308205e-01 5.10204082e-01 1.00000000e+00]
 [ 3.03623490e-01 5.51020408e-01 1.00000000e+00]
 [ 3.50270721e-01 5.91836735e-01 1.00000000e+00]
 [ 4.00249896e-01 6.32653061e-01 1.00000000e+00]
 [ 4.53561016e-01 6.73469388e-01 1.00000000e+00]
 [ 5.10204082e-01 7.14285714e-01 1.00000000e+00]
 [ 5.70179092e-01 7.55102041e-01 1.00000000e+00]
 [ 6.33486047e-01 7.95918367e-01 1.00000000e+00]
 [ 7.00124948e-01 8.36734694e-01 1.00000000e+00]
 [ 7.70095793e-01 8.77551020e-01 1.00000000e+00]
 [ 8.43398584e-01 9.18367347e-01 1.00000000e+00]
 [ 9.20033319e-01 9.59183673e-01 1.00000000e+00]
 [ 1.00000000e+00 1.00000000e+00 1.00000000e+00]]
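As the hint suggests, the same design matrix can be built in one line instead of a loop. A sketch (equivalent to the loop above, since t_quad is already a float array):

# Each row of the stacked array becomes one column of A_quad
A_quad = np.array([t_quad**2, t_quad, t_quad**0]).T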
Now we solve for the coefficients ${\bf x}$ using the least_sq function you have defined above. Store the value of it in variable x_quad.

In [11]:
x_quad = least_sq(A_quad, y_quad)
x_quad

Out[11]: array([9.35089178e-01, 1.33628054e-03, 5.01979902e+00])

In [12]:
plt.plot(t_quad, y_quad, 'o')
plt.plot(t_quad, x_quad[0] * t_quad**2 + x_quad[1] * t_quad + x_quad[2])

Out[12]: [<matplotlib.lines.Line2D at 0x7f3b84876880>]
As you can see, even though the resulting function does not appear linear, we are still able to use linear least squares here. This is because the function only needs to be linear with respect to the coefficients $x_i$ (e.g., it cannot contain terms like $x_i^2$).

In fact, what we are doing here is simply taking linear combinations of some basis functions:

$$ y = x_1 f_1(t) + x_2 f_2(t) + x_3 f_3(t) $$

$$\begin{align*} f_1(t) &= t^2 \\ f_2(t) &= t \\ f_3(t) &= 1 \end{align*}$$

As long as we can write the function we are trying to fit in this manner (or somehow get it into that form), we can use least squares.
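To make this concrete, here is a minimal sketch of that idea (the design_matrix helper and the basis list are our own illustration, not part of the assignment): build the design matrix from an arbitrary list of basis functions, then fit with least_sq.

# Hypothetical helper: one column per basis function, A[:, j] = f_j(t)
def design_matrix(t, basis):
    return np.array([f(t) for f in basis]).T

basis = [lambda t: t**2, lambda t: t, lambda t: np.ones_like(t)]
A = design_matrix(t_quad, basis)
x = design_fit = least_sq(A, y_quad)  # recovers the same coefficients as x_quad above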
Example 3: exponential fit

Assume the following set of data points:

In [13]:
n_exp = 50
t_exp = np.linspace(0.1, np.e * 4, n_exp)
y_exp = np.exp(t_exp * 0.5) * 3 + np.random.randn(n_exp) * 2
y_exp = np.where(y_exp < 1e-6, 1e-6, y_exp)
plt.plot(t_exp, y_exp, 'o')

Out[13]: [<matplotlib.lines.Line2D at 0x7f3b8485d3a0>]

Let's say we want to fit the function
$$ y = x_1\,e^{x_2 \, t} $$
to the above plot. Notice that we can't directly use linear least squares, since the function is not
linear with respect to every coefficient $x_i$. What if we take the natural log of each side?
$$ \ln y = \ln x_1 + x_2 t $$
With the change of variables $\bar y = \ln y$ and $\bar x_1 = \ln x_1$, we can rewrite the above
equation as:
$$ \bar y = \bar x_1 + x_2 t $$
Check your answers:

We need to write the design matrix ${\bf A}$ and the right-hand side vector ${\bf \bar y}$ needed to obtain the least squares solution ${\bf \bar x} = [\bar x_1, x_2]^T$. Store the value of ${\bf A}$ in variable A_exp and the value of ${\bf \bar y}$ in variable ybar.

Hint: Recall that numpy.log computes the natural logarithm by default.
In [15]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
# Construct A^T with rows [1, t_exp], then transpose to obtain A
A_exp = np.array([t_exp**0, t_exp]).T
print(A_exp)

# Right-hand side after the change of variables: ybar = ln(y)
ybar = np.log(y_exp)

[[ 1. 0.1 ]
 [ 1. 0.31985974]
 [ 1. 0.53971948]
 [ 1. 0.75957922]
 [ 1. 0.97943896]
 [ 1. 1.19929871]
 [ 1. 1.41915845]
 [ 1. 1.63901819]
 [ 1. 1.85887793]
 [ 1. 2.07873767]
 [ 1. 2.29859741]
 [ 1. 2.51845715]
 [ 1. 2.73831689]
 [ 1. 2.95817663]
 [ 1. 3.17803638]
 [ 1. 3.39789612]
 [ 1. 3.61775586]
 [ 1. 3.8376156 ]
 [ 1. 4.05747534]
 [ 1. 4.27733508]
 [ 1. 4.49719482]
 [ 1. 4.71705456]
 [ 1. 4.9369143 ]
 [ 1. 5.15677405]
 [ 1. 5.37663379]
 [ 1. 5.59649353]
 [ 1. 5.81635327]
 [ 1. 6.03621301]
 [ 1. 6.25607275]
 [ 1. 6.47593249]
 [ 1. 6.69579223]
 [ 1. 6.91565197]
 [ 1. 7.13551172]
 [ 1. 7.35537146]
 [ 1. 7.5752312 ]
 [ 1. 7.79509094]
 [ 1. 8.01495068]
 [ 1. 8.23481042]
 [ 1. 8.45467016]
 [ 1. 8.6745299 ]
 [ 1. 8.89438964]
 [ 1. 9.11424939]
 [ 1. 9.33410913]
 [ 1. 9.55396887]
 [ 1. 9.77382861]
 [ 1. 9.99368835]
 [ 1. 10.21354809]
 [ 1. 10.43340783]
 [ 1. 10.65326757]
 [ 1. 10.87312731]]
Now, solve for the coefficients ${\bf \bar x}$ using the least_sq function you have defined above. Store the value of it in variable xbar.

In [47]:
xbar = least_sq(A_exp, ybar)
xbar

Out[47]: array([1.08057653, 0.50197379])

Check your answers:

You can now use the change of variables to determine the coefficients $x_1$ and $x_2$ from the equation $y = x_1\,e^{x_2 \, t}$. Store the value of them in variables x1_exp and x2_exp respectively.

Hint: You will need to use numpy.exp.

In [54]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
x1_exp = np.exp(xbar[0])  # undo the change of variables: x1 = e^(xbar_1)
x2_exp = xbar[1]

Plot the points and the fit together:

In [55]:
plt.plot(t_exp, y_exp, 'o')
plt.plot(t_exp, x1_exp * np.exp(t_exp * x2_exp))

Out[55]: [<matplotlib.lines.Line2D at 0x7f202ddf52e0>]
Using real data: climate change
In addition to fitting functions to data points, we can use least squares to make predictions about
future events. In this example, we have a dataset containing the extent of arctic sea ice for each year
since 1979 (given as the average during each October). We will fit least squares models to some of
this data, using them to predict the extent of arctic sea ice in future years.
This is based on "Using Data from Climate Science to Teach Introductory Statistics" and uses NOAA data contributed by Florence Fetterer, et al.
In [56]:
import pandas as pd
ice_data = pd.read_csv("N_10_extent_v3.0.csv", dtype={'year': np.int64, 'extent': np.double})
ice_data.head()  # print out first 5 rows of the data

Out[56]:
   year  mo data-type region  extent  area
0  1979  10   Goddard      N    8.75  6.19
1  1980  10   Goddard      N    9.18  6.50
2  1981  10   Goddard      N    8.86  6.27
3  1982  10   Goddard      N    9.42  6.67
4  1983  10   Goddard      N    9.33  6.79
We have imported a data set for you using Pandas, a library that provides routines for manipulating data tables. You do not need to know how to use Pandas for this activity or course. Generally, Pandas is useful for organizing large, heterogeneous data sets (for which NumPy falls short).
In [57]:
year = ice_data['year'].values
year

Out[57]:
array([1979, 1980, 1981, 1982, 1983, 1984, 1985, 1986, 1987, 1988, 1989,
       1990, 1991, 1992, 1993, 1994, 1995, 1996, 1997, 1998, 1999, 2000,
       2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011,
       2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021])

In [58]:
extent = ice_data[' extent'].values
extent

Out[58]:
array([8.75, 9.18, 8.86, 9.42, 9.33, 8.56, 8.55, 9.48, 9.05, 9.13, 8.83,
       8.48, 8.54, 9.32, 8.79, 8.92, 7.83, 9.16, 8.34, 8.45, 8.6 , 8.38,
       8.3 , 8.16, 7.85, 7.93, 7.35, 7.54, 6.04, 7.35, 6.92, 6.98, 6.46,
       5.89, 7.45, 7.23, 6.97, 6.08, 6.77, 6.13, 5.73, 5.33, 6.77])

We can use the first 22 years of the ice data to fit a linear function, and then predict future values of ice extent. Below, we plot the years 1979 to 2000 with the corresponding ice extents.

In [59]:
n_ice = 22
plt.xlabel("Year")
plt.ylabel("Arctic Sea Ice Extent (1,000,000 sq km)")
plt.plot(year[:n_ice], extent[:n_ice], 'o')

Out[59]: [<matplotlib.lines.Line2D at 0x7f202c1bb370>]

Check your answers:

Using your least_sq function and general knowledge of least squares, fit a linear model to the ice extent:

$$y = x_1 \,t + x_2$$

Store your coefficients in the array x_ice_linear.
In [64]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
t_ice = year[:n_ice]    # definition of t
y_ice = extent[:n_ice]  # definition of y
# Determine x_ice_linear using your least_sq() function
A_ice = np.ones((n_ice, 2))
for i in range(n_ice):
    A_ice[i][0] = t_ice[i]
    A_ice[i][1] = 1
# print(A_ice)
x_ice_linear = least_sq(A_ice, y_ice)

Below is the plot of the 22 given points together with the corresponding values from the least squares model:

In [65]:
line_model = x_ice_linear[0] * t_ice + x_ice_linear[1]
plt.xlabel("Year")
plt.ylabel("Arctic Sea Ice Extent (1,000,000 sq km)")
plt.plot(year[:n_ice], extent[:n_ice], 'o')
plt.plot(t_ice, line_model, 's')

Out[65]: [<matplotlib.lines.Line2D at 0x7f202c0f4fd0>]

Suppose we want to use the same model (linear least squares fitted to years 1979 to 2000) to predict the ice extent beyond year 2000. For example, let's consider data from years 1979 to 2021:

In [66]:
years_plus = np.linspace(1979, 2021, 43)
years_plus

Out[66]:
array([1979., 1980., 1981., 1982., 1983., 1984., 1985., 1986., 1987.,
       1988., 1989., 1990., 1991., 1992., 1993., 1994., 1995., 1996.,
       1997., 1998., 1999., 2000., 2001., 2002., 2003., 2004., 2005.,
       2006., 2007., 2008., 2009., 2010., 2011., 2012., 2013., 2014.,
       2015., 2016., 2017., 2018., 2019., 2020., 2021.])

Check your answers:
Determine the predicted ice extent values for the years 1979 to 2021 according to your x_ice_linear model from above. Store these values in the array ice_plus.

Hint: Do not recompute the model using the additional ice extent data. We can think of the data from years 1979 to 2000 as our training set and the data from years 2001 to 2021 as our testing/validation set.
In [68]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
# print(t_ice)
ice_plus = x_ice_linear[0] * years_plus + x_ice_linear[1]

Below, we plot the actual and predicted ice extent for the years 1979 to 2021.

In [69]:
plt.xlabel("Year")
plt.ylabel("Arctic Sea Ice Extent (1,000,000 sq km)")
plt.plot(year, extent, 'o')
plt.plot(years_plus, ice_plus)

Out[69]: [<matplotlib.lines.Line2D at 0x7f202c053c10>]

How does the predicted ice extent differ from the actual ice extent for each year?

What is the estimated ice extent in 2021? What was the actual value?

In [70]:
ice_plus[-1]

Out[70]: 7.847075098814301

In [71]:
extent[-1]

Out[71]: 6.77

The linear model does well until year 2000 (i.e. on the training set), but then it fails to account for an apparent increase in the rate of ice melt. Let's use a quadratic model to try to improve on this:
$$ y = x_1 \, t^2 + x_2 \,t + x_3 $$
Check your answers:
Using the same y_ice and t_ice as before (i.e. the training set of years 1979 to 2000), fit the quadratic model above to the ice extent (that is, solve for $x_1$, $x_2$, and $x_3$). Store your coefficients in the array x_ice_quad.
In [74]:
#grade (enter your code in this cell - DO NOT DELETE THIS LINE)
A_ice2 = np.ones((n_ice, 3))
for i in range(n_ice):
    A_ice2[i][0] = t_ice[i]**2
    A_ice2[i][1] = t_ice[i]
    A_ice2[i][2] = 1
# print(A_ice)
x_ice_quad = least_sq(A_ice2, y_ice)

We plot the quadratic model below.

In [75]:
quad_model = x_ice_quad[0] * years_plus**2 + x_ice_quad[1] * years_plus + x_ice_quad[2]
plt.xlabel("Year")
plt.ylabel("Arctic Sea Ice Extent (1,000,000 sq km)")
plt.plot(year, extent, 'o')
plt.plot(years_plus, quad_model)

Out[75]: [<matplotlib.lines.Line2D at 0x7f202c02be20>]

This model better captures the ice extent beyond year 2000.

Try this:

In what year does this quadratic model predict that the ice extent will reach 0 (i.e. the arctic ice will have completely melted)?
Solve $x_1\,t^2 + x_2\,t + x_3 = 0$ with your group to compute the answer below.
Hint: You seek the solutions to a quadratic equation, whose analytical formula you have likely memorized.
In [36]:
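One possible approach (a sketch, not the official solution): since x_ice_quad already holds the coefficients in descending order of power, numpy.roots gives the zeros of the quadratic directly.

# Roots of x1*t^2 + x2*t + x3 = 0; np.roots expects coefficients
# ordered from highest power to lowest, matching x_ice_quad
roots = np.roots(x_ice_quad)
# Both roots should be real here, since the fitted parabola opens
# downward and crosses zero; keep the one after the training window
print(roots[roots > 2000])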