#### CS3414 Afterclass Notes --- 5 June, 2002

**Fitting Data**
- Least Squares Approximation of Data

- Recall the basic idea. Given m data points (x_{i}, y_{i}) and n basis functions b_{j}(x), find a linear combination of the basis functions (call it F(x)) that approximates the given data in some sense.

- More formally, this corresponds to solving the `overdetermined linear system' B alpha = y, where B is an m x n matrix with B_{ij} = b_{j}(x_{i}), and alpha is an n-vector of unknown coefficients (which define the particular linear combination F(x)).
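As a concrete illustration of how B is built, here is a minimal numpy sketch. The monomial basis b_j(x) = x^j and the data values are assumptions for illustration only, not values from the class example.

```python
import numpy as np

# Hypothetical data: m = 5 points (illustrative values only)
x = np.array([0.0, 0.5, 1.0, 1.5, 2.0])
y = np.array([1.0, 0.9, 1.3, 2.1, 3.2])

# One common choice of basis: monomials b_j(x) = x**j for j = 0..n-1
n = 3
B = np.column_stack([x**j for j in range(n)])  # B[i, j] = b_j(x_i), shape (m, n)
```

With m = 5 and n = 3 the system B alpha = y has more equations than unknowns, which is what makes it overdetermined.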

- Note that the "least squares" solution to this problem is the vector alpha that minimizes the two-norm of the residual `r = y - B alpha`. (Minimizing with respect to other norms yields other approaches, mentioned briefly last time.)
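For reference, numpy's `lstsq` computes exactly this minimizer. The data and monomial basis below are again hypothetical, just to make the sketch runnable:

```python
import numpy as np

# Illustrative data and monomial basis (hypothetical values)
x = np.array([0.0, 0.5, 1.0, 1.5, 2.0])
y = np.array([1.0, 0.9, 1.3, 2.1, 3.2])
B = np.column_stack([x**j for j in range(3)])

# lstsq returns the alpha minimizing the two-norm of y - B alpha
alpha, *_ = np.linalg.lstsq(B, y, rcond=None)
r = y - B @ alpha
print(np.linalg.norm(r))  # two-norm of the residual at the minimizer
```

No other coefficient vector can produce a smaller residual norm than this alpha.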

- How to find the least squares solution?

- Form and solve the `normal equations'
  - Can show that the least squares solution satisfies the n x n linear system B^{T} B alpha = B^{T} y.
  - But the condition number of B^{T} B can be very large (roughly the square of cond(B)), which means this method may be unstable. And depending on the choice of basis functions, B (and therefore B^{T} B) may be dense.

- Do something more clever...
  - Choose basis functions with `small support' (nonzero on only a small region), so that B^{T} B is very sparse and has better conditioning.
  - Choose `orthonormal' basis functions so that B^{T} B is the identity matrix (see Section 10.2).
  - Use an orthogonal factorization method (see CS 5485).
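The two routes can be compared directly in numpy. This sketch assumes a cubic monomial basis and made-up data (y = exp(x) sampled on [0, 1]); it forms the normal equations, checks the conditioning claim, and then solves the same problem via a QR factorization:

```python
import numpy as np

# Hypothetical data: sample exp(x) at 20 points on [0, 1]
x = np.linspace(0.0, 1.0, 20)
y = np.exp(x)
B = np.column_stack([x**j for j in range(4)])  # cubic monomial basis

# Route 1: form and solve the normal equations B^T B alpha = B^T y
alpha_ne = np.linalg.solve(B.T @ B, B.T @ y)

# cond(B^T B) is roughly cond(B)**2 -- the source of potential instability
print(np.linalg.cond(B), np.linalg.cond(B.T @ B))

# Route 2: orthogonal factorization B = QR, then solve R alpha = Q^T y
Q, R = np.linalg.qr(B)
alpha_qr = np.linalg.solve(R, Q.T @ y)
```

On this small, well-behaved example the two answers agree; the QR route avoids squaring the condition number, which matters when B is poorly conditioned.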

- We concluded by looking at a Matlab example of approximating some data using a cubic polynomial and a linear spline.
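The notes do not reproduce that Matlab example, but a rough Python analogue of the same two fits might look as follows. The data, knot placement, and hat-function construction here are all illustrative assumptions; the hat functions also show the `small support' idea from above.

```python
import numpy as np

# Hypothetical data (the actual class example is not reproduced in the notes)
x = np.linspace(0.0, 3.0, 15)
y = np.sin(x) + 0.1 * np.cos(5 * x)

# Fit 1: cubic polynomial -- least squares on a degree-3 Vandermonde matrix
B_poly = np.vander(x, 4)                      # columns x^3, x^2, x, 1
c, *_ = np.linalg.lstsq(B_poly, y, rcond=None)
F_poly = B_poly @ c

# Fit 2: linear spline -- hat-function (small-support) basis on 5 knots
knots = np.linspace(0.0, 3.0, 5)

def hat(t, k):
    """Piecewise-linear hat function: 1 at knots[k], 0 at the other knots."""
    return np.interp(t, knots, (knots == knots[k]).astype(float))

B_spline = np.column_stack([hat(x, k) for k in range(len(knots))])
a, *_ = np.linalg.lstsq(B_spline, y, rcond=None)
F_spline = B_spline @ a
```

Both fits reduce to the same overdetermined system B alpha = y; only the basis functions (and hence the structure of B) differ.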