Tuesday, April 22, 2014
Sections 8.1 and 8.2
Least Squares approximation
We approximate a given function $f$ by a polynomial $p$, usually $p(x) = a_0 + a_1 x$ (linear) or $p(x) = a_0 + a_1 x + a_2 x^2$ (quadratic). $f$ is given, so we wish to minimize the error $E$ between $f$ and $p$.
In 8.1, $E = \sum_{i=1}^m w_i \left[ y_i - p(x_i) \right]^2$, where $w$ is a probability weight distribution.
In 8.2, $E = \int_a^b w(x) \left[ f(x) - p(x) \right]^2 \, dx$.
Common definitions of $w$:

- $w(x) = 1$ (classic definition)
- $w(x) = e^{-x^2}$ on $[-\infty, \infty]$ (Hermite weight)
- $w(x) = \frac{1}{\sqrt{1 - x^2}}$ on $[-1, 1]$ (Chebyshev weight)

Basic Concept
We compute

$E(a_0, \dots, a_n) = \int_a^b w(x) \left[ f(x) - \sum_{k=0}^n a_k x^k \right]^2 \, dx$

and find the minimum. To find this minimum, we solve $\frac{\partial E}{\partial a_j} = 0$ for each $j = 0, \dots, n$. This gives a system of normal equations:

$\sum_{k=0}^n a_k \int_a^b w(x)\, x^{j+k} \, dx = \int_a^b w(x)\, f(x)\, x^j \, dx, \qquad j = 0, \dots, n$

The coefficient matrix is positive definite, so it has an inverse. The system is solvable, but it's a pain to solve (for $w(x) = 1$ on $[0, 1]$ it is the notoriously ill-conditioned Hilbert matrix).
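As a sketch of the normal-equations approach, assuming $w(x) = 1$ on $[0, 1]$ and a hypothetical target $f(x) = e^x$ (the notes' example function is not specified), the system can be built and solved with sympy:

```python
import sympy as sp

x = sp.symbols('x')
f = sp.exp(x)   # hypothetical target function
a, b = 0, 1     # interval [0, 1], weight w(x) = 1
n = 2           # quadratic fit

# Normal equations: sum_k a_k * integral(x^(j+k)) = integral(f * x^j)
A = sp.Matrix(n + 1, n + 1, lambda j, k: sp.integrate(x**(j + k), (x, a, b)))
rhs = sp.Matrix(n + 1, 1, lambda j, _: sp.integrate(f * x**j, (x, a, b)))
coeffs = A.solve(rhs)

p = sp.expand(sum(coeffs[k] * x**k for k in range(n + 1)))
print(p)
```

Here $A$ is exactly the $3 \times 3$ Hilbert matrix, and the residual $f - p$ comes out orthogonal to $1$, $x$, and $x^2$, which is what the normal equations demand.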
(Figures: the original function, and the best quadratic fit.)
For example, find the best fit to a given $f$ by a quadratic $p(x) = a_0 + a_1 x + a_2 x^2$ on $[0, 1]$.
Simplifying Normal Equations
What if the coefficient matrix were a diagonal matrix? That is, what if $\int_a^b w(x)\, \varphi_j(x)\, \varphi_k(x) \, dx = 0$ for $j \ne k$? Then $\varphi_0$, $\varphi_1$, $\varphi_2$, etc. would be an orthogonal basis for our regression space.
Continuing with our example, let $P = \operatorname{span}\{\varphi_0, \varphi_1, \varphi_2\}$, where $\varphi_0(x) = 1$, $\varphi_1(x) = x - \frac{1}{2}$, and $\varphi_2(x) = x^2 - x + \frac{1}{6}$. $\{\varphi_0, \varphi_1, \varphi_2\}$ is an orthogonal basis for our regression space.
This gives a much easier system to solve: each coefficient decouples, so

$a_k = \frac{\int_0^1 f(x)\, \varphi_k(x) \, dx}{\int_0^1 \varphi_k(x)^2 \, dx} = \frac{\langle f, \varphi_k \rangle}{\langle \varphi_k, \varphi_k \rangle}, \qquad k = 0, 1, 2$
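A quick sanity check of the diagonal system (again with the hypothetical $f(x) = e^x$ on $[0, 1]$; only the basis polynomials come from the notes):

```python
import sympy as sp

x = sp.symbols('x')
f = sp.exp(x)  # hypothetical target; the notes' example function is not specified

# Orthogonal basis on [0, 1] (shifted monic Legendre polynomials)
phi = [sp.Integer(1), x - sp.Rational(1, 2), x**2 - x + sp.Rational(1, 6)]

def ip(u, v):
    return sp.integrate(u * v, (x, 0, 1))

# Off-diagonal inner products vanish, so a_k = <f, phi_k> / <phi_k, phi_k>
coeffs = [ip(f, p) / ip(p, p) for p in phi]
best_fit = sp.expand(sum(c * p for c, p in zip(coeffs, phi)))
print(best_fit)
```

Since the $\varphi_k$ are mutually orthogonal, no linear solve is needed; each $a_k$ is a single quotient of integrals.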
There are two ways to find an orthogonal basis for $P$:
- Legendre polynomials
- Gram-Schmidt
Legendre Polynomials
For $[-1, 1]$, we have $P_0(t) = 1$, $P_1(t) = t$, and $P_2(t) = t^2 - \frac{1}{3}$ (monic form).

These only work on $[-1, 1]$, so we shift and scale to $[0, 1]$ by substituting $t = 2x - 1$ (or $x = \frac{t + 1}{2}$). Performing this substitution gives our $\varphi_k$'s above (up to constant multiples).
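The shift can be checked symbolically (assuming the monic Legendre forms $1$, $t$, $t^2 - \frac{1}{3}$):

```python
import sympy as sp

x, t = sp.symbols('x t')
P = [sp.Integer(1), t, t**2 - sp.Rational(1, 3)]  # monic Legendre on [-1, 1]

# Substitute t = 2x - 1, which maps [0, 1] onto [-1, 1]
shifted = [sp.expand(p.subs(t, 2 * x - 1)) for p in P]
print(shifted)
```

The results $1$, $2x - 1$, and $4x^2 - 4x + \frac{2}{3}$ are constant multiples of $\varphi_0$, $\varphi_1$, $\varphi_2$, and they remain orthogonal on $[0, 1]$.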
Gram-Schmidt
This will always give an orthonormal basis, so $\langle e_i, e_j \rangle = \delta_{ij}$ (Sage, Python 2):
```python
def ip(f, g, a, b):
    return integrate(f*g, x, a, b)

def norm(f, a, b):
    return sqrt(ip(f, f, a, b))

def normalize(f, a, b):
    return f/norm(f, a, b)

def gram_schmidt(basis, a, b):
    on_basis = [normalize(basis[0], a, b)]
    for i in xrange(1, len(basis)):
        # subtract the projection onto the span of the previous vectors
        p = sum(ip(basis[i], e, a, b) * e for e in on_basis)
        next_e = normalize(basis[i] - p, a, b)
        on_basis.append(next_e)
    return map(simplify, on_basis)
```



Now we have $e_0(x) = 1$, $e_1(x) = \sqrt{3}\,(2x - 1)$, and $e_2(x) = \sqrt{5}\,(6x^2 - 6x + 1)$ on $[0, 1]$.
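A runnable sympy translation of the Sage helpers above (assumptions: interval $[0, 1]$, monomial starting basis $1, x, x^2$):

```python
import sympy as sp

x = sp.symbols('x')

def ip(f, g, a, b):
    return sp.integrate(f * g, (x, a, b))

def normalize(f, a, b):
    return f / sp.sqrt(ip(f, f, a, b))

def gram_schmidt(basis, a, b):
    on_basis = [normalize(basis[0], a, b)]
    for v in basis[1:]:
        # subtract the projection onto the previously built vectors
        p = sum(ip(v, e, a, b) * e for e in on_basis)
        on_basis.append(normalize(v - p, a, b))
    return [sp.expand(e) for e in on_basis]

e0, e1, e2 = gram_schmidt([sp.Integer(1), x, x**2], 0, 1)
print(e0, e1, e2)
```

This reproduces the orthonormal basis $1$, $\sqrt{3}(2x - 1)$, $\sqrt{5}(6x^2 - 6x + 1)$.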
Quiz Discussion: Matrix Norms
"This is horrible." (Bojan Popov)
Find $\|A\|_1$, $\|A\|_2$, and $\|A\|_\infty$ for a given matrix $A$:

- $\|A\|_\infty$ is the max absolute row sum, so it can be read straight off the rows.
- $\|A\|_1$ is just the max absolute column sum.
- $\|A\|_2 = \sqrt{\rho(A^T A)}$, so we need the largest eigenvalue of $A^T A$.

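A numerical illustration of the three norms (the quiz's specific matrix is not recorded here, so this uses a hypothetical $2 \times 2$ example):

```python
import numpy as np

A = np.array([[1.0, -2.0],
              [3.0,  4.0]])  # hypothetical matrix; the quiz matrix is not recorded

inf_norm = np.abs(A).sum(axis=1).max()                 # max absolute row sum
one_norm = np.abs(A).sum(axis=0).max()                 # max absolute column sum
two_norm = np.sqrt(np.linalg.eigvalsh(A.T @ A).max())  # sqrt of largest eigenvalue of A^T A

print(inf_norm, one_norm, two_norm)
```

Each hand computation agrees with `np.linalg.norm(A, ord)` for `ord` equal to `np.inf`, `1`, and `2` respectively.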
