🔬 Tutorial problems epsilon \epsilon

Contents

🔬 Tutorial problems epsilon ϵ#

Note

This problems are designed to help you practice the concepts covered in the lectures. Not all problems may be covered in the tutorial, those left out are for additional practice on your own.

ϵ.1#

What is the rank of the N×N identity matrix I?

What about the upper-triangular matrix which diagonal elements are 1?

Check all relevant definitions

By definition, rank(I) is equal to the dimension of the span of its columns. Its columns are the N canonical basis vectors in RN, which we know span all of RN. Hence

rank(I)=dim(RN)=N

Draft of the proof for the second question: For the upper triangular matrix start by showing that the columns are linearly independent, and because there are N of them, they span the whole space RN, thus the expression above applies again, and the rank is N.

ϵ.2#

Show that if T:RNRN is nonsingular, i.e. linear bijection, the inverse map T1 is also linear.

Check all relevant definitions

Let T:RNRN be nonsingular and let T1 be its inverse. To see that T1 is linear we need to show that for any pair x,y in RN (which is the domain of T1) and any scalars α and β, the following equality holds:

T1(αx+βy)=αT1x+βT1y.

In the proof we will exploit the fact that T is by assumption a linear bijection.

So pick any vectors x,yRN and any two scalars α,β. Since T is a bijection, we know that x and y have unique preimages under T. In particular, there exist unique vectors u and v such that

Tu=xandTv=y

Using these definitions, linearity of T and the fact that T1 is the inverse of T, we have

T1(αx+βy)=T1(αTu+βTv)=T1(T(αu+βv))=αu+βv=αT1x+βT1y.

This chain of equalities confirms

T1(αx+βy)=αT1x+βT1y.

ϵ.3#

Chose an orthonormal basis in R3 which is not canonical basis {e1,e2,e3} and verify by direct computation that the transformation matrix is orthogonal.

To come up with an orthonormal basis in R3 think first of three orthogonal vectors (directions from the origin), then write them as vectors, and normalize each to length 1.

Denote (x,y,z) the three dimensions in R3. Lets consider the following orthogonal lines:

  • 45 degree line between x and y axes

  • z axis itself since it is normal to the whole plane spanned by x and y axes, and thus orthogonal to the first chosen line

  • lastly, the 45 degree line between the negative side of x axes and positive side of y axes

In other words, choose the following vectors to form the basis:

(1,1,0),(0,0,1),(1,1,0)

It is clear that

Computing the norm of which of the vectors and dividing by it we get the following orthonormal basis:

e1=(12,12,0),e2=(0,0,1),e3=(12,12,0)

The transformation matrix is formed by placing the vectors of the new basis in the coordinates of the old basis into its columns. The matrix is

P=(1201212012010)

To show that the matrix is orthogonal, we need to show that PTP=I.

(1212000112120)(1201212012010)=(12+12012+1201012+12012+12)

Alternatively, by performing Gauss-Jacobian operations, you could derive the inverse matrix P1 and verify that it is equal to PT.

ϵ.4#

For each of the linear maps defined by the following matrices

T1=(4/32/301/35/30001)
T2=(401210201)
T3=(501110710)
T4=(200031013)

perform the following tasks:

  1. Find eigenvalues

  2. Find at least one eigenvector for each eigenvalue

  3. Form a new basis from the eigenvectors (normalized or not)

  4. Compute the transformation matrix to the new basis

  5. Find the matrix T in the new basis and verify that it is diagonal

See example in the lecture notes

T1

1.

To find eigenvalues solve

det(4/3λ2/301/35/3λ0001λ)=(1+λ)(4/3λ2/31/35/3λ)=
=(1+λ)[(43λ)(53λ)233]=(1+λ)(λ23λ+45929)=
=(1+λ)(λ23λ+2)=(1+λ)(λ1)(λ2)

Therefore the eigenvalues are λ1=1, λ2=1 and λ3=2

2.

To find eigenvectors plug the eigenvalues one by one to T1λI and investigate the implications of the resulting system of equations. We should expect to find multiple eigenvectors for each eigenvalue, and therefore are looking for a formula rather than a usual answer.

(T1λ1I)(xyz)=0(7/32/301/38/30000)(xyz)=(000)

Thus, any value of z will do, for example z=1. To find x and y we can use the first two equations (multiplied by 3 right away):

{7x2y=0x+8y=0{x=0y=0

Therefore, the vector v1=(0,0,1) is an eigenvector for λ1=1.

(T1λ2I)(xyz)=0(1/32/301/32/30001)(xyz)=(000)
(120120001)(xyz)=(000)

Obviously, all vectors of the form (2a,a,0) are eigenvectors for λ2=1. In particular, v2=(2/5,1/5,0) has a norm of 1.

(T1λ3I)(xyz)=0(2/32/301/31/30001)(xyz)=(000)
(110110001)(xyz)=(000)

Now, all vectors of the form (a,a,0) are eigenvectors for λ3=2. In particular, v3=(1/2,1/2,0) has a norm of 1.

3.

We have chosen the eigenvectors in such a way that they are already normalized, i.e. have length of 1. To verify, observe

v1=02+02+12=1v2=(2/5)2+(1/5)2+02=1v2=(1/2)2+(1/2)2+02=1

It’s easy to verify that vectors v1,v2,v3 are linearly independent, and therefore the set

{v1,v2,v3}={(001),(2/51/50),(1/21/20)}

forms a normalized basis in R3

4.

The transformation matrix is a matrix with columns formed from the vectors of the new basis expressed in coordinates (``the language’’) of the old basis.

P=(02/51/201/51/2100)

5.

The matrix T and the matrix T in the new basis are related as

T=PTP1T=P1TP

In any case, we need P1. In order to find the inverse of the P matrix, we make the following argument. By definition, PP1=I, and therefore the columns of the unknown matrix P1 (denoted below pi, i=1,2,3) are solutions of the following three systems of equations:

Pp1=(100)Pp2=(010)Pp3=(001)

We can find the solutions of all three systems by Gaussian elimination performing elementary row operations on an ``extra’’ augmented matrix with three columns in place of the right-hand side.

(02/51/210001/51/2010100001)
(100001015/205002/51/2100)
12+(52)(25)=12+22=32
(100001015/2050003/2120)
(100001015/20500012/322/30)
5+52(223)=53
(1000010105/35/300012/322/30)

Therefore, the inverse of the P matrix is given by

P1=(0015/35/302/322/30)

Additional exercise: verify that PP1=I.

Now we can compute P1T1P:

(0015/35/302/322/30)(4/32/301/35/30001)(02/51/201/51/2100)=
(00145959259+5590429+22922910290)(02/51/201/51/2100)=
(001535302234230)(02/51/201/51/2100)=
(100023+1353253204235423523+43)=(100010002)

P1T1P is diagonal with eigenvalues on the main diagonal!

T2

1.

To find eigenvalues solve

det(4λ0121λ0201λ)=

(expanding along the top row)

=(4λ)(1λ)1+2(1λ)=(1λ)(λ25λ+6)=(1λ)(λ2)(λ3)

Therefore the eigenvalues are λ1=1, λ2=2 and λ3=3

2.

To find eigenvectors plug the eigenvalues one by one to T2λI and investigate the implications of the resulting system of equations. We should expect to find multiple eigenvectors for each eigenvalue, and therefore are looking for a formula rather than a usual answer.

(T2λ1I)(xyz)=0(301200200)(xyz)=(000)

Doing Gaussian-Jordan elimination we have

(301020002000)(10130002300000)
(100000100000){x=0y is freez=0

In other words, eigenvector v1 is of the form (0,p,0) where pR. Let us this time not impose the normalization and take v1=(0,1,0) as the first eigenvector.

(T2λ2I)(xyz)=0(201210201)(xyz)=(000)
(201021002010)(1012001100000)
(1012001100000){x=12zy=zz is free

In other words, eigenvector v2 is of the form (12p,p,p) where pR. Again, let us not impose the normalization and take v2=(1,2,2) as the second eigenvector.

(T2λ3I)(xyz)=0(101220202)(xyz)=(000)
(101022002020)(101002200000)
(101001100000){x=zy=zz is free

In other words, eigenvector v3 is of the form (p,p,p) where pR. Let us take v3=(12,1,1) as the second eigenvector.

3.

We have chosen the eigenvectors in a way that they are not normalized, and let’s try to see if this approach results in a diagonal matrix T in the new basis anyway.

{v1,v2,v3}={(010),(122),(111)}

forms a basis in R3

4.

The transformation matrix is a matrix with columns formed from the vectors of the new basis expressed in coordinates (``the language’’) of the old basis.

P=(011121021)

5.

Again, the matrix T and the matrix T in the new basis are related as

T=PTP1T=P1TP

We find P1 in the same way by going Gaussian-Jordan elimination

(011100121010021001)
(121010011100001201)
(101210011100001201)
(100011010101001201)

Therefore, the inverse of the P matrix is given by

P1=(011101201)

Additional exercise: verify that PP1=I.

Now we can compute P1T2P:

(011101201)(401210201)(011121021)=
(011202603)(011121021)=(100020003)

P1T2P is diagonal with eigenvalues on the main diagonal!

T3

1.

To find eigenvalues solve

det(5λ0111λ071λ)=(5λ)(1λ)(λ)+1+7(1λ)=
=(1λ)(λ25λ+7)+1=λ25λ+7λ3+5λ27λ+1==λ3+6λ212λ+8=(λ2)3

Therefore the only eigenvalue is λ=2, this root is repeated three times.

2.

To find eigenvectors plug the eigenvalues one by one to T3λI and investigate the implications of the resulting system of equations.

Because the eigenvalue is repeated, we should expect difficulties finding enough eigenvectors to form a new basis — we need at least three linearly independent eigenvectors in R3!

(T3λI)(xyz)=0(301110712)(xyz)=(000)

Doing Gaussian-Jordan elimination we have

(301011007120)(101300113001130)
(10130011300000){x=13zy=13zz= is free

In other words, all eigenvectors have the form (13p,13p,p) where pR.

In order to form a basis from eigenvectors, we need three linearly independent of them, which is impossible in this case because there is only one free parameter! In other words, all eigenvectors we can come up with will lie within the same line (one degree of freedom), and thus we can not even have two linearly inpendent eigenvectors, let alone three.

3. - 5.

T3 is not diagonalizable through eigendecomposition.

T4

1.

First of all, note that T4 is symmetric with real entries, therefore eigenvalue decomposition definitely be possible – unlike the last case. To find eigenvalues solve

det(2λ0003λ1013λ)=(2λ)((3λ)21)=
=(2λ)(96λ+λ21)=(2λ)(λ26λ+8)==(2λ)(λ2)(λ4)=(λ2)2(λ4)

Therefore eigenvalues are λ1=4 and a repeated one λ2,3=2. Yet, again, we should be able to diagonalize T4 because it is symmetric!

2.

To find eigenvectors plug the eigenvalues one by one to T4λI and investigate the implications of the resulting system of equations.

Because the eigenvalue is repeated, we should expect to do more work than usual, but the basis using eigenvectors should still be possible to find.

(T4λ1I)(xyz)=0(200011011)(xyz)=(000)
(200001100110)(100001100000){x=0y=zz= is free

In other words, the eigenvectors corresponding to λ1 have the form (0,p,p) where pR.

(T4λ2,3I)(xyz)=0(000011011)(xyz)=(000)

It is clear immediately, that the only restriction placed by this linear system of equations is that y=z. In other words, there are no restrictions on two of the three variables, and we can express the eigenvectors corresponding to λ2,3 as (p,q,q) where p,qR are parameters.

3.

In order to form a basis from eigenvectors, we need three linearly independent of them. Fortunately, there is enough degrees of freedom in the parameters (one from the first eigenvalue and two from the second) to have three linearly independent eigenvectors. For example, (0,1,1), (1,0,0) and (0,1,1)

4.

The transformation matrix is a matrix with columns formed from the vectors of the new basis expressed in coordinates (``the language’’) of the old basis.

P=(010101101)

5.

Again, the matrix T and the matrix T in the new basis are related as

T=PTP1T=P1TP

We find P1 in the same way by going Gaussian-Jordan elimination

(010100101010101001)
(101001010100002011)
(10100101010000101212)
(1000121201010000101212)
P1=(0121210001212)

Additional exercise: verify that PP1=I.

Now we can compute P1T2P:

(0121210001212)(200031013)(010101101)=
(0121210001212)(020402402)=(400020002)

We see again that P1T4P is diagonal with eigenvalues on the main diagonal, even though one of the eigenvalues is repeated twice.

ϵ.5#

Compute 10th power of the following matrix

A=(213123112)

No way you should compute tenth power directly. Consider diagonalization of A.

Let’s diagonalize the matrix A by finding a nonsingular matrix T such that A=TDT1 where D is diagonal. Then due to T1T=I we have

A10=(TDT1)10=(TDT1)(TDT1)(TDT1)(TDT1)=TD10T1

and computing D10 is simply taking the 10th power of the diagonal elements.

Let’s find the eigenvalues of A and form a new basis from the eigenvectors to find T.

det(2λ1312λ3112λ)=(2λ)37(2λ)+6

Set t=2λ, then we have to solve the equation

t37t+6=0t317t+7=0(t1)(t2+t+1)7(t1)=0(t1)(t2+t6)=0(t1)(t2)(t+3)=0

Therefore, the eigenvalues given by λ=2t are λ1=1, λ2=0, and λ3=5.

Now we need to find a basis formed of eigenvectors. Each time let’s solve the corresponding system (AλiI)x=0 using Gaussian elimination. For λ1=1, we have

(113011301110)(110000100000)

We conclude that the vectors of the form (p,p,0) where p is a free parameter are eigenvectors corresponding to λ1=1. Let’s choose p=1 to get the eigenvector (1,1,0).

For λ2=0, we have

(213012301120)(101001100000)

In this case the vectors of the form (q,q,q) where q is a free parameter are eigenvectors corresponding to λ2=0. Let’s choose q=1 to get the eigenvector (1,1,1).

For λ3=5, we have

(313013301130)(10320013200000)

Vectors of the form (32s,32s,s) where s is a free parameter are eigenvectors corresponding to λ2=5. Let’s choose s=23 to get the eigenvector (1,1,23).

The transformation matrix T is formed by placing the eqigenvectors as its columns. Therefore, we have

T=(1111110123)

Performing Gaussian elimination further to get the inverse matrix T1 we have

(1111001110100123001)(1111000221100123001)
(1001212001112120005312121)(1001212001015153500131031035)
T1=(1212015153531031035)

Now we have all the components to compute A10:

A10=(1111110123)(10000000510)(1212015153531031035)=
=(10510105100023510)(1212015153531031035)=
=(3259+123259123593259123259+123595959259),

where 59=1953125.

The final answer is

A10=(292968829296875859375292968729296885859375195312519531253906250)

ϵ.6#

A stochastic matrix is a square matrix, whose rows sum up to 1.

Consider the following n×n stochastic matrix:

An=(α10001α10α2001α200α301α3000αn11αn11αn000αn),

where αi[0,1] for i=1,2,,n.

Show that the maximum eigenvalue of An is 1 for all nN.

Both direct proof and proof my mathematical induction will work. In both cases it is worth starting with the simple case of n=2.

Start with n=2 in which case the matrix takes the form

A2=(α11α11α2α2)

The eigenvalues {λj} are solutions to the characteristic equation

det(α1λ1α11α2α2λ)=0

We have

det(α1λ1α11α2α2λ)==(α1λ)(α2λ)(1α1)(1α2)==λ2(α1+α2)λ1+α1+α2==(λ1)(λα1α2+1)

The last line with factorization can be obtained by the quadratic formula or by Vieta’s formula about the product and the sum of the roots of a quadratic equation.

(xa)(xb)=x2x(a+b)+ab{x1+x2=(a+b)x1x2=ab

It is clear from the characteristic equation that the two eigenvalues are λ1=1 and λ2=α1+α211 because αi[0,1]. Thus, the maximum eigenvalue of A2 is λ1=1.

Now consider the general case of n>2. The characteristic polynomial is given by

det(α1λ0001α10α2λ001α200α3λ01α3000αn1λ1αn11αn000αnλ)=

Expanding this determinant along the second column we get

=(α2λ)det(α1λ001α10α3λ01α300αn1λ1αn11αn00αnλ)=

And again expanding this determinant along the second column we get

=(α2λ)(α3λ)det(α1λ01α10αn1λ1αn11αn0αnλ)=

And again and so forth until (n1)-th row

=(α2λ)(α3λ)(αn1λ)det(α1λ1α11αnαnλ)=
=(α2λ)(α3λ)(αn1λ)(λ1)(λα1αn+1)

In the end we are left with nearly the same determinant as in the case of n=2.

The eigenvalues which are the roots of the equation

(α2λ)(α3λ)(αn1λ)(λ1)(λα1αn+1)=0

are

λ1=1λ2=α2λ3=α3λn1=αn1λn=α1+αn1

Given that αi[0,1] for i=1,2,,n we have λi1 for i{2,3,,n1} and λn=α1+αn11.

Therefore λ1=1 is the maximum eigenvalue.