2.8 Other Topics

The following section serves as a repository for mathematical derivations and identities which are used later in this book.

2.8.1 Trigonometric Approximations

Consider the Taylor series approximation for sin(x) centered at x = 0, provided in table 2.4 and rewritten in equation 2.292 for continuity.

sin(x) = x x3 3! + x5 5! x7 7! + ... + (1)n x2n+1 (2n + 1)! (2.292)

We can rewrite equation 2.292 as shown in equations 2.293 and 2.294.

sin(x) = x + ert (2.293)

ert = x3 3! + x5 5! x7 7! + ... + (1)n x2n+1 (2n + 1)! (2.294)

In equations 2.293 and 2.294, ert stands for error truncation terms. For relatively small values x, ert will be small. This is shown in Table 2.5, where values of sin(x) and ert are compared for given values of x. Therefore, we can simply replace sin(x) in a given equation with x since sin(x) x at small x values which have low ert. This is referred to as the small angle approximation.

Table 2.5: Table showing x values, sinx values, and the difference ert = x sinx
x ert sin(x)
0 0 0
0.1 0.00016 0.09983
0.5 0.02057 0.47942
1 0.15852 0.84147

A similar method can be used to show that for small values of x the small angle approximation tan(x) x.

2.8.2 Coordinate axis rotation

In this book, many problems will utilize a coordinate axis or reference frame. At times, this reference frame needs to be rotated. In this section, we will derive the new coordinates of a particle xr,yr after rotating it’s frame of reference by distance 𝜃 from the original coordinate system x,y. In doing this, please consider the Figure 2.14.

PIC

Figure 2.14: Coordinate axis rotation from x,y to xr,yr

Let us analyze point P indicated by distance from the origin r and angle 𝜃a. By definition we know the following is true for the coordinates of P in the x,y coordinate system:

x = |r|cos(𝜃a) (2.295) y = |r|sin(𝜃a) (2.296)

We also know the following is true for the coordinates of P in the xr,yr coordinate system:

xr = |r|cos(𝜃a 𝜃b)

yr = |r|sin(𝜃a 𝜃b)

From trigonometry we can get the following identities:

|r|cos(𝜃a 𝜃b) = |r|cos(𝜃a)cos(𝜃b) + |r|sin(𝜃a)sin(𝜃b) (2.297)

|r|sin(𝜃a 𝜃b) = |r|sin(𝜃a)cos(𝜃b) |r|cos(𝜃a)sin(𝜃b) (2.298)

Plugging 2.295 and 2.296 into 2.297 and 2.298, we get the following:

xr = x cos(𝜃b) + y sin(𝜃b) (2.299) yr = x sin(𝜃b) + y cos(𝜃b) (2.300)

We can rewrite equations 2.299 and 2.300 in matrix form as shown in equation 2.301.

[ xr yr ] = [ cos(𝜃b) sin(𝜃b) sin(𝜃b) cos(𝜃b) ] [ x y ] (2.301)

To go from (xr,yr) to (x,y), we simply take the inverse of the matrix (section 2.6.0.0) to get equation 2.302.

[ x y ] = [ cos(𝜃b) sin(𝜃b) sin(𝜃b) cos(𝜃b) ] [ xr yr ] (2.302)

For a three dimensional coordinate axis rotation, we must consider there are three possible ways the coordinate axis can be rotated; about the x-axis (𝜃x), about the y-axis (𝜃y), and about the z-axis (𝜃z). Each of these rotations has a matrix associated with it, dubbed the ”rotation matrix”. The x-axis rotation matrix is provided in equation 2.303.

Rx(𝜃x) = [ 1 0 0 0 cos(𝜃x) sin(𝜃x) 0 sin(𝜃x) cos(𝜃x) ] (2.303)

The y-axis rotation matrix is provided in equation 2.304.

Ry(𝜃y) = [ cos(𝜃y) 0 sin(𝜃y) 0 1 0 sin(𝜃y) 0 cos(𝜃y) ] (2.304)

The z-axis rotation matrix is provided in equation 2.305.

Rz(𝜃z) = [ cos(𝜃z) sin(𝜃z) 0 sin(𝜃z) cos(𝜃z) 0 0 0 1 ] (2.305)

In addition, the order in which we rotate matters; if we rotate our coordinate system about the x-axis, the y-axis will move. Therefore, if our next rotation is about the ”new” y-axis, a different rotation will be achieved than if we rotate about the original y-axis. In addition, it matters about what coordinate system we rotate. In an extrinsic rotation, each subsequent rotation occurs about a fixed coordinate system. In an intrinsic, each subsequent rotation occurs about the new coordinate system that results from the previous rotation. An extrinsic rotation that occurs about fixed axes x, then y, then z can be conducted using equation 2.306.

[ x y z ] = Rz(𝜃z)Ry(𝜃y)Rx(𝜃x) [ xr yr zr ] (2.306)

An intrinsic rotation about axes x, then the new y, and then the new z can be conducted using equation 2.307.

[ x y z ] = Rx(𝜃x)Ry(𝜃y)Rz(𝜃z) [ xr yr zr ] (2.307)

As shown in equation 2.307, an intrinsic rotation has the reverse order of the extrinsic rotation.

2.8.3 Imaginary Numbers

The imaginary number, symbolized by i, is a term that is used to define the square root of -1 as shown in equation 2.308.

i = 1 (2.308)

There is nothing imaginary about the imaginary number. In fact, the use of the word “imaginary” makes the imaginary number seem far more mysterious than it is. In the following paragraphs it will be shown that applying the same mathematical rules to imaginary numbers that we use for real numbers leads to very useful and interesting results.

Numbers that combine real and imaginary numbers are called complex numbers. There are two forms in which complex numbers can be expressed; these are the polar form and Cartesian form (sometimes called rectangular form). A generalized complex number in the polar form is shown in equation 2.309.

z = re𝑖𝜃 (2.309)

In equation 2.309, r is called the magnitude of the complex number and is also expressed as |z|. A generalized complex number written in Cartesian form is shown in equation 2.310.

z = x ± 𝑦𝑖 (2.310)

Polar form can be converted to Cartesian form by use of Euler’s identity (equation 2.248) as shown in equation 2.311.

x = rcos(𝜃) y = rsin(𝜃) (2.311)

The complex conjugate of a given complex number provided in Cartesian form is defined as a number which has the same real part as the original number, but an opposite imaginary part. To find the magnitude of a complex number provided in Cartesian form, we take the square root of the product of the number multiplied by it’s complex conjugate as shown in equation 2.312.

r = |z| = (x + 𝑦𝑖)(x 𝑦𝑖) = x2 + y2 (2.312)

To add two complex numbers z1 and z2 we simply individually add the real and imaginary components. To do this, we use the Cartesian form as shown in equation 2.313.

z1 + z2 = x1 + x2 + (y1 + y2)i (2.313)

Similarly, to subtract two complex numbers z2 from z1 we subtract the real and imaginary parts individually. As with addition, to do this we use Cartesian form as shown in equation 2.314.

z1 z2 = x1 x2 + (y1 y2)i (2.314)

To multiply complex numbers in polar form, we can use rules of exponents and simply add the exponents as shown in equation 2.315.

z1z2 = r1r2ei(𝜃1+𝜃2) (2.315)

To multiply complex numbers in Cartesian form, we can foil as shown in equation 2.316.

z1z2 = (x1x2 y1y2) + (x1y2 + x2y1)i (2.316)

To divide complex numbers in polar form, we can again use the rules of exponents as shown in equation 2.317.

z1 z2 = r1 r2ei(𝜃1𝜃2) (2.317)

To divide complex numbers in Cartesian form, we must first multiply the denominator (the term we are dividing by) by it’s complex conjugate as shown in equation 2.318.

z1 z2 = x1 + y1i x2 + y2i x2 + y2i x2 y2i (2.318)

Simplifying equation 2.319, you may notice that the denominator becomes |z2|2 which is a real number. Therefore, our simplified quotient is shown in equation 2.319

z1 z2 = x1x2 y1y2 |z2|2 + x1y2 + x2y1 |z2|2 i (2.319)
Complex functions

A complex function is a function which acts on a complex number. This means that a complex number is taken in by the function and based on this input, a complex number is output by the function. Therefore, if our input is x + 𝑦𝑖 then our output will be u(x,y) + v(x,y)i as shown in equation 2.320.

f(z) = f(x + 𝑦𝑖) = u(x,y) + v(x,y)i (2.320)

In equation 2.320, u is a real valued function of (x,y) which constitutes the real part of the function output, and v is another real valued function of (x,y) which constitutes the imaginary part of the function output.

An example of a complex function is provided in equation 2.321.

f(z) = z2 + 1 (2.321)

Plugging in the Cartesian form of a complex number (equation 2.310)for z, we get equation 2.322.

f(z) = (x2 y2 + 1) + (2𝑥𝑦)i (2.322)

From equation 2.322, we can clearly see that u(x,y) = x2 y2 + 1, and v(x,y) = 2𝑥𝑦. A fundamental constraint with the graphing a complex functions is that we have two inputs x and y along with two outputs u(x,y) and v(x,y), making this a four dimensional dataset which cannot be easily shown in three dimensions. Therefore, the magnitude |f(z)| is typically plotted as a function of x and y. Such a plot is for the function in equation 2.322 is shown in Figure 2.15.

PIC

Figure 2.15: |f(z)| as a function of x and y for equation 2.322.
Why complex numbers are commonly used to model waves

Waves are fundamentally sinusoidal functions (either sin(𝜃) or cos(𝜃)). In many physics/engineering applications, we are interested in adding, subtracting, multiplying, and dividing wave functions (i.e. sinusoidal functions). From Euler’s identity (equation 2.248), we know that a complex number is sinusoidal in nature making it eligible for modeling waves. In addition, operating on complex numbers is relatively simple due to the availability of both the Cartesian and polar forms; adding/subtracting (equations 2.313 and 2.314) is easily done in Cartesian form, while multiplying/dividing (equations 2.315 and 2.317) is easily done in polar form. This simplicity of operations is why complex numbers are commonly used to model waves.