Calculus/Vector calculus

From Wikibooks, open books for an open world
Jump to navigation Jump to search
← Inverse function theorem, implicit function theorem Calculus Vector calculus identities →
Vector calculus

Vector calculus specifically refers to multi-variable calculus applied to scalar and vector fields. While vector calculus can be generalized to dimensions (), this chapter will specifically focus on 3 dimensions ()

Fields in vector calculus[edit | edit source]

A depiction of xyz Cartesian coordinates with the ijk elementary basis vectors.

Scalar fields[edit | edit source]

A scalar field is a function that assigns a real number to each point in space. Scalar fields typically denote densities or potentials at each specific point. For the sake of simplicity, all scalar fields considered by this chapter will be assumed to be defined at all points and differentiable at all points.

Vector fields[edit | edit source]

A vector field is a function that assigns a vector to each point in space. Vector fields typically denote flow densities or potential gradients at each specific point. For the sake of simplicity, all vector fields considered by this chapter will be assumed to be defined at all points and differentiable at all points.

A depiction of cylindrical coordinates and the accompanying orthonormal basis vectors.

Vector fields in cylindrical coordinates[edit | edit source]

The cylindrical coordinate system used here has the three parameters: . The Cartesian coordinate equivalent to the point is

Any vector field in cylindrical coordinates is a linear combination of the following 3 mutually orthogonal unit length basis vectors:

Note that these basis vectors are not constant with respect to position. The fact that the basis vectors change from position to position should always be considered. The cylindrical basis vectors change according to the following rates:

Any vector field expressed in cylindrical coordinates has the form:

Given an arbitrary position that changes with time, the velocity of the position is:

The coefficient of for the term originates from the fact that as the azimuth angle increases, the position swings around at a speed of .

A depiction of spherical coordinates and the accompanying orthonormal basis vectors.

Vector fields in spherical coordinates[edit | edit source]

The spherical coordinate system used here has the three parameters: . The Cartesian coordinate equivalent to the point is

Any vector field in spherical coordinates is a linear combination of the following 3 mutually orthogonal unit length basis vectors:

Note that these basis vectors are not constant with respect to position. The fact that the basis vectors change from position to position should always be considered. The spherical basis vectors change according to the following rates:

Any vector field expressed in spherical coordinates has the form:

Given an arbitrary position that changes with time, the velocity of this position is:

The coefficient of for the term arises from the fact that as the latitudinal angle changes, the position traverses a great circle at a speed of .

The coefficient of for the term arises from the fact that as the longitudinal angle changes, the position traverses a latitude circle at a speed of .

Volume, path, and surface integrals[edit | edit source]

Volume Integrals[edit | edit source]

Volume integrals have already been discussed in the chapter Multivariable calculus, but a brief review is given here for completeness.

Given a scalar field that denotes a density at each specific point, and an arbitrary volume , the total "mass" inside of can be determined by partitioning into infinitesimal volumes. At each position , the volume of the infinitesimal volume is denoted by the infinitesimal . This gives rise to the following integral:

Path Integrals[edit | edit source]

Given any oriented path (oriented means that there is a preferred direction), the differential denotes an infinitesimal displacement along in the preferred direction. This differential can be used in various path integrals. Letting denote an arbitrary scalar field, and denote an arbitrary vector field, various path integrals include:

, , , , and many more.

denotes the total displacement along , and denotes the total length of .

Calculating Path Integrals[edit | edit source]

To compute a path integral, the continuous oriented curve must be parameterized. will denote the point along indexed by from the range . must be the starting point of and must be the ending point of . As increases, must proceed along in the preferred direction. An infinitesimal change in , , results in the infinitesimal displacement along . In the path integral , the differential can be replaced with to get

Example 1

As an example, consider the vector field and the straight line curve that starts at and ends at . can be parameterized by where . . We can then evaluate the path integral as follows:


If a vector field denotes a "force field", which returns the force on an object as a function of position, the work performed on a point mass that traverses the oriented curve is

Example 2

Consider the gravitational field that surrounds a point mass of located at the origin: using Newton's inverse square law. The force acting on a point mass of at position is . In spherical coordinates the force is (note that are the unit length mutually orthogonal basis vectors for spherical coordinates).

Consider an arbitrary path that traverses that starts at an altitude of and ends at an altitude of . The work done by the gravitational field is:

The infinitesimal displacement is equivalent to the displacement expressed in spherical coordinates: .

The work is equal to the amount of gravitational potential energy lost, so one possible function for the gravitational potential energy is or equivalently, .


Example 3

Consider the spiral parameterized with respect to in cylindrical coordinates by . Consider the problem of determining the spiral's length with restricted to the range . An infinitesimal change of in results in the infinitesimal displacement:

The length of the infinitesimal displacement is .

The length of the spiral is therefore:



Surface Integrals[edit | edit source]

Given any oriented surface (oriented means that the there is a preferred direction to pass through the surface), an infinitesimal portion of the surface is defined by an infinitesimal area , and a unit length outwards oriented normal vector . has a length of 1 and is perpendicular to the surface of , while penetrating in the preferred direction. The infinitesimal portion of the surface is denoted by the infinitesimal "surface vector": . If a vector field denotes a flow density, then the flow through the infinitesimal surface portion in the preferred direction is .

The infinitesimal "surface vector" describes the infinitesimal surface element in a manner similar to how the infinitesimal displacement describes an infinitesimal portion of a path. More specifically, similar to how the interior points on a path do not affect the total displacement, the interior points on a surface to not affect the total surface vector.

The displacement between two points is independent of the path that connects them.

Consider for instance two paths and that both start at point , and end at point . The total displacements, and , are both equivalent and equal to the displacement between and . Note however that the total lengths and are not necessarily equivalent.

Similarly, given two surfaces and that both share the same counter-clockwise oriented boundary , the total surface vectors and are both equivalent and are a function of the boundary . This implies that a surface can be freely deformed within its boundaries without changing the total surface vector. Note however that the surface areas and are not necessarily equivalent.

The fact that the total surface vectors of and are equivalent is not immediately obvious. To prove this fact, let be a constant vector field. and share the same boundary, so the flux/flow of through and is equivalent. The flux through is , and similarly for is . Since for every choice of , it follows that .

The geometric significance of the total surface vector is that each component measures the area of the projection of the surface onto the plane formed by the other two dimensions. Let be a surface with surface vector . It is then the case that: is the area of the projection of onto the yz-plane; is the area of the projection of onto the xz-plane; and is the area of the projection of onto the xy-plane.

The boundary of is counter-clockwise oriented.

Given an oriented surface , another important concept is the oriented boundary. The boundary of is an oriented curve but how is the orientation chosen? If the boundary is "counter-clockwise" oriented, then the boundary must follow a counter-clockwise direction when the oriented surface normal vectors point towards the viewer. The counter-clockwise boundary also obeys the "right-hand rule": If you hold your right hand with your thumb in the direction of the surface normals (penetrating the surface in the "preferred" direction), then your fingers will wrap around in the direction of the counter-clockwise oriented boundary.

Example 1

Consider the Cartesian points ; ; ; and .

Let be the surface formed by the triangular planes ; ; and where the vertices are listed in a counterclockwise direction relative to the surface normal directions. The surface vectors of each plane are respectively ; ; and respectively which add to a total surface vector of .

Let be the surface formed by the single triangular plane where the vertices are listed in a counterclockwise direction relative to the normal direction. It can be seen that and share a the common counter clockwise boundary The surface vector is which is equivalent to .


Example 2

This example will show how moving a point that is in the interior of a "triangular mesh" does not affect the total surface vector. Consider the points where . Let the closed path be defined by the cycle . For simplicity, . For each , will denote the displacement of relative to . Like with , .

Let denote a surface that is a "triangular mesh" comprised of the closed fan of triangles: ; ; ...; ; where the vertices of each triangle are listed in a counterclockwise direction. It can be seen that the counterclockwise boundary of is and does not depend on the location of . The total surface vector for is:

Now displace by to get . The displacement vector of relative to becomes . The counterclockwise boundary is unaffected. The total surface vector is:

Therefore moving the interior point neither affects the boundary, nor the total surface vector.



Calculating Surface Integrals[edit | edit source]

To calculate a surface integral, the oriented surface must be parameterized. Let be a continuous function that maps each point from a two-dimensional domain to a point in . must be continuous and onto. While does not necessarily have to be one to one, the parameterization should never "fold back" on itself. The infinitesimal increases in and are respectively and . These respectively give rise to the displacements and . Assuming that the surface's orientation follows the right hand rule with respect to the displacements and , the surface vector that arises is .

In the surface integral , the differential can be replaced with to get .

Example 3

Consider the problem of computing the surface area of a sphere of radius .

Center the sphere on the origin, and using and as the parameter variables, the sphere can be parameterized in spherical coordinates via where and . The infinitesimal displacements from small changes in the parameters are:

causes

causes

The infinitesimal surface vector is hence . While not important to this example, note how the parameterization was chosen so that the surface vector points outwards. The area is .

The total surface area is hence:


The Gradient and Directional Derivatives[edit | edit source]

Given a scalar field that denotes a potential, and given a curve , a commonly sought after quantity is the rate of change in as is being traversed. Let be an arbitrary parameter for , and let denote the point indexed by . Given an arbitrary which corresponds to the point , then using the chain rule gives the following expression for the rate of increase of at , :

where is a vector field that denotes the "gradient" of , and is the unnormalized tangent of .

If is an arc-length parameter, i.e. , then the direction of the gradient is the direction of maximum gain: Given any unit length tangent , the direction will maximize the rate of increase in . This maximum rate of increase is .

Calculating total gain[edit | edit source]

Given the gradient of a scalar field : , the difference between at two different points can be calculated, provided that there is a continuous path that links the two points. Let denote an arbitrary continuous path that starts at point and ends at point . Given an infinitesimal path segment with endpoints and , let be an arbitrary point in . denotes the infinitesimal displacement denoted by . The increase in along is:

The relative error in the approximations vanish as . Adding together the above equation over all infinitesimal path segments of yields the following path integral equation:

This is the path integral analog of the fundamental theorem of calculus.

The gradient in cylindrical coordinates[edit | edit source]

Let be a scalar field that denotes a potential and a curve that is parameterized by : . Let the rate of change in be quantified by the vector . The rate of change in is:

Therefore in cylindrical coordinates, the gradient is:

The gradient in spherical coordinates[edit | edit source]

Let be a scalar field that denotes a potential and a curve that is parameterized by : . Let the rate of change in be quantified by the vector . The rate of change in is:

Therefore in spherical coordinates, the gradient is:

The Directional Derivative[edit | edit source]

Given a scalar field and a vector , scalar field computes the rate of change in at each position where the velocity of is . Scalar field can also be expressed as . Velocity can also be a vector field so depends on the position . Scalar field becomes .

In Cartesian coordinates where the directional derivative is:

In cylindrical coordinates where the directional derivative is:

In spherical coordinates where the directional derivative is:

What makes the discussion of directional derivatives nontrivial is the fact that can instead be a vector field . Vector field computes at each position where .

In cylindrical coordinates, basis vectors and are not fixed, and in spherical coordinates, all of the basis vectors , , and are not fixed. This makes determining the directional derivative of a vector field that is expressed using the cylindrical or spherical basis vectors non-trivial. To directly compute the directional derivative, the rates of change of each basis vector with respect to each coordinate should be used. Alternatively, the following identities related to the directional derivative can be used (proofs can be found here):

Given vector fields , , and , then

Given vector fields and , and scalar field , then

In cylindrical coordinates, and

In spherical coordinates, , and , and

Full Expansions

In Cartesian coordinates where and the directional derivative is:

In cylindrical coordinates where and the directional derivative is:

In spherical coordinates where and the directional derivative is:


The Divergence and Gauss's Divergence Theorem[edit | edit source]

Let denote a vector field that denotes "flow density". For any infinitesimal surface vector at position , the flow through in the preferred direction is . is the flow density parallel to the x-axis etc.

Given a volume with a closed surface boundary with an outwards orientation, the total outwards flow/flux through is given by the surface integral . This outwards flow is equal to the total flow that is being generated in the interior of .

For an infinitesimal rectangular prism (, , and ) that is centered on position , the outwards flow through the surface is:

All relative errors vanish as .

is the "divergence" of and is the density of "flow generation" at . As noted above, the total outwards flow through is the total flow generated inside of , which gives Gauss's divergence theorem:

This image depicts an example of the total flow across a closed boundary being the total flow generated inside the boundary.

In the image to the right, an example of the total flow across a closed boundary being the total flow generated in the interior of the boundary is given. The direction of the flow across each edge is denoted by the direction of the arrows, and the rate is denoted by the number of arrows. Each node inside the boundary is labelled with the rate of flow generation at the current node. It can be checked that a net total of 2 units of flow is being drawn into the boundary, and the total rate of flow generation across all interior nodes is a net consumption of 2 units.


The divergence in cylindrical coordinates[edit | edit source]

Let denote a vector field that denotes "flow density". In order to compute the divergence (flow generation density) of , consider an infinitesimal volume defined by all points where , , and . Note that is not a rectangular prism. Let , , and . Let be an arbitrary point from .

The volume of is approximately . The 6 surfaces bounding are described in the following table:

Surface approximate area direction approximate flow density
, ,
, ,
, ,
, ,
, ,
, ,

The total outwards flow through the surface of is:

All relative errors vanish as .

The divergence (flow generation density) is therefore:

a note about the approximations

A reader may wonder why the area of surface , , is approximated by instead of since the difference between and approaches 0 as . While the absolute difference between and approaches 0 as , the difference relative to the infinitesimal does not approach 0: .

With respect to the surface , , , the area can be approximated by , , or since is already a factor, and the differences between , , and relative to the infinitesimal do approach 0 as .


The divergence in spherical coordinates[edit | edit source]

Let denote a vector field that denotes "flow density". In order to compute the divergence (flow generation density) of , consider an infinitesimal volume defined by all points where , , and . Note that is not a rectangular prism. Let , , and . Let be an arbitrary point from .

The volume of is approximately . The 6 surfaces bounding are shown in the following table:

Surface approximate area direction approximate flow density
, ,
, ,
, ,
, ,
, ,
, ,

The total outwards flow through the surface of is:

All relative errors vanish as

The divergence (flow generation density) is therefore:


Divergence free vector fields[edit | edit source]

A vector field for which is a "divergence free" vector field. can also be referred to as "incompressible" (since the flow density of an incompressible fluid is divergence free) or "solenoidal" (since magnetic fields are divergence free).

A key property of a divergence free vector field is that the flux of through a surface is purely a function of the surface's boundary. If and are two surfaces which share the same counterclockwise oriented boundary , then . In other words, the flux is purely a function of . This property can be derived from Gauss's divergence theorem as follows:

To begin, it will be assumed that and do not intersect each other, except for at the common boundary . The argument presented here easily generalizes to cases where and do intersect each other. Invert the orientation of to get and combine and to get a closed surface , stitching the surfaces together along the seam . Let denote the volume which is the interior of , and it will also be assumed that is oriented outwards (which is the case if is "in front" of ).

Gauss's divergence theorem states that . The flux through is the flux through minus the flux through : . Therefore:

The Laplacian Operator[edit | edit source]

Given a scalar field , if the gradient is interpreted as denoting flow density, the rate of flow generation at each point is which is referred to as the "Laplacian" of and is denoted by (or alternately ).

The laplacian is effectively a measure of the "convexity" of at each point . When there is a net flow of the gradient away from , this means that is "low" compared to its neighboring points and that the convexity is positive. When there is a net flow of the gradient towards , this means that is "high" compared to its neighboring points and that the convexity is negative.

In Cartesian coordinates, the Laplacian is:

In cylindrical coordinates, the Laplacian is:

In spherical coordinates, the Laplacian is:

The Laplacian and Vector fields[edit | edit source]

Occasionally, the Laplacian operator is applied to a vector field as opposed to a scalar field. Other than for Cartesian coordinates, the Laplacian cannot be applied directly to each component, as in non-Cartesian coordinate systems, the basis vectors are subject to change. In cylindrical coordinates, basis vectors and are not fixed, and in spherical coordinates, all of the basis vectors , , and are not fixed. This makes determining the Laplacian of a vector field that is expressed using the cylindrical or spherical basis vectors non-trivial. To directly compute the Laplacian, the rates of change of each basis vector with respect to each coordinate should be used. Alternatively, the following identities related to the Laplacian can be used (proofs can be found here):

Given vector fields and , then

Given scalar field and vector field , then

In cylindrical coordinates, and

In spherical coordinates, , and , and

Full Expansions

In Cartesian coordinates where , the Laplacian is:

In cylindrical coordinates where , the Laplacian is:


The Curl and Stokes' Theorem[edit | edit source]

Given a scalar field and a curve with endpoints and , the difference between and is given by the following path integral involving the gradient field : . If is closed (), then . In other words, the "gain" of around a closed curve is always 0. Most vector fields are not the gradient of any scalar field however, and the gain of around a closed curve may not always be 0. This gives rise to the notion of circulation or "curl".

The path integral for a closed curve is the "circulation" of around . Stokes' theorem will show that the circulation around is the total circulation accumulated in the interior of .

Green's Theorem[edit | edit source]

A demonstration of how a large loop can be decomposed into a family of infinitesimal loops.

Quantifying "circulation density" is best introduced in 2 dimensions. Given a large counter-clockwise oriented loop that is confined to 2 dimensions, can be decomposed into a family of infinitesimal loops as shown on the right. Boundaries that are common to adjacent loops cancel each other out due to their opposite orientations, so the total circulation around is the sum of the circulations around each infinitesimal loop.

An infinitesimal rectangular loop.

Consider the infinitesimal rectangle . Let be an arbitrary point inside the rectangle, let and , and let be the counterclockwise boundary of .

The circulation around is approximately (the relative error vanishes as ):

As , the relative errors present in the approximations vanish, and therefore, for an infinitesimal rectangle,

is the "circulation density" at . Let be a counter-clockwise oriented loop with interior . The circulation around loop is the total circulation contained by : . This is Green's theorem.

Stokes' Theorem[edit | edit source]

Stokes' Theorem is effectively a generalization of Green's theorem to 3 dimensions, and the "curl" is a generalization of the quantity to 3 dimensions. An arbitrary oriented surface can be articulated into a family of infinitesimal surfaces, some parallel to the xy-plane, others parallel to the zx-plane, and the remainder parallel to the yz-plane. Let denote an arbitrary vector field.

Let be a surface that is parallel to the yz-plane with counter-clockwise oriented boundary . Green's theorem gives:

is positive if the normal direction to points in the positive x direction and is negative if otherwise. If the normal direction to points in the negative x direction, then is oriented clockwise instead of counter-clockwise in the yz-plane.

Decomposing a 3D loop into an ensemble of infinitesimal loops that are parallel to the yz, zx, or xy planes.

Repeating this argument for being parallel to the zx-plane and xy-plane respectively gives:

and

Treating as an ensemble of infinitesimal surfaces parallel to the yz-plane, zx-plane, or xy-plane gives:

This is Stokes' theorem, and is the "curl" of which generalizes the "circulation density" to 3 dimensions.

The direction of at is effectively an "axis of rotation" around which the counterclockwise circulation density in a plane whose normal is parallel to is . Out of all planes that pass through , the plane whose normal is parallel to has the largest counterclockwise circulation density at which is .

An arbitrary vector field that is differentiable everywhere is considered to be "irrotational" or "conservative" if everywhere, or equivalently that for all continuous closed curves .

The curl in cylindrical coordinates[edit | edit source]

Let denote an arbitrary vector field in cylindrical coordinates. By calculating the circulation densities in surfaces perpendicular to , , and , the curl can be computed:

The curl in spherical coordinates[edit | edit source]

Let denote an arbitrary vector field in spherical coordinates. By calculating circulation densities in surfaces perpendicular to , , and , the curl can be computed:


Irrotational vector fields[edit | edit source]

A vector field for which at all points is an "irrotational" vector field. can also be referred to as being "conservative" since the gain around any closed curve is always 0.

A key property of an irrotational vector field is that the gain of along a continuous curve is purely a function of the curve's end points. If and are two continuous curves which share the same starting point and end point , then . In other words, the gain is purely a function of and . This property can be derived from Stokes' theorem as follows:

Invert the orientation of to get and combine and to get a continuous closed curve , linking the curves together at the endpoints and . Let denote a surface for which is the counterclockwise oriented boundary.

Stokes' theorem states that . The gain around is the gain along minus the gain along : . Therefore:

Summary and Extensions[edit | edit source]

In summary:

The gradient of a scalar field is which denotes the rate of change in in each direction, at each point.
Given an oriented curve which starts at and ends at , the increase in along is: (the gradient theorem)
If a vector field denotes "flow density", then the divergence is which denotes the density of "flow generation" at each point.
Given a volume with outwards oriented surface , the total flow being generated inside is: (Gauss's divergence theorem)
The curl of a vector field is which denotes the "circulation density" at each point.
Given an oriented surface with a counter-clockwise oriented boundary , the total circulation present in is: (Stokes' theorem)


Extending the gradient theorem[edit | edit source]

The gradient theorem states that given an everywhere differentiable scalar field and a continuous oriented curve with endpoints and , that . This theorem can be extended to equate a surface integral with a volume integral, as opposed to equating a difference with a path integral.

Let be an arbitrary volume with outwards oriented surface . Let be an arbitrary line segment parallel to the x-axis that is completely contained by and that starts and ends on the surface of . Let this line segment have an infinitesimal cross-sectional area of . The volume integral of over is: . Let and be the infinitesimal surface portions of formed when intersects at and respectively. The x-component of and is and respectively. Adding up all possible line segments gives:

where is the x-component of the differential .

Repeating for the y-axis and z-axis gives:

where is the y-component of the differential .

where is the z-component of the differential .

This yields:

and hence:

The above integral equation is effectively a generalization of the gradient theorem.


Extending Stokes' Theorem[edit | edit source]

Stokes' theorem states that given an everywhere differentiable vector field and an oriented surface with counterclockwise boundary , that . This theorem can be extended to equate a surface integral with a volume integral, as opposed to equating a path integral with a surface integral.

Let be an arbitrary volume with outwards oriented surface . Let be arbitrary, and let be the cross-section of in the plane . Let be the counter-clockwise boundary of (the surface normal vectors of point in the positive x-direction). Green's theorem gives:

Now let the cross-section have an infinitesimal thickness , forming the volume slice . Let denote the infinitesimal strip of that wraps the cross-section ( is similar to except that is a surface with a non-zero infinitesimal width). Let be an infinitesimal portion of . Ignoring the component of that is parallel to , denotes a thin strip of surface that wraps around , and is parallel to the x-axis. The counterclockwise displacement along the boundary of manifested by is . Substituting into the path integral around gives:

Integrating over all cross-sections gives:

Repeating the above argument for the y-axis and z-axis gives:

This yields:

The above integral equation is effectively a generalization of Stokes' theorem.