Calculus/Chain Rule
The chain rule is a method to compute the derivative of the functional composition of two or more functions.
If a function, f, depends on a variable, u, which in turn depends on another variable, x, that is f = y(u(x)) , then the rate of change of f with respect to x can be computed as the rate of change of y with respect to u multiplied by the rate of change of u with respect to x.
If a function f is composed to two differentiable functions y(x) and u(x), so that f(x) = y(u(x)), then f(x) is differentiable and, |
The method is called the "chain rule" because it can be applied sequentially to as many functions as are nested inside one another.^{[1]} For example, if f is a function of g which is in turn a function of h, which is in turn a function of x, that is
,
the derivative of f with respect to x is given by
and so on.
A useful mnemonic is to think of the differentials as individual entities that can be canceled algebraically, such as
However, keep in mind that this trick comes about through a clever choice of notation rather than through actual algebraic cancellation.
The chain rule has broad applications in physics, chemistry, and engineering, as well as being used to study related rates in many disciplines. The chain rule can also be generalized to multiple variables in cases where the nested functions depend on more than one variable.
Contents
Examples[edit]
Example I[edit]
Suppose that a mountain climber ascends at a rate of 0.5 kilometer per hour. The temperature is lower at higher elevations; suppose the rate by which it decreases is 6 °C per kilometer. To calculate the decrease in air temperature per hour that the climber experiences, one multiplies 6 °C per kilometer by 0.5 kilometer per hour, to obtain 3 °C per hour. This calculation is a typical chain rule application.
Example II[edit]
Consider the function f(x) = (x^{2} + 1)^{3}. It follows from the chain rule that
Function to differentiate | |
Define u(x) as inside function | |
Express f(x) in terms of u(x) | |
Express chain rule applicable here | |
Substitute in f(u) and u(x) | |
Compute derivatives with power rule | |
Substitute u(x) back in terms of x | |
Simplify. |
Example III[edit]
In order to differentiate the trigonometric function
one can write:
Function to differentiate | |
Define u(x) as inside function | |
Express f(x) in terms of u(x) | |
Express chain rule applicable here | |
Substitute in f(u) and u(x) | |
Evaluate derivatives | |
Substitute u in terms of x. |
Example IV: absolute value[edit]
The chain rule can be used to differentiate , the absolute value function:
Function to differentiate | |
Equivalent function | |
Define u(x) as inside function | |
Express f(x) in terms of u(x) | |
Express chain rule applicable here | |
Substitute in f(u) and u(x) | |
Compute derivatives with power rule | |
Substitute u(x) back in terms of x | |
Simplify | |
Express as absolute value. |
Example V: three nested functions[edit]
The method is called the "chain rule" because it can be applied sequentially to as many functions as are nested inside one another. For example, if , sequential application of the chain rule yields the derivative as follows (we make use of the fact that , which will be proved in a later section):
Original (outermost) function | |
Define h(x) as innermost function | |
g(h) = sin(h) as middle function | |
Express chain rule applicable here | |
Differentiate f(g)^{[2]} | |
Differentiate g(h) | |
Differentiate h(x) | |
Substitute into chain rule. |
Chain Rule in Physics[edit]
Because one physical quantity often depends on another, which, in turn depends on others, the chain rule has broad applications in physics. This section presents examples of the chain rule in kinematics and simple harmonic motion. The chain rule is also useful in electromagnetic induction.
Physics Example I: relative kinematics of two vehicles[edit]
For example, one can consider the kinematics problem where one vehicle is heading west toward an intersection at 80 miles per hour while another is heading north away from the intersection at 60 miles per hour. One can ask whether the vehicles are getting closer or further apart and at what rate at the moment when the northbound vehicle is 3 miles north of the intersection and the westbound vehicle is 4 miles east of the intersection.
Big idea: use chain rule to compute rate of change of distance between two vehicles.
Plan:
- Choose coordinate system
- Identify variables
- Draw picture
- Big idea: use chain rule to compute rate of change of distance between two vehicles
- Express c in terms of x and y via Pythagorean theorem
- Express dc/dt using chain rule in terms of dx/dt and dy/dt
- Substitute in x, y, dx/dt, dy/dt
- Simplify.
Choose coordinate system: Let the y-axis point north and the x-axis point east.
Identify variables: Define y(t) to be the distance of the vehicle heading north from the origin and x(t) to be the distance of the vehicle heading west from the origin.
Express c in terms of x and y via Pythagorean theorem:
Express dc/dt using chain rule in terms of dx/dt and dy/dt:
Apply derivative operator to entire function | |
Sum of squares is inside function | |
Distribute differentiation operator | |
Apply chain rule to x(t) and y(t)} | |
Simplify. |
Substitute in x = 4 mi, y = 3 mi, dx/dt = −80 mi/hr, dy/dt = 60 mi/hr and simplify
Consequently, the two vehicles are getting closer together at a rate of 28 mi/hr.
Physics Example II: harmonic oscillator[edit]
If the displacement of a simple harmonic oscillator from equilibrium is given by x, and it is released from its maximum displacement A at time t = 0, then the position at later times is given by
where ω = 2 π/T is the angular frequency and T is the period of oscillation. The velocity, v, being the first time derivative of the position can be computed with the chain rule:
Definition of velocity in one dimension | |
Substitute x(t) | |
Bring constant A outside of derivative | |
Differentiate outside function (cosine) | |
Bring negative sign in front | |
Evaluate remaining derivative | |
Simplify. |
The acceleration is then the second time derivative of position, or simply dv/dt.
Definition of acceleration in one dimension | |
Substitute v(t) | |
Bring constant term outside of derivative | |
Differentiate outside function (sine) | |
Evaluate remaining derivative | |
Simplify. |
From Newton's second law, F = ma, where F is the net force and m is the object's mass.
Newton's second law | |
Substitute a(t) | |
Simplify | |
Substitute original x(t). |
Thus it can be seen that these results are consistent with the observation that the force on a simple harmonic oscillator is a negative constant times the displacement.
Chain Rule in Chemistry[edit]
The chain rule has many applications in Chemistry because many equations in Chemistry describe how one physical quantity depends on another, which in turn depends on another. For example, the ideal gas law describes the relationship between pressure, volume, temperature, and number of moles, all of which can also depend on time.
Chemistry Example I: Ideal Gas Law[edit]
Suppose a sample of n moles of an ideal gas is held in an isothermal (constant temperature, T) chamber with initial volume V_{0}. The ideal gas is compressed by a piston so that its volume changes at a constant rate so that V(t) = V_{0} - kt, where t is the time. The chain rule can be employed to find the time rate of change of the pressure.^{[3]} The ideal gas law can be solved for the pressure, P to give:
where P(t) and V(t) have been written as explicit functions of time and the other symbols are constant. Differentiating both sides yields
where the constant terms, n, R, and T, have been moved to the left of the derivative operator. Applying the chain rule gives
where the power rule has been used to differentiate 1/V, Since V(t) = V_{0} - kt, dV/dt = -k. Substituting in for V and dV/dt yields dP/dt.
Chemistry Example II: Kinetic Theory of Gases[edit]
A second application of the chain rule in Chemistry is finding the rate of change of the average molecular speed, v, in an ideal gas as the absolute temperature T, increases at a constant rate so that T = T_{0} + at, where T_{0} is the initial temperature and t is the time.^{[3]} The kinetic theory of gases relates the root mean square of the molecular speed to the temperature, so that if v(t) and T(t) are functions of time,
where R is the ideal gas constant, and M is the molecular weight.
Differentiating both sides with respect to time yields:
Using the chain rule to express the right side in terms of the with respect to temperature, T, and time, t, respectively gives
Evaluating the derivative with respect to temperature, T, yields
Evaluating the remaining derivative with respect to T, taking the reciprocal of the negative power, and substituting T = T_{0} + at, produces
Evaluating the derivative with respect to t yields
which simplifies to
Proof of the chain rule[edit]
Suppose is a function of which is a function of (it is assumed that is differentiable at and , and is differentiable at .
To prove the chain rule we use the definition of the derivative.
We now multiply by and perform some algebraic manipulation.
Note that as approaches 0, also approaches 0. So taking the limit as of a function as approaches 0 is the same as taking its limit as approaches 0. Thus
So we have
Exercises[edit]
References[edit]
- ↑ http://www.math.brown.edu/help/derivtips.html
- ↑ The derivative of is ; see Calculus/Derivatives of Exponential and Logarithm Functions.
- ↑ ^{a} ^{b} University of British Columbia, UBC Calculus Online Course Notes, Applications of the Chain Rule, http://www.ugrad.math.ubc.ca/coursedoc/math100/notes/derivative/chainap.html Accessed 11/15/2010.