Mathematical Proof/Print version
|This is the print version of Mathematical Proof
You won't see this message or any elements not part of the book's content when you print or preview this page.
Table of Contents
- Methods of Proof
- Proof and Computers
If you've ever taken a math class or talked to a mathematician, you know that we don't talk like normal people do--we have our own language that's 2-bit encoded. For example, when a waiter says "Would you like corn or beans with your dinner?" you know that he wants the answer of "corn" or "beans" but not "both." Well, logically, if you ask the question "Would you like corn or beans with your dinner?" the answer could be "yes."
This chapter will acquaint you with logical statements, logical reasoning, and the cryptic language of mathematicians.
As discussed in the introduction, logical statements are different from common English. We will discuss concepts like "or," "and," "if," "only if." (Here I would like to point out that in most mathematical papers it is acceptable to use the term "we" when referring to oneself. This is considered polite by not commanding the audience to do something nor excluding them from the discussion.)
Truth and Statements
A truth statement is one that is either true or false, not neither, and not both. Therefore, the sentence "This sentence is false." is not a truth statement because its truth value cannot be determined. However, the sentence "All people are cows." is a truth statement because its truth value can be determined, and is clearly false, since there are some people that are not cows. In mathematics, normally this phrase is shortened to statement to achieve conciseness and to avoid confusion.
The truth value of a statement is an evaluation of whether the statement is true (sometimes also referred or abbreviated as 1 or T) or false (sometimes referred also as 0 or F). Thus, the truth value of "2 < 3" is true and "4 > 5" is false.
In discussing logic and statements, it is common to use the letters P and Q as variables to denote statements. Throughout this section, this convention will be followed. We will now discuss logical connectors, or conjunctions that connect statements.
The connector "or" is used to connect two statements and make a third statement whose truth value is based on the first two. The statement "P or Q" is true if either P or Q is true, or if both of them are true. Therefore, the statement, "The internet is complex or math is exciting." is true, given that either or both statements are true. Another example, the statement "7 > 9 or 2 < 3" is true, because one of the connected statements is true. The only time the statement "P or Q" is false is when both P and Q are false; an example of a statement where both P and Q are false is the following "All people are American or all Americans are tall". Clearly, the statement "All people are American" is false, and is false also the statement "all Americans are tall".
"Or" is sometimes denoted with the symbol . So means "P or Q." This is also sometimes referred to as disjunction.
"And" is used to connect two statements, just as "or"; however, "P and Q" is only true when both P and Q are true, and false otherwise. The statement "The author of this article likes cheesecake and broccoli." is true since the statements "The author of this article likes cheesecake." and "The author of this article likes broccoli." are both true. Of course, it doesn't mean that I like them together on the same plate or in my mouth at the same time as one might infer.
Another example might be the statement "3<6 and 4<6", which is true, since both statements "3<6" and "4<6" are true. The statement "3<6 and 4>6" is not true since "4>6" is false.
"And" is sometimes denoted as . means "P and Q." This is referred to as conjunction.
Implications and Conditional Statements
This is another class of logical connectors, like "and" and "or." The three implication types are "if" , "only if" , and "if and only if" or "iff" . See also Necessary and sufficient conditions. The vast majority of mathematical proofs deal with statements such as (P only if Q) or (P if and only if Q).
To say that "Q is necessary for P" means that P cannot exist without Q. Another way of saying this is "P only if Q" or "if P then Q". The notation for this is (P implies Q).
The statement "Sunday is necessary for Easter" is true because if it is Easter, then it must be also Sunday. Yet, the statement "Water is necessary for humans" is, in a logical sense, false because something that is a human is probably not water as well. (However, the statement "Having water is necessary for humans" is true, because to be a human being something must contain water.) The statement "P only if Q" is false only when P is true and Q is false, because Q is a necessary condition (it must be true) for P to be true.
To say that "P is sufficient for Q" means "P cannot exist without Q" or "if P then Q". The notation for this is . This statement is only false when P is true and Q is false. This is the same statement as , but the roles are reversed; the former, however, is more common.
The interesting thing about an implication (an if-then statement) is that in the case that P is false, is always true. Thus, "If the world is flat, then the Sun does not exist" is a true statement. This kind of a statement is called vacuously true because Q can be replaced with any statement and the implication is still true. In fact, the mathematician Bertrand Russell boasted, "Give me any false statement and any other statement to prove and I will prove it". Given the statement "1 = 2" he was asked to prove that he was God. "Consider the set ", he replied, "since 1 = 2, the two elements are one, and therefore Russell = God".
"If" and "only if" statements are called conditional statements. These two are related, in fact they are the converse of each other. Converse simply means to switch the direction of the arrow. (The converse of is .)
Necessary and Sufficient
To say that "P is necessary and sufficient for Q" means "P is true exactly when Q is true" or "P implies Q and Q implies P". This means that P and Q always have the same truth value—that is, they are both true or both false simultaneously. In mathematical symbols this is expressed as . This is referred to as a biconditional. When is a true statement, we say that P and Q are logically equivalent.
We have explored truth statements and logical connectors used to create new statements. We will now look at two more variations of logical statements.
Negation refers to toggling a truth statement's truth value. The result is not always negative, or false, it is only the opposite of the original. Thus, the negation of "false" is "true." This is denoted by a tilde (~) or the negation symbol .
It will be shown below in a truth table that the negation of the statement , which is , is equivalent to . Thus, to use loose terminology, "not or" is "and" (similarly, "not and" is "or").
The contrapositive is related to negation. However, it is only defined for a conditional statement. It basically means negate both P and Q and switch the direction of the conditional. That is, the contrapositive of is ; or, equivalently, . It will be shown below that the contrapositive of a statement is equivalent to the statement itself.
A truth table is helpful in visualizing all of these logical relations. The table is filled in by considering all possible combinations of true and false for P and Q and then filling in the results for the various connectors mentioned above.
The Or and And Table
If, Only if, and If and only if Table
Note that the column can be obtained from . Truth tables are very useful in helping beginners understand logical relationships.
Negation and Contrapositive Table
In the following exercises, P, Q, R, and S will represent truth statements.
A. Construct the truth tables for the following statements, and give their converse and contrapositive:
B. Negate the following statements:
While a comprehensive list of notation is included in the appendix, that is meant mostly as a reference tool to refresh the reader of what notation means. This section is to introduce the notation to the reader and explain its usage.
Basic Set Theory
This is not meant to be a comprehensive or rigorous definition of set theory. We will define a minimal amount of set-theoretical objects, so that the concept of mathematical thinking can be understood. In this book, we will use capital letters for sets and lowercase letters for elements of sets. This convention is not to discourage creativity or to bore your socks off, but to avoid confusion.
An axiom is something that is assumed, or believed to be true. It is where mathematical proof starts; you cannot prove the axioms, you merely believe them and use them to prove other things. There are different sets of axioms, the most current and widely-used being Zermelo–Fraenkel set theory. I will give a list of axioms here that will suffice for the studies in this book.
- A set exists. (A set, for our purposes, will be a collection of objects that we will call elements. A set is said to contain its elements, and the elements are said to be contained in the set.)
- An empty set exists. This set will be denoted as and will contain no elements.
- Two sets are equal if and only if they contain the same elements.
- If and are sets, then there is a set containing only the elements of and . This is called the union of and .
- If is a set and is a truth statement defined for all contained in , then there is a set so that is in whenever is true.
- The set of counting numbers exists; or, an infinite set exists.
Some of these are worded rather formally, which is a tendency for mathematicians. First of all, let's explain why we need these axioms.
The first axiom says "a set exists." So you might ask—don't we know that it exists? Can't we just define it to exist? The answer is, yes, and that's why it's an axiom. Axioms are supposed to be self-evident truths. Now that we've established the fact that sets exist, why would we want one with no elements? Well, the empty set turns out to be both a very useful and a very annoying set. You'll learn to be good friends with the empty set by the end of the text.
Axiom 3 could be considered a definition, rather than an axiom, of what we mean when we say that two sets are equal. Axiom 4 just says that if we have two sets then we can get a new set with all of those elements in it. For example, the set of all people and all dogs.
The fifth axiom is probably the most confusing. All it says is that if we have a set and we want to pick out certain elements, we can do that. For example, out of the set of all integers, we can choose the ones that are even, or the ones that are positive, or the ones that are perfect squares.
Finally, the infinity axiom is nice because we will do lots of things with infinite sets.
As mentioned above, a set will be a collection of elements. For example, let be the set of all cheesecakes, and let be the set of all things that are chocolate. Mathematically, this would be denoted as:
The vertical bar | is read "such that". We can select elements in that way by using Axiom 5 from above. In the case of , the predicate (truth statement) used to select elements is:
Note that we have implicitly assumed the existence of a universal set of all elements over which we are making a selection. In the example above, this universal set could be the set of all pastries. In general, if a universal set is not specified, we shall assume that we are talking about the real numbers . Thus, can be read as: " is the set of all real numbers strictly greater than ."
To say that is an element of is equivalent to saying that contains . These concepts are notated mathematically by (x belongs to the set A) and . If is not a member of then we write (x does not belong to the set A).
The union of and is defined in axiom 4. It consists of all the elements in and and is denoted . We can also use the | to say
The intersection of and is the set containing all things that are in both and . The notation for this is .
If then and are said to be disjoint. This means that there is nothing that is in both sets. For example, if is the set of all even integers and is the set of all odd integers, then they are disjoint.
Notice that the logical connectors coincide with the set operators . This is deliberate, since the concepts are related. This is obvious when the two symbols are juxtaposed:
Quantifiers are used to establish what elements are currently being discussed. They are like adjectives in English—they tell how much or what kind of thing we're talking about.
The most common quantifier is for all. This is written mathematically as . It is also "for each" or "for every." It is used to make statements like "All humans have eyeballs." That is, if is the set of all humans and is the set of all things with eyeballs, then
which is read "For all in , is in ." This introduces a special relationship between sets that is called a subset. In this case, is a subset of since every element of is also in . (For the sake of the logical argument, just assume there aren't any people without eyeballs.) We write this as
This notation, , is ambiguous because some authors use it to mean just a subset, while others use it to mean a proper subset (meaning there is an element of that is not in , and thus and are not equal) and use to denote that is a subset of . In this book we will go with the convention that could mean that is a proper subset of or that = , and we will use when we wish to emphasize that .
This is probably as common as for all and is equally useful. Its mathematical symbol is . It is almost always followed by a "such that" statement. For example, "There exists a computer that has 8GB of RAM." and are often used in pairs, such as, "Everyone has a mother.", or, worded logically, "For each human, there exists a mother for that human.". Let H be the set of all humans and M be the set of all mothers. Then we have
or, when and are understood,
This quantifier is also read as there is or there are. To signify that there is only one of something, we say "There exists a unique..." and place an exclamation point after the exists symbol: .
In the same way that "not and" gives "or", "not for all" gives "there exists." That is, the opposite of the statement "All cheesecakes are chocolate." is "There is a cheesecake that is not chocolate." In logical terms,
The above statement is read "the negation of 'For all x in A, P(x) is true.' is 'There is an x in A such that P(x) is false."
As we have seen, such that can be used in at least two cases: in conjunction with there exists and in picking out elements of a set. Of course, if you think about it, these two are really the same application because the statement "there exists" gives you the set of all things that exist, and the such that statement decreases the size of that set to focus just on the things in which you are interested. Such that is normally denoted as a colon(:) or as a vertical bar (|), and sometimes as "s.t."
This is a very helpful phrase in making proofs more concise and less redundant. For example, assume we have two integers x and y and that we know one of them is odd and one of them is even. Instead of trying to do two different parallel proofs, one where we assume that x is even and y is odd and another where we assume that y is even and x is odd, we simply say "Without loss of generality, assume x is even." Then we continue with the proof. This is done since the exact same argument applies if y actually was the even number, all we have to do is relabel x and y.
Now, we're not going to begin a discussion of astronomy. In mathematics, the universe is overall, biggest set that your discussion is referring to. For example, if the universe is not restricted, then the set of all things would truly be the set containing every single thing. However, if your universe is the set of all things on Earth, then the "set of all things" would not include Jupiter, since Jupiter is not on Earth.
In arithmetic, difference means the distance between two numbers—how far apart they are on the number line. In set theory, difference means something slightly different, but the same notation is used. ( denotes the difference of A and B.) The difference is the set of all things that are in A that are not in B.
In normal English, we use this concept when we say things like "Everyone with no demerits gets an A in this course."
The complement of a set contains everything that is not in the original set. This definition only makes sense when a universe is understood. The complement is usually denoted by . If U is the universe, then the complement of A is defined to be .
The diagram to the right is a Venn diagram. A Venn diagram shows the relationships of sets. Note that in the figure, U is the universe, and A and B are sets in U. The blue portion is the complement of . This is a general drawing, since it is not known whether there are elements in A and B. If is known to be empty, then they may be drawn disjoint.
The complement of a set is essentially the same as the negation of a statement. That is,
if , then .
Thus, complements are used when saying what something is not.
- Express the following statements in terms of sets, using difference or complement.
- All people that have two legs.
- All mythological creatures that are not Greek.
- All pudding pies that have no cream on top.
- Draw a Venn Diagram to illustrate the following.
- Negate the following statements.
- All quick, brown foxes jump over some lazy dog.
Methods of Proof
There are many different ways to prove things in mathematics. This chapter will introduce some of those methods.
A constructive proof is the most basic kind of proof there is. It is a proof that starts with a hypothesis, and a person uses a series of logical steps and a list of axioms, to arrive at a conclusion.
Parts of a theorem
A theorem is a proven statement that was constructed using previously proven statements, such as theorems, or constructed using axioms. Some theorems are very complicated and involved, so we will discuss their different parts.
The hypothesis is the "if" statement of a theorem. In a way, it is similar to an axiom because it is assumed to be true in order to prove a theorem. We will consider a simple example.
- Theorem 2.1.1. If A and B are sets such that and , then A=B.
In this theorem, the hypothesis is everything before the word "then." This is a very simple proof. We need to prove that for every x, . For the purpose of analyzing proofs, we will define and . The most common way to prove an "if and only if" statement is to prove necessity and sufficiency separately
- So we start by showing that Therefore, we assume that P(x) is true. That is, Since we assumed, by hypothesis, that , we know that , which means that Q(x) is true.
- Now we show that , so we assume that Q(x) is true. This means that . Since we know that we know that so P(x) is true.
By these two conclusions, we see that
Now, by axiom 3, A=B, since This concludes the proof. This is a very trivial proof, but its point was to show how to use a hypothesis or set of hypotheses in order to reach the desired conclusion. This method here is the most common in proving that two sets are equal. You prove that each set is a subset of the other.
The part of the theorem after the word "then" is called the conclusion. The proof of a theorem is merely the logical connection between the hypothesis and the conclusion. Once you've seen and proved a few theorems, a conclusion is almost predictable. For example, what conclusion would you naturally draw from the following two statements?
- All Americans are people.
- All people live on Earth.
These two statements are the hypotheses. To word this as a theorem, we would have "If all Americans are people and all people live on Earth, then all Americans live on Earth." This statement is what most people would call completely obvious and requires no proof. However, to show how this concept is applied in mathematics, we will abstract this theorem and prove it.
- Theorem 2.1.2. If and , then .
To see how this relates to our problem, let A be the set of all Americans, B the set of all people, and C the set of all things that live on Earth.
To show that , we need to show that So we suppose By hypothesis, so Also by hypothesis, , so Since this was true for any arbitrary we have shown that
While a definition is not usually part of a theorem, they are commonly introduced immediately before a theorem, in order to make the theorem make sense or to help in proving it.
- Definition 2.1.3. If a set A has only finitely many elements, then the order of A, denoted by |A|, is the number of elements in A.
This definition gives meaning to the following theorem.
- Theorem 2.1.4. If A and B are finite sets such that A = B, then |A|=|B|.
Here we take advantage of the fact that A is a finite set. Let n be the integer such that |A| = n. Then index the elements of A so that Now , we have . So we see that B has at least n elements, that is Also, every element of B is in A (by hypothesis), so it follows that there are no more elements in B than there are in A, so , thus |B| = n = |A|, which concludes the proof.
There are different terms that mathematicians like to use in stating mathematical results. Theorem is probably the most common and well-known, especially to non-mathematicians. There are, however, a couple other main terms used in the mathematical world. For lack of a better term, I have lumped them together in the category of "pseudo-theorems", since they are like theorems, but are different.
A lemma is a "small theorem." When a result is less profound, more trivial, or boring, it can be called a lemma. A lemma is also used to make the proof of a theorem shorter. That is, if a chunk of a proof can be pulled off and proved separately, then it is called a lemma and the proof of the theorem will say something to the effect of "as proved in the lemma."
For example, the following lemma will help to make the proof of Theorem 2.1.4 more concise.
- Lemma 2.1.5. If A and B are finite sets and then .
As you might guess, this is one motivation for the use of the symbol , since it is similar in appearance to <.
Let n = |A|. Then number the elements of A, so Then for each i from 1 to n we see that , which means that B has at least n different elements, or that which is what we were trying to prove.
Now if we use this lemma twice on Theorem 2.1.4, we will get a very brief proof. Since we know that Also, since , we see that Now we use a fact about numbers, that if and , it must follow that x = y.
A corollary is similar to a lemma in that it is usually a small and not as important as a theorem. However, a corollary is usually a result that follows almost immediately from a theorem.
Assume that we had proved the theorem that "All people are pigs." Then a corollary would be "People who have two legs are pigs." which clearly follows from the first result. A slightly more interesting corollary would be "People can be sold for bacon when they die." since it is common knowledge that bacon comes from pigs.
So we see that a corollary is something that follows from a preceding theorem with minimal argument to support it. It is either "common sense" or "obvious" to the reader that it is a direct consequence of the theorem.
- Prove that the following sets are equal. Verify it with a truth table or a Venn Diagram. You may assume that A, B, and C are nonempty sets. Also assume that U is the universe.
- Prove that if A and B are finite sets then and that equality holds when
Something to think about
- We have defined the order or size of a set for a finite set. Would it make sense to define this order for an infinite set? How would you tell whether two infinite sets are the same size?
- If you know that can you show that
Proof by Contrapositive
The contrapositive of a statement negates the conclusion as well as the hypothesis. It is logically equivalent to the original statement asserted. Often it is easier to prove the contrapositive than the original statement. This section will demonstrate this fact.
The idea of the contrapositive is proving the statement "There is no x such that P(x) is false." as opposed to "P(x) is true for every x."
This is not to be confused with a Proof by Contradiction. If we are trying to prove the statement , we can do it constructively, by assuming that P is true and showing that the logical conclusion is that Q is also true. The contrapositive of this statement is , so we assume that Q is false and show that the logical conclusion is that P is also false. However, in a proof by contradiction, we assume that P is true and Q is false and arrive at some sort of illogical statement such as "1=2."
A proof revisited
The most basic example would be to redo a proof given in the last section. We proved Theorem 2.1.4 to be true by the constructive method. Now we can prove the same result using the contrapositive method. For ease of reading, I will change the wording of the theorem.
- Theorem 2.2.1 (also Theorem 2.1.4). If A and B are finite sets and , then .
The wording is probably more natural in the 2.1.4 version, but this shows how the proof is to be done. We assume that we have two finite sets A and B and that they do not have the same number of elements. So let and . Then, number the elements in A and B, so and . Since , either or . Without loss of generality, we assume that . Consider the set . Since A has only n elements, we can take out at most n elements from B, leaving at least m-n elements in B-A. This shows that there is at least one element in B that is not in A, therefore .
I'm sure we all know how to do arithmetic already, but mathematicians like to be "rigorous." That means that we like to have clear definitions of everything we use.
- Axiom 7. The numbers 0 and 1 exist.
- Definition 2.2.2. The operator + is defined so that 1+0=1 and is an infinite set. We write the elements of this set as and define .
- Definition 2.2.3. The operator is defined so that and We also define
The above definitions are just formalities. We know what numbers are and how they work. This is just a mathematical definition. Notice that only integer multiplication has been defined. Now we need one more definition to prove the following theorem.
- Definition 2.2.4. An integer is said to be even if it is a multiple of two. That is, 'n' is even if n = 2k for some integer k. If this is not true, then it is said to be odd. The property of whether a number is even or odd is its parity.
- Theorem 2.2.5. If x and y are integers such that is odd, then
To prove this by contrapositive, we assume that and show that is even. If , then , which by Definition 2.2.3 is , so is a multiple of 2 and is therefore even.
Proofs by contrapositive are very helpful in proving biconditional statements. Recall that a biconditional is of the form (P if and only if Q). To prove a biconditional we need to prove that and However, if we use the contrapositive, we can show and
Prime numbers are very interesting in number theory, but also in arithmetic. In fact, the Fundamental Theorem of Arithmetic has to do with primes. Here we will not give a proof, just a statement of the the theorem.
- Definition 2.2.6. A prime number is a positive integer that has no multiple other than 1 and itself. A number that is not prime is called composite.
- Theorem 2.2.7 (Fundamental Theorem of Arithmetic). Every integer has a unique factorization of primes, excluding reorderings.
This theorem will be very useful in the proof of the next theorem.
- Theorem 2.2.8. An integer n is even if and only if its square is even.
- First we do a constructive proof. Suppose that n is an even integer. Then by defintion of even, for some integer k. Then . Since is an integer because it is the product of integers, we see that is even. This shows the "only-if" part of the theorem.
- To show the "if" part, we use a proof by contrapositive. Assume that n is not even, or that n is odd. Let be the prime factorization of n. Then none of the are 2 for any i. We consider the square and notice that there are no multiples that are equal to 2, so we conclude that is odd. This proves the theorem.
- Prove the following by contrapositive:
- An integer n is even if and only if n+1 is odd.
- If n and m have the same parity then n+m is even.
Proof by Contradiction
The method of proof by contradiction is to assume that a statement is not true and then to show that that assumption leads to a contradiction. In the case of trying to prove this is equivalent to assuming that That is, to assume that is true and is false. This is known by its latin reductio ad absurdum (reduction to absurdity), since it ends with a statement that cannot be true.
Square root of 2
A good example of this is by proving that is irrational. Proving this directly (via constructive proof) would probably be very difficult (if not impossible). However, by contradiction we have a fairly simple proof.
- Proposition 2.3.1.
Proof: Assume is rational. Then , where and are relatively prime integers (a and b have no common factors). and . So
But since is even, must be even as well, since the square of an odd number is also odd. Then we have , or
The same argument can now be applied to to find . However, this contradicts the original assumption that a and b are relatively prime, and the above is impossible. Therefore, we must conclude that is irrational.
Of course, we now note that there was nothing in this proof that was special about 2, except the fact that it was prime. That's what allowed us to say that was even since we knew that was even. Note that this would not work for 4 (mainly because ) because does not imply that
In English, the procedure is this: Assert that a statement is false, and then prove yourself wrong. (Thereby proving the original statement was true.) This is a form of Modus Tollens.
For many students, the method of proof by contradiction is a tremendous gift and a trojan horse, both of which follow from how strong the method is. In fact, the apt reader might have already noticed that both the constructive method and contrapositive method can be derived from that of contradiction.
However, its reach goes farther than even that, since the contradiction can be anything. Even if we ignore the criticisms from constuctivism, this broad scope hides what you lose; namely, you lose well-defined direction and conclusion, both of which have to be replaced with intuition.
Lastly, even in nonconstructive company, using the method in the first row of the table above is considered bad form (that is, proving something by pseudo-constructive proof), since the proof-by-contradiction part of it is nothing more than excess baggage.
- It is a simple exercise to see that any rational number may be written in this form.
Proof by Induction
The beauty of induction is that it allows a theorem to be proven true where an infinite number of cases exist without exploring each case individually. Induction is analogous to an infinite row of dominoes with each domino standing on its end. If you want to make all the dominoes fall, you can either:
- push on the first one, wait to see what happens, and then check each domino afterwards (which may take a long time if there's an infinite number of dominoes!)
- or you can prove that if any domino falls, then it will cause the domino after it to fall. (i.e. if the first one falls then the second one will fall, and if the second one falls then the third one will fall, etc.)
Induction, essentially, is the methodology outlined in point 2.
Parts of Induction
Induction is composed of three parts:
- The Base Case (in the domino analogy, this shows the first domino will fall)
- The Induction Hypothesis (in the domino analogy, we assume that a particular domino will fall)
- The Inductive Step (in the domino analogy, we prove that the domino we assume will fall will cause the next domino to fall)
Weak induction is used to show that a given property holds for all members of a countable inductive set, this usually is used for the set of natural numbers.
Weak induction for proving a statement (that depends on ) relies on two steps:
- is true for a certain base step. Usually the base case is or
- . That is, given that is true, is also true.
If these two properties hold, one may induce that the property holds for all elements in the set in question. Returning to the example, if you are sure that you called your neighbor, and you knew that everyone who was called in turn called his/her neighbor, then you would be guaranteed that everyone on the block had been called (assuming you had a linear block, or that it curved around nicely).
The first example of a proof by induction is always 'the sum of the first n terms:'
- Theorem 2.4.1. For any fixed
- Base step: , therefore the base case holds.
- Inductive step: Assume that . Consider .
So the inductive case holds. Now by induction we see that the theorem is true.
Reverse induction is a seldom-used method of using an inductive step that uses a negative in the inductive step. It is a minor variant of weak induction. The process still applies only to countable sets, generally the set of whole numbers or integers, and will frequently stop at 1 or 0, rather than working for all positive numbers.
Reverse induction works in the following case.
- The property holds for a given value, say .
- Given that the property holds for a given case, say , Show that the property holds for .
Then the property holds for all values .
I learnt today from http://people.math.carleton.ca/~ckfong/hs14a.pdf (see example there) that reverse indcution is also usable in the general case: "to establish the validity of a sequence of propositions Pn (n ≥ 1), it is enough to establish the following
(a) Pn is valid for inﬁnitely many n.
(b) If Pn+1 is valid, then so is Pn.
It can be the case that we can easily prove P1 and if P for n=m so P for n=2m. In this case we have (a) for the infinitely many n = 2 exp k (for k >= 0).
In weak induction, for the inductive step, we only required that for a given , its immediate predecessor () satisfies the theorem (i.e., is true). In strong induction, we require that not only the immediate predecessor, but all predecessors of satisfy the theorem. The variation in the inductive step is:
- If is true for all then is true.
The reason this is called strong induction is fairly obvious--the hypothesis in the inductive step is much stronger than the hypothesis is in the case of weak induction. Of course, for finite induction it turns out to be the same hypothesis, but in the case of transfinite sets, weak induction is not even well-defined, since some sets have elements that do not have an immediate predecessor.
Used in proving theorems involving transfinite cardinals. This technique is used in set theory to prove properties of cardinals, since there is rarely another way to go about it.
We first define the notion of a well-ordered set. A set is well-ordered if there is a total order < on and that whenever is non-empty, there is a least-element in . That is, such that .
An inductive set is a set such that the following hold:
- (where is the least element of )
- If then such that
Of course, you look at that and say "Wait a minute. That means that !" And, of course you'd be right. That's exactly why induction works. The principle of induction is the theorem that says:
- Theorem 2.4.2. If is a non-empty well-ordered set and is an inductive subset of then .
The proof of this theorem is left as a very simple exercise. Here we note that the set of natural numbers is clearly well-ordered with the normal order that you are familiar with, so is an inductive set. If you accept the axiom of choice, then it follows that every set can be well-ordered.
A proof by counterexample is not technically a proof. It is merely a way of showing that a given statement cannot possibly be correct by showing an instance that contradicts a universal statement. For example, if you are trying to prove the statement "All cheesecakes are baked in Alaska." and you did not know whether to prove it by contrapositive or contradiction, all I would have to do is bake a cheesecake right in front of you here in Texas and then you would know that your efforts had been in vain.
Consider the following statement:
Every set is countable.
Of course a counterexample to this would be the interval from . To show that this interval isn't countable, we assume that it is, so then there is a bijective function since is bijective we may write the elements of in a list here we write the numbers in in their decimal expansions.
For the purposes of the proof, we need each real number to be uniquely defined by a decimal. This is of course true in almost all cases (take .01, for instance... it's equal to .00999999999...). In fact, the cases when it's not true are exactly the cases when the decimal ends in a string of zeros, or "terminates" (a separate proof would be needed for this, but for now accept it as true). Therefore, let the decimals all be in "nonterminating form," so that there's another bijection between the decimal expansions and the real numbers.
Now let if is odd and 1 if is even. Now I claim that is not listed here. To show this suppose it were then for some but if is odd and if , this is not possible so we conclude that is not in the list, this shows we may not have a surjection from
Although this is a counterexample, we still had to PROVE that it was in fact a counterexample and in doing so used both a proof by contradiction (this was the overall method of the proof) by a construction (of ). Although there may be more then one counterexample to any given false claim, you must always provide by a proof or argument that your counterexample works.
== The Curry-Howard Correspondance ==
The Curry-Howard correspondence establish a close relationship between computer programs and mathematical proofs.
1.1, Logical Reasoning.
These are the answers to the exercises in section 1.1, Logical Reasoning
- Truth tables
2. Negated statements
These are the answers to the exercises in section 1.2, Notation
- Use set theory notation
- , where P is the set of all people and L is the set of all things with two legs.
- , where M is the set of all mythological creatures and G is the set of all things that are Greek.
- , where A is the set of all pies, B is the set of all things with pudding in them, and C is the set of all things with cream on the top.
- Venn Diagrams
- Negated statements
- There is a quick, brown fox that does not jump over any lazy dog.
Symbols Used in this Book
This is a list of all the mathematical symbols used in this book.
- Closed interval notation. Signifies the set of all numbers between a and b (a and b included)
- A logical "or" connector. A truth statement whose truth value is true if either of the two given statements is true and false if they are both false.
- A logical "and" connector. A truth statement whose truth value is true only if both of the two given statements is true and false otherwise.
- A logical "not" unary operator. A truth statement whose value is opposite of the given statement.
- Set delimiters. A set may be defined explicitly (e.g. ), or pseudo-explicitly by giving a pattern (e.g. . It may also be defined with a given rule (e.g. , the set of all x such that P(x) is true).
- The "element of" binary operator. This shows element inclusion in a set. If x is an element of A we write
- The "set inclusion" or "subset" binary operator. If all the elements of A are in B, then we say that A is a subset of B and write Note that in this book, when
- The union of two sets. A set containing all elements of two given sets.
- The intersection of two sets. A set containing all the elements that are in both of two given sets.
- The science of addition and multiplication (subtraction and division are included, since they are the inverse operations of addition and multiplication). Proof by Contrapositive
- A self-evident truth. It is the foundation of logical reasoning. A statement that is accepted as true without proof, which may be assumed in proving that other things are true.Notation
- A collection of open sets in a set such that the intersection of any two open sets in contains a set Proof by Contradiction
- The result of a given conditional statement. (The "then" clause of a theorem.) This is also sometimes referred to as the result. Constructive Proof
- Conditional statement
- An "if" or an "only-if" statement. It is conditional because its truth value is determined by the truth value of two other statements. Logical Reasoning
- The converse and negation of a conditional. The contrapositive of is . Logical Reasoning
- The "reverse" of a conditional statement. The converse of is . Logical Reasoning
- That which follows, usually without any necessary argument, from a given result. Constructive Proof
- See factor.
- An integer n divides an integer m, if n is a factor of m, equivalently, if m is a multiple of n, or, equivalently, if there's a integer k such that . Proof by Contrapositive
- One of the objects in a set. Notation
- See Logically Equivalent.
- An integer that divides a given integer. (e.g. 3 is a factor of 6.) This is the "opposite" of multiple. Proof by Contrapositive
- A result whose proof is fairly simple or one that is used to simplify or break down a larger argument. Constructive Proof
- Logcially Equivalent
- Two statements that are simultaneously true or simultaneously false are logically equivalent. Logical Reasoning
- An integer obtained by multiplying two integers together. (e.g. 4 is a mulitple of 2). This is the "opposite" of factor. Proof by Contrapositive
- The opposite of a truth statement. The negation of
falseand vice-versa. Logical Reasoning
- Open set
- A set that is an element of a topology defined on a set Proof by Contradiction
- A lemma, theorem, or corollary. A statement of "if-then" that has been proven to be true. Also, the conclusion of such a statement. Constructive Proof
- A collection of items, or elements. Notation
- See Truth Statement.
- A main result. Usually the proof is somewhat involved and the result is interesting and useful. Constructive Proof
- Topological Space
- A set together with a topology that satisfy the topology axioms. Proof by Contradiction
- A collection of subsets of a given set that satisfy the topology axioms. Proof by Contradiction
- Truth Statement
- A statement whose truth value can be determined. Therefore, it is either true or false. Logical Reasoning
- Truth Value
- The assessment of whether a statement is true or false. Logical Reasoning
GNU Free Documentation License
||As of July 15, 2009 Wikibooks has moved to a dual-licensing system that supersedes the previous GFDL only licensing. In short, this means that text licensed under the GFDL only can no longer be imported to Wikibooks, retroactive to 1 November 2008. Additionally, Wikibooks text might or might not now be exportable under the GFDL depending on whether or not any content was added and not removed since July 15.|
Version 1.3, 3 November 2008 Copyright (C) 2000, 2001, 2002, 2007, 2008 Free Software Foundation, Inc. <http://fsf.org/>
Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed.
The purpose of this License is to make a manual, textbook, or other functional and useful document "free" in the sense of freedom: to assure everyone the effective freedom to copy and redistribute it, with or without modifying it, either commercially or noncommercially. Secondarily, this License preserves for the author and publisher a way to get credit for their work, while not being considered responsible for modifications made by others.
This License is a kind of "copyleft", which means that derivative works of the document must themselves be free in the same sense. It complements the GNU General Public License, which is a copyleft license designed for free software.
We have designed this License in order to use it for manuals for free software, because free software needs free documentation: a free program should come with manuals providing the same freedoms that the software does. But this License is not limited to software manuals; it can be used for any textual work, regardless of subject matter or whether it is published as a printed book. We recommend this License principally for works whose purpose is instruction or reference.
1. APPLICABILITY AND DEFINITIONS
This License applies to any manual or other work, in any medium, that contains a notice placed by the copyright holder saying it can be distributed under the terms of this License. Such a notice grants a world-wide, royalty-free license, unlimited in duration, to use that work under the conditions stated herein. The "Document", below, refers to any such manual or work. Any member of the public is a licensee, and is addressed as "you". You accept the license if you copy, modify or distribute the work in a way requiring permission under copyright law.
A "Modified Version" of the Document means any work containing the Document or a portion of it, either copied verbatim, or with modifications and/or translated into another language.
A "Secondary Section" is a named appendix or a front-matter section of the Document that deals exclusively with the relationship of the publishers or authors of the Document to the Document's overall subject (or to related matters) and contains nothing that could fall directly within that overall subject. (Thus, if the Document is in part a textbook of mathematics, a Secondary Section may not explain any mathematics.) The relationship could be a matter of historical connection with the subject or with related matters, or of legal, commercial, philosophical, ethical or political position regarding them.
The "Invariant Sections" are certain Secondary Sections whose titles are designated, as being those of Invariant Sections, in the notice that says that the Document is released under this License. If a section does not fit the above definition of Secondary then it is not allowed to be designated as Invariant. The Document may contain zero Invariant Sections. If the Document does not identify any Invariant Sections then there are none.
The "Cover Texts" are certain short passages of text that are listed, as Front-Cover Texts or Back-Cover Texts, in the notice that says that the Document is released under this License. A Front-Cover Text may be at most 5 words, and a Back-Cover Text may be at most 25 words.
A "Transparent" copy of the Document means a machine-readable copy, represented in a format whose specification is available to the general public, that is suitable for revising the document straightforwardly with generic text editors or (for images composed of pixels) generic paint programs or (for drawings) some widely available drawing editor, and that is suitable for input to text formatters or for automatic translation to a variety of formats suitable for input to text formatters. A copy made in an otherwise Transparent file format whose markup, or absence of markup, has been arranged to thwart or discourage subsequent modification by readers is not Transparent. An image format is not Transparent if used for any substantial amount of text. A copy that is not "Transparent" is called "Opaque".
Examples of suitable formats for Transparent copies include plain ASCII without markup, Texinfo input format, LaTeX input format, SGML or XML using a publicly available DTD, and standard-conforming simple HTML, PostScript or PDF designed for human modification. Examples of transparent image formats include PNG, XCF and JPG. Opaque formats include proprietary formats that can be read and edited only by proprietary word processors, SGML or XML for which the DTD and/or processing tools are not generally available, and the machine-generated HTML, PostScript or PDF produced by some word processors for output purposes only.
The "Title Page" means, for a printed book, the title page itself, plus such following pages as are needed to hold, legibly, the material this License requires to appear in the title page. For works in formats which do not have any title page as such, "Title Page" means the text near the most prominent appearance of the work's title, preceding the beginning of the body of the text.
The "publisher" means any person or entity that distributes copies of the Document to the public.
A section "Entitled XYZ" means a named subunit of the Document whose title either is precisely XYZ or contains XYZ in parentheses following text that translates XYZ in another language. (Here XYZ stands for a specific section name mentioned below, such as "Acknowledgements", "Dedications", "Endorsements", or "History".) To "Preserve the Title" of such a section when you modify the Document means that it remains a section "Entitled XYZ" according to this definition.
The Document may include Warranty Disclaimers next to the notice which states that this License applies to the Document. These Warranty Disclaimers are considered to be included by reference in this License, but only as regards disclaiming warranties: any other implication that these Warranty Disclaimers may have is void and has no effect on the meaning of this License.
2. VERBATIM COPYING
You may copy and distribute the Document in any medium, either commercially or noncommercially, provided that this License, the copyright notices, and the license notice saying this License applies to the Document are reproduced in all copies, and that you add no other conditions whatsoever to those of this License. You may not use technical measures to obstruct or control the reading or further copying of the copies you make or distribute. However, you may accept compensation in exchange for copies. If you distribute a large enough number of copies you must also follow the conditions in section 3.
You may also lend copies, under the same conditions stated above, and you may publicly display copies.
3. COPYING IN QUANTITY
If you publish printed copies (or copies in media that commonly have printed covers) of the Document, numbering more than 100, and the Document's license notice requires Cover Texts, you must enclose the copies in covers that carry, clearly and legibly, all these Cover Texts: Front-Cover Texts on the front cover, and Back-Cover Texts on the back cover. Both covers must also clearly and legibly identify you as the publisher of these copies. The front cover must present the full title with all words of the title equally prominent and visible. You may add other material on the covers in addition. Copying with changes limited to the covers, as long as they preserve the title of the Document and satisfy these conditions, can be treated as verbatim copying in other respects.
If the required texts for either cover are too voluminous to fit legibly, you should put the first ones listed (as many as fit reasonably) on the actual cover, and continue the rest onto adjacent pages.
If you publish or distribute Opaque copies of the Document numbering more than 100, you must either include a machine-readable Transparent copy along with each Opaque copy, or state in or with each Opaque copy a computer-network location from which the general network-using public has access to download using public-standard network protocols a complete Transparent copy of the Document, free of added material. If you use the latter option, you must take reasonably prudent steps, when you begin distribution of Opaque copies in quantity, to ensure that this Transparent copy will remain thus accessible at the stated location until at least one year after the last time you distribute an Opaque copy (directly or through your agents or retailers) of that edition to the public.
It is requested, but not required, that you contact the authors of the Document well before redistributing any large number of copies, to give them a chance to provide you with an updated version of the Document.
You may copy and distribute a Modified Version of the Document under the conditions of sections 2 and 3 above, provided that you release the Modified Version under precisely this License, with the Modified Version filling the role of the Document, thus licensing distribution and modification of the Modified Version to whoever possesses a copy of it. In addition, you must do these things in the Modified Version:
- Use in the Title Page (and on the covers, if any) a title distinct from that of the Document, and from those of previous versions (which should, if there were any, be listed in the History section of the Document). You may use the same title as a previous version if the original publisher of that version gives permission.
- List on the Title Page, as authors, one or more persons or entities responsible for authorship of the modifications in the Modified Version, together with at least five of the principal authors of the Document (all of its principal authors, if it has fewer than five), unless they release you from this requirement.
- State on the Title page the name of the publisher of the Modified Version, as the publisher.
- Preserve all the copyright notices of the Document.
- Add an appropriate copyright notice for your modifications adjacent to the other copyright notices.
- Include, immediately after the copyright notices, a license notice giving the public permission to use the Modified Version under the terms of this License, in the form shown in the Addendum below.
- Preserve in that license notice the full lists of Invariant Sections and required Cover Texts given in the Document's license notice.
- Include an unaltered copy of this License.
- Preserve the section Entitled "History", Preserve its Title, and add to it an item stating at least the title, year, new authors, and publisher of the Modified Version as given on the Title Page. If there is no section Entitled "History" in the Document, create one stating the title, year, authors, and publisher of the Document as given on its Title Page, then add an item describing the Modified Version as stated in the previous sentence.
- Preserve the network location, if any, given in the Document for public access to a Transparent copy of the Document, and likewise the network locations given in the Document for previous versions it was based on. These may be placed in the "History" section. You may omit a network location for a work that was published at least four years before the Document itself, or if the original publisher of the version it refers to gives permission.
- For any section Entitled "Acknowledgements" or "Dedications", Preserve the Title of the section, and preserve in the section all the substance and tone of each of the contributor acknowledgements and/or dedications given therein.
- Preserve all the Invariant Sections of the Document, unaltered in their text and in their titles. Section numbers or the equivalent are not considered part of the section titles.
- Delete any section Entitled "Endorsements". Such a section may not be included in the Modified version.
- Do not retitle any existing section to be Entitled "Endorsements" or to conflict in title with any Invariant Section.
- Preserve any Warranty Disclaimers.
If the Modified Version includes new front-matter sections or appendices that qualify as Secondary Sections and contain no material copied from the Document, you may at your option designate some or all of these sections as invariant. To do this, add their titles to the list of Invariant Sections in the Modified Version's license notice. These titles must be distinct from any other section titles.
You may add a section Entitled "Endorsements", provided it contains nothing but endorsements of your Modified Version by various parties—for example, statements of peer review or that the text has been approved by an organization as the authoritative definition of a standard.
You may add a passage of up to five words as a Front-Cover Text, and a passage of up to 25 words as a Back-Cover Text, to the end of the list of Cover Texts in the Modified Version. Only one passage of Front-Cover Text and one of Back-Cover Text may be added by (or through arrangements made by) any one entity. If the Document already includes a cover text for the same cover, previously added by you or by arrangement made by the same entity you are acting on behalf of, you may not add another; but you may replace the old one, on explicit permission from the previous publisher that added the old one.
The author(s) and publisher(s) of the Document do not by this License give permission to use their names for publicity for or to assert or imply endorsement of any Modified Version.
5. COMBINING DOCUMENTS
You may combine the Document with other documents released under this License, under the terms defined in section 4 above for modified versions, provided that you include in the combination all of the Invariant Sections of all of the original documents, unmodified, and list them all as Invariant Sections of your combined work in its license notice, and that you preserve all their Warranty Disclaimers.
The combined work need only contain one copy of this License, and multiple identical Invariant Sections may be replaced with a single copy. If there are multiple Invariant Sections with the same name but different contents, make the title of each such section unique by adding at the end of it, in parentheses, the name of the original author or publisher of that section if known, or else a unique number. Make the same adjustment to the section titles in the list of Invariant Sections in the license notice of the combined work.
In the combination, you must combine any sections Entitled "History" in the various original documents, forming one section Entitled "History"; likewise combine any sections Entitled "Acknowledgements", and any sections Entitled "Dedications". You must delete all sections Entitled "Endorsements".
6. COLLECTIONS OF DOCUMENTS
You may make a collection consisting of the Document and other documents released under this License, and replace the individual copies of this License in the various documents with a single copy that is included in the collection, provided that you follow the rules of this License for verbatim copying of each of the documents in all other respects.
You may extract a single document from such a collection, and distribute it individually under this License, provided you insert a copy of this License into the extracted document, and follow this License in all other respects regarding verbatim copying of that document.
7. AGGREGATION WITH INDEPENDENT WORKS
A compilation of the Document or its derivatives with other separate and independent documents or works, in or on a volume of a storage or distribution medium, is called an "aggregate" if the copyright resulting from the compilation is not used to limit the legal rights of the compilation's users beyond what the individual works permit. When the Document is included in an aggregate, this License does not apply to the other works in the aggregate which are not themselves derivative works of the Document.
If the Cover Text requirement of section 3 is applicable to these copies of the Document, then if the Document is less than one half of the entire aggregate, the Document's Cover Texts may be placed on covers that bracket the Document within the aggregate, or the electronic equivalent of covers if the Document is in electronic form. Otherwise they must appear on printed covers that bracket the whole aggregate.
Translation is considered a kind of modification, so you may distribute translations of the Document under the terms of section 4. Replacing Invariant Sections with translations requires special permission from their copyright holders, but you may include translations of some or all Invariant Sections in addition to the original versions of these Invariant Sections. You may include a translation of this License, and all the license notices in the Document, and any Warranty Disclaimers, provided that you also include the original English version of this License and the original versions of those notices and disclaimers. In case of a disagreement between the translation and the original version of this License or a notice or disclaimer, the original version will prevail.
If a section in the Document is Entitled "Acknowledgements", "Dedications", or "History", the requirement (section 4) to Preserve its Title (section 1) will typically require changing the actual title.
You may not copy, modify, sublicense, or distribute the Document except as expressly provided under this License. Any attempt otherwise to copy, modify, sublicense, or distribute it is void, and will automatically terminate your rights under this License.
However, if you cease all violation of this License, then your license from a particular copyright holder is reinstated (a) provisionally, unless and until the copyright holder explicitly and finally terminates your license, and (b) permanently, if the copyright holder fails to notify you of the violation by some reasonable means prior to 60 days after the cessation.
Moreover, your license from a particular copyright holder is reinstated permanently if the copyright holder notifies you of the violation by some reasonable means, this is the first time you have received notice of violation of this License (for any work) from that copyright holder, and you cure the violation prior to 30 days after your receipt of the notice.
Termination of your rights under this section does not terminate the licenses of parties who have received copies or rights from you under this License. If your rights have been terminated and not permanently reinstated, receipt of a copy of some or all of the same material does not give you any rights to use it.
10. FUTURE REVISIONS OF THIS LICENSE
The Free Software Foundation may publish new, revised versions of the GNU Free Documentation License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. See http://www.gnu.org/copyleft/.
Each version of the License is given a distinguishing version number. If the Document specifies that a particular numbered version of this License "or any later version" applies to it, you have the option of following the terms and conditions either of that specified version or of any later version that has been published (not as a draft) by the Free Software Foundation. If the Document does not specify a version number of this License, you may choose any version ever published (not as a draft) by the Free Software Foundation. If the Document specifies that a proxy can decide which future versions of this License can be used, that proxy's public statement of acceptance of a version permanently authorizes you to choose that version for the Document.
"Massive Multiauthor Collaboration Site" (or "MMC Site") means any World Wide Web server that publishes copyrightable works and also provides prominent facilities for anybody to edit those works. A public wiki that anybody can edit is an example of such a server. A "Massive Multiauthor Collaboration" (or "MMC") contained in the site means any set of copyrightable works thus published on the MMC site.
"CC-BY-SA" means the Creative Commons Attribution-Share Alike 3.0 license published by Creative Commons Corporation, a not-for-profit corporation with a principal place of business in San Francisco, California, as well as future copyleft versions of that license published by that same organization.
"Incorporate" means to publish or republish a Document, in whole or in part, as part of another Document.
An MMC is "eligible for relicensing" if it is licensed under this License, and if all works that were first published under this License somewhere other than this MMC, and subsequently incorporated in whole or in part into the MMC, (1) had no cover texts or invariant sections, and (2) were thus incorporated prior to November 1, 2008.
The operator of an MMC Site may republish an MMC contained in the site under CC-BY-SA on the same site at any time before August 1, 2009, provided the MMC is eligible for relicensing.
How to use this License for your documents
To use this License in a document you have written, include a copy of the License in the document and put the following copyright and license notices just after the title page:
- Copyright (c) YEAR YOUR NAME.
- Permission is granted to copy, distribute and/or modify this document
- under the terms of the GNU Free Documentation License, Version 1.3
- or any later version published by the Free Software Foundation;
- with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts.
- A copy of the license is included in the section entitled "GNU
- Free Documentation License".
If you have Invariant Sections, Front-Cover Texts and Back-Cover Texts, replace the "with...Texts." line with this:
- with the Invariant Sections being LIST THEIR TITLES, with the
- Front-Cover Texts being LIST, and with the Back-Cover Texts being LIST.
If you have Invariant Sections without Cover Texts, or some other combination of the three, merge those two alternatives to suit the situation.
If your document contains nontrivial examples of program code, we recommend releasing these examples in parallel under your choice of free software license, such as the GNU General Public License, to permit their use in free software.