Definitions
A nuisance in first learning graph theory is that there are so many definitions. They all correspond to intuitive ideas, but can take a long time to absorb. Worse, the same thing often has several names and the same name sometimes means slightly different things to different people! It's a big mess, but muddle through.
2.2 Not-So-Simple Graphs
There are actually many variants on the definition of a graph. The definition in the preceding section really only describes simple graphs. There are many ways to complicate matters.
2.1 Simple Graphs
A graph is a pair of sets $(V, E)$. The elements of $V$ are called vertices. The elements of $E$ are called edges. Each edge is a pair of distinct vertices. Graphs are also sometimes called networks. Vertices are also sometimes called nodes. Edges are sometimes called arcs. Graphs can be nicely represented with a diagram of dots and lines as shown in Figure 2.

As noted in the definition, each edge $(u, v) \in E$ is a pair of distinct vertices $u, v \in V$. Edge $(u, v)$ is said to be incident to vertices $u$ and $v$. Vertices $u$ and $v$ are said to be adjacent or neighbors. Phrases like "an edge joins $u$ and $v$" and "the edge between $u$ and $v$" are common.

A computer network can be modeled nicely as a graph. In this instance, the set of vertices $V$ represents the set of computers in the network. There is an edge $(u, v)$ if there is a direct communication link between the computers corresponding to $u$ and $v$.
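The definition above translates almost directly into code. The following is a minimal sketch, assuming a made-up four-computer network; the vertex names and the helper functions `is_simple_graph` and `neighbors` are illustrative and do not come from the notes.

```python
# Hypothetical example data: a small computer network with four machines.
V = {"a", "b", "c", "d"}

# Each edge is an unordered pair of distinct vertices, so a frozenset fits.
E = {frozenset(pair) for pair in [("a", "b"), ("b", "c"), ("c", "d"), ("a", "c")]}


def is_simple_graph(vertices, edges):
    """Check that every edge is a pair of distinct vertices from `vertices`."""
    return all(len(e) == 2 and e <= vertices for e in edges)


def neighbors(v, edges):
    """Return the set of vertices adjacent to v."""
    return {u for e in edges if v in e for u in e if u != v}


print(is_simple_graph(V, E))
print(sorted(neighbors("c", E)))
```

Storing edges as two-element sets makes "a pair of distinct vertices" a structural property: a loop such as `frozenset(["a", "a"])` collapses to one element and fails the check.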
1 What is a Proof?
Axiom 1 If a = b and b = c, then a = c.
This seems very reasonable! But sometimes the right choice of axiom is not clear.
Axiom 2 (Euclidean geometry) Given a line l and a point p not on l, there is exactly one line
many lines through p parallel to l.
No one of the three preceding axioms is better than the others; all yield equally good proofs. Of course, a different choice of axioms makes different propositions true. Still, a set of axioms should not be chosen arbitrarily. In particular, there are two basic properties that one would want in any set of axioms: it should be consistent and complete.
For example, suppose $A$ is a set of students, $B$ is a set of recitations, and $f$ defines the assignment
1 The Division Rule
We will state the Division Rule twice, once informally and then again with more precise notation.
Theorem 1.1 (Division Rule) If $B$ is a finite set and $f : A \to B$ maps precisely $k$ items of $A$ to
In the example above, we found six ways to choose two elements from the set $S = \{A, B, C\}$ with
1.2 Triple-Scoop Ice Cream Cones
Baskin-Robbins is an ice cream store that has 31 different flavors. How many different triple-scoop ice cream cones are possible at Baskin-Robbins? Two ice cream cones are considered the same if one can be obtained from the other by reordering the scoops. Of course, we are permitted to have two or even three scoops of the same flavor.
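Before deriving anything, the question can be settled by brute force. The sketch below enumerates every unordered selection of three scoops with repeats allowed and compares the count with the standard closed form for multisets, $\binom{n + k - 1}{k}$. That formula is brought in here as a known fact and is not derived in this passage; the constant names are illustrative.

```python
from itertools import combinations_with_replacement
from math import comb

FLAVORS = 31  # number of flavors, from the problem statement
SCOOPS = 3    # scoops per cone

# Enumerate unordered selections of SCOOPS flavors, repeats allowed.
# Reordering scoops does not create a new cone, so each multiset counts once.
cones = sum(1 for _ in combinations_with_replacement(range(FLAVORS), SCOOPS))

# Closed form for the number of size-k multisets drawn from n items.
closed_form = comb(FLAVORS + SCOOPS - 1, SCOOPS)

print(cones, closed_form)
```

Both numbers come out to 5456.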
1 The Monty Hall Problem
In the 1970s, there was a game show called Let's Make a Deal, hosted by Monty Hall and his assistant Carol Merryl. At one stage of the game, a contestant is shown three doors. The contestant knows there is a prize behind one door and that there are goats behind the other two. The contestant picks a door. To build suspense, Carol always opens a different door, revealing a goat. The contestant can then stick with his original door or switch to the other unopened door. He wins the prize only if he now picks the correct door. Should the contestant "stick" with his original door, "switch" to the other door, or does it not matter?

We will analyze the probability that the contestant wins with the "switch" strategy; that is, the contestant chooses a random door initially and then always switches after Carol reveals a goat behind one door. The analysis can be broken down into four steps.
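The "switch" strategy can also be checked by exhaustive enumeration. The sketch below rests on two assumptions that the text does not state explicitly: the prize is equally likely to be behind each door, and when Carol has two goat doors to choose from she picks one uniformly at random. The function name is illustrative.

```python
from fractions import Fraction

DOORS = (0, 1, 2)


def switch_win_probability():
    """Exact probability of winning when the contestant always switches."""
    total = Fraction(0)
    for prize in DOORS:
        for pick in DOORS:
            # Carol may open any door that is neither the pick nor the prize.
            openable = [d for d in DOORS if d != pick and d != prize]
            for opened in openable:
                # Assumed: prize and pick are uniform, and Carol chooses
                # uniformly among the doors she is allowed to open.
                weight = Fraction(1, 3) * Fraction(1, 3) * Fraction(1, len(openable))
                # Switching means taking the one door left unopened.
                final = next(d for d in DOORS if d not in (pick, opened))
                if final == prize:
                    total += weight
    return total


print(switch_win_probability())
```

Under these assumptions the enumeration gives 2/3, so switching wins exactly when the initial pick was wrong.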
1 Annuities
Would you prefer a million dollars today or $50,000 a year for the rest of your life? This is a question about the value of an annuity. An annuity is a financial instrument that pays out a fixed amount of money at the beginning of every year for some specified number of years. In particular, an n-year, m-payment annuity pays m dollars at the start of each year for n years. In some cases, n is finite, but not always. Examples include lottery payouts, student loans, and home mortgages. There are even Wall Street people who specialize in trading annuities.

A key question is what an annuity is worth. For example, lotteries often pay out jackpots over many years. Intuitively, $50,000 a year for 20 years ought to be worth less than a million dollars right now. If you had all the cash right away, you could invest it and begin collecting interest. But what if the choice were between $50,000 a year for 20 years and a half million dollars today? Now it is not clear which option is better.

In order to answer such questions, we need to know what a dollar paid out in the future is worth today. We will assume that money can be invested at a fixed annual interest rate p. These days a good estimate for p is around 8%; we'll use this value for the rest of the lecture.

Here is why the interest rate p matters. Ten dollars invested today at interest rate p will become 10 · (1 + p) = $10.80 in a year, 10 · (1 + p)^2 ≈ $11.66 in two years, and so forth. Looked at another way, ten dollars paid out a year from now are only really worth 10 / (1 + p) ≈ $9.26 today. The reason is that if we had the $9.26 today, we could invest it and would have $10.00 in a year anyway. Therefore, p determines the value of money paid out in the future.
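The discounting argument can be turned into a short calculation. The sketch below values an annuity by summing the present value of each payment, using the convention above that payments arrive at the start of each year (so the first payment is not discounted). The function names are illustrative, and the loop is a direct sum, not a closed-form formula.

```python
def present_value(amount, rate, years):
    """Value today of `amount` dollars paid `years` years from now."""
    return amount / (1 + rate) ** years


def annuity_value(payment, rate, n_years):
    """Sum the present values of payments made at the start of each year."""
    return sum(present_value(payment, rate, year) for year in range(n_years))


p = 0.08  # the interest rate assumed in the text

print(round(present_value(10, p, 1), 2))   # ten dollars paid a year from now
print(round(annuity_value(50_000, p, 20)))  # $50,000 a year for 20 years
```

At p = 8% the 20-year annuity comes out to roughly $530,000, so under this interest-rate assumption it is worth slightly more than half a million dollars today.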
Lecture Notes 1 Proof by Induction
$$\sum_{i=1}^{n} i$$
In this notation, the pattern of terms in the summation is made explicit. In two important special cases, the definition of the summation $1 + 2 + 3 + \dots + n$ requires some care:

If $n = 1$, then $1 + 2 + 3 + \dots + n = 1$. There is only one term in the summation; the appearance of 2 and 3 and the presence of both 1 and $n$ is misleading!

If $n \le 0$, then $1 + 2 + 3 + \dots + n = 0$. There are no terms at all in the summation, so the result is zero.

The text of a proof by induction should consist of four parts. We'll discuss each part below and then write a proof for the preceding theorem following this outline.

1. State that the proof is by induction. This immediately conveys the general structure of the argument.
2. Specify the induction hypothesis, $P(n)$. Sometimes, the choice of $P(n)$ will come directly from the theorem statement. In this case $P(n)$ will simply be the predicate $1 + 2 + 3 + \dots + n = n(n + 1)/2$. Other times, the choice of $P(n)$ is not obvious at all; we will see an example of this at the end of the lecture.
3. The basis step: prove $P(0)$. The "basis step" or "base case" is a proof of the predicate $P(0)$. In this case, we must prove that $1 + 2 + 3 + \dots + n = n(n + 1)/2$ when $n = 0$. This is trivial because both sides of the equation are zero.
4. The inductive step: prove that $\forall n \in \mathbb{N},\ P(n) \Rightarrow P(n + 1)$. Begin the inductive step by writing, "For $n \ge 0$, assume $P(n)$ to prove $P(n + 1)$." (Substitute in the statements of the predicates $P(n)$ and $P(n + 1)$.) Then verify that for all $n \ge 0$, $P(n)$ indeed implies $P(n + 1)$.

A complete proof of the theorem that follows this four-part formula is given below.

Proof. The proof is by induction. Let $P(n)$ be the predicate $1 + 2 + 3 + \dots + n = n(n + 1)/2$. In the base case, $P(0)$ is true because both sides of the equation are zero. For $n \ge 0$, assume $1 + 2 + 3 + \dots + n = n(n + 1)/2$ to prove that $1 + 2 + 3 + \dots + n + (n + 1) = (n + 1)(n + 2)/2$. We can now reason as follows.
\begin{align*}
1 + 2 + 3 + \dots + n + (n + 1) &= \frac{n(n + 1)}{2} + (n + 1) \\
&= \frac{n^2 + n + 2n + 2}{2} \\
&= \frac{n^2 + 3n + 2}{2} \\
&= \frac{(n + 1)(n + 2)}{2}
\end{align*}
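A numerical check is no substitute for the proof, but it is a quick way to catch an algebra slip. The sketch below compares the closed form with a direct sum for small $n \ge 0$ and also tests the inductive step in the form used above. The function names are illustrative.

```python
def closed_form(n):
    """The claimed value n(n + 1)/2 of 1 + 2 + ... + n."""
    return n * (n + 1) // 2


def direct_sum(n):
    """Add the terms one by one; an empty sum (n = 0) is zero."""
    return sum(range(1, n + 1))


# Base case P(0) and the predicate P(n) itself for small n.
ok_values = all(closed_form(n) == direct_sum(n) for n in range(0, 200))

# Inductive step: adding the term (n + 1) to the formula for n
# must give the formula for n + 1.
ok_step = all(closed_form(n) + (n + 1) == closed_form(n + 1) for n in range(0, 200))

print(ok_values, ok_step)
```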
Lecture 3 11 Sep 97
Lecture Notes 1 Strong Induction
Claim 2.1 For all $n \ge 8$, it is possible to produce $n$ cents of postage from 3 and 5 cent stamps.
Now let's preview the proof. A proof by strong induction will have the same four-part structure as an ordinary induction proof. The base case, $P(8)$, will be easy because we can produce 8 cents of postage from a 3 cent stamp and a 5 cent stamp. In the inductive step we assume that we know how to produce $8, 9, \dots, n$ cents of postage, and we have to show how to produce $n + 1$ cents of postage. One way is first to create $n - 2$ cents of postage using strong induction and then to add a 3 cent stamp.

We have to be careful; there is a pitfall here. If $n$ is 8 or 9, then we cannot use the trick of creating $n + 1$ cents of postage from $n - 2$ cents and a 3 cent stamp. In these cases, $n - 2$ is less than 8. None of the assumptions that strong induction permits help us make less than 8 cents of postage. Fortunately, making $n + 1$ cents of postage when $n$ is 8 or 9 is very easy; we can do this directly.

Proof. The proof is by strong induction. Let $P(n)$ be the predicate that it is possible to produce $n$ cents of postage from 3 and 5 cent stamps. In the basis step, $P(8)$ is true because 8 cents of postage can be made from a 3 cent stamp and a 5 cent stamp.
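The structure of the strong induction argument maps onto a recursive procedure. In the sketch below, the amounts 8, 9 and 10 are handled directly (the cases the preview flags as needing separate treatment), and every larger amount is built from the amount three cents smaller plus one 3 cent stamp. The function name `stamps` and the particular choices for 9 and 10 cents are illustrative.

```python
def stamps(n):
    """Return a list of 3 and 5 cent stamps totalling n cents, for n >= 8."""
    if n < 8:
        raise ValueError("the claim only covers n >= 8")
    # Amounts made directly, since n - 3 would fall below 8.
    direct = {8: [3, 5], 9: [3, 3, 3], 10: [5, 5]}
    if n in direct:
        return list(direct[n])
    # Inductive step: n - 3 >= 8 here, so the smaller amount is available.
    return stamps(n - 3) + [3]


print(stamps(8))
print(stamps(23))
```

The recursion mirrors the proof: the call on `n - 3` is legitimate only because `n - 3` is at least 8, which is exactly the condition the pitfall paragraph checks.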
1.1 Markov's Inequality
We can obtain a weak bound on the probability that at least one of the events $A_1, A_2, \dots, A_N$ occurs using Markov's Theorem:
$$\Pr(T \ge 1) \le \frac{\mathrm{Ex}(T)}{1}$$
1.2 An Alternate Proof
We can obtain Fact 1 in another way. Recall Boole's Inequality, which says that for any events $A_1, A_2, \dots, A_N$, we have:
$$\Pr(A_1 \cup A_2 \cup \dots \cup A_N) \le \Pr(A_1) + \Pr(A_2) + \dots + \Pr(A_N)$$
Lecture Notes
This lecture is devoted to one rather general probability question. Let $A_1, A_2, \dots, A_N$ be mutually independent events over the same sample space. Let the random variable $T$ be the number of these events that occur. What is the probability that at least $k$ events occur? That is, what is $\Pr(T \ge k)$?

This question comes up often. For example, suppose we want to know the probability that at least $k$ heads come up in $N$ tosses of a coin. Here $A_i$ is the event that the coin is heads on the $i$-th toss, $T$ is the total number of heads, and $\Pr(T \ge k)$ is the probability that at least $k$ heads come up. As a second example, suppose that we want the probability of a student answering at least $k$ questions correctly on an exam with $N$ questions. In this case, $A_i$ is the event that the student answers the $i$-th question correctly, $T$ is the total number of questions answered correctly, and $\Pr(T \ge k)$ is the probability that the student answers at least $k$ questions correctly.

There is an important difference between these two examples. The first example is a special case in that all events $A_i$ have equal probability; that is, the coin is as likely to come up heads on one flip as on another. In particular, suppose that the coin comes up heads with probability $p$. Then all events $A_i$ have probability $p$, and $T$ has the now-familiar binomial distribution (here $n = N$, the number of tosses):
$$\Pr(T = k) = \binom{n}{k} p^k (1 - p)^{n - k} = f_{n,p}(k)$$
We studied the binomial distribution extensively two weeks ago. Therefore, we already have an answer to the question posed above in the special case where all events $A_i$ have equal probability.

In the second example, however, some exam questions might be more difficult than others. If question 1 is easier than question 2, then the probability of event $A_1$ is greater than the probability of event $A_2$. This lecture focuses on questions of this more general type in which the events $A_i$ may have different probabilities.
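For the equal-probability special case, $\Pr(T \ge k)$ is just a sum of binomial terms. The sketch below computes it exactly with rational arithmetic; it covers only the coin-tossing case and says nothing about events with different probabilities. The function names are illustrative.

```python
from fractions import Fraction
from math import comb


def binomial_pmf(n, p, k):
    """Pr(T = k) when T counts successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def prob_at_least(n, p, k):
    """Pr(T >= k), obtained by summing Pr(T = j) for j = k, ..., n."""
    return sum(binomial_pmf(n, p, j) for j in range(k, n + 1))


# Example: at least 2 heads in 3 tosses of a fair coin.
print(prob_at_least(3, Fraction(1, 2), 2))
```

Of the eight equally likely outcomes for three fair tosses, four have at least two heads, so the example prints 1/2.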
Definition A tree is a connected $n$-node graph with exactly $n - 1$ edges.
The vertices in a tree can be classified into two categories. Vertices of degree at most one are called leaves, and vertices of degree greater than one are called internal nodes. Trees are usually drawn as in Figure 1 with the leaves on the bottom. Keep this convention in mind; otherwise, phrases like "all the vertices below..." will be confusing. (The English mathematician Littlewood once remarked that he found such directional terms particularly bothersome, since he habitually read mathematics reclined on his back!)

Trees arise in many problems. For example, the file structure in a computer system can be naturally represented by a tree. In this case, each internal node corresponds to a directory, and each leaf corresponds to a file. If one directory contains another, then there is an edge between the associated internal nodes. If a directory contains a file, then there is an edge between the internal node and a leaf. There are several ways to describe trees that are equivalent to the preceding definition.
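The definition can be tested mechanically: count the edges, then check connectivity with a breadth-first search. The sketch below does this for a small, made-up file structure; the directory and file names and the helpers `is_tree` and `leaves` are hypothetical.

```python
from collections import deque


def is_tree(vertices, edges):
    """True if the graph is connected and has exactly n - 1 edges."""
    if not vertices or len(edges) != len(vertices) - 1:
        return False
    adjacent = {v: set() for v in vertices}
    for u, v in edges:
        adjacent[u].add(v)
        adjacent[v].add(u)
    # Breadth-first search from an arbitrary vertex to test connectivity.
    start = next(iter(vertices))
    seen = {start}
    queue = deque([start])
    while queue:
        for w in adjacent[queue.popleft()]:
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return len(seen) == len(vertices)


def leaves(vertices, edges):
    """Vertices of degree at most one."""
    degree = {v: 0 for v in vertices}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return {v for v, d in degree.items() if d <= 1}


# Hypothetical file structure: two directories under a root, one file in each.
V = {"/", "home", "etc", "notes.txt", "hosts"}
E = [("/", "home"), ("/", "etc"), ("home", "notes.txt"), ("etc", "hosts")]

print(is_tree(V, E), sorted(leaves(V, E)))
```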
$$p_i \mid m \;\Rightarrow\; p_i \mid (p_1 p_2 \cdots p_n + 1) \;\Rightarrow\; p_i \mid 1$$
Lecture 13: Lecture Notes
The first implication follows by substituting the definition of $m$. The second follows because $p_i$ divides the product $p_1 p_2 \cdots p_n$, and so must divide 1 in order to divide the sum. But no prime divides 1, so we have a contradiction. Therefore, there are an infinite number of primes. Proving that the set of primes is infinite is relatively easy, but the next example shows that determining whether a set is finite or infinite can be tricky.
Definition An infinite set $S$ is said to be countably infinite iff there exists a bijection $f : \mathbb{N} \to S$.
A set is countable if it is finite or countably infinite. Many familiar sets are countable: $\mathbb{N}$, the even numbers, primes, integers modulo a constant $k$, etc. The formal definition of a countable set is equivalent to the notion of a listable set, since listing the elements of a set gives a bijection with $\mathbb{N}$ and vice-versa. If we can list the elements of $S$ as $s_0, s_1, s_2, \dots$, then we can construct a bijection $f : \mathbb{N} \to S$ defined by $f(i) = s_i$. In the reverse direction, if we have such a bijection, then $f(0), f(1), f(2), \dots$ is a list of all the elements of $S$.

Sometimes we can list the elements of a set $S$, but the corresponding bijection $f : \mathbb{N} \to S$ is very hard to compute. For example, in principle we could list the elements in any subset of $\mathbb{N}$. However, most of these subsets are bizarre and difficult to describe; writing a program to compute $f(i)$ might be difficult or even impossible. The good news is that to prove that $S$ is countable, we must only prove that a bijection $f : \mathbb{N} \to S$ exists; we do not have to write $f$ explicitly or to give an algorithm to compute $f(i)$.
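The divisibility step can be seen on a concrete list. The sketch below takes the first six primes as a stand-in for the supposed complete list, forms $m = p_1 p_2 \cdots p_n + 1$ (reading the definition of $m$ off the implication chain above), and checks the remainder of $m$ modulo each $p_i$. The particular list is an assumption made for illustration.

```python
from math import prod

# Stand-in for the supposed complete list of primes.
primes = [2, 3, 5, 7, 11, 13]

# m is one more than the product of everything on the list.
m = prod(primes) + 1

# Each p_i divides the product, so dividing m by p_i always leaves 1.
remainders = [m % p for p in primes]

print(m, remainders)
```

Here $m = 30031$, and every remainder is 1, so no prime on the list divides $m$. Note that $m$ itself need not be prime; the argument only needs that any prime factor of $m$ is missing from the list.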
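The correspondence between a listing and a bijection can be made concrete for one of the sets mentioned above. In the sketch below, a generator lists the primes in increasing order, and $f(i)$ is read off as the element at position $i$ of that list. The generator uses plain trial division, chosen for brevity, and the names `list_primes` and `f` are illustrative.

```python
from itertools import count, islice


def list_primes():
    """List the primes in increasing order: s_0, s_1, s_2, ..."""
    found = []
    for candidate in count(2):
        # Trial division by the primes already listed, up to sqrt(candidate).
        if all(candidate % q for q in found if q * q <= candidate):
            found.append(candidate)
            yield candidate


def f(i, listing=list_primes):
    """The bijection induced by a listing: f(i) is the i-th listed element."""
    return next(islice(listing(), i, None))


print([f(i) for i in range(5)])
```

This `f` is easy to compute because the listing is easy to generate; the point in the text is that countability only requires such a listing to exist, not that it be computable.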
Lecture 19: Lecture Notes
How do we compute $\Pr(A \mid B)$? Since we are given that the person lives in Cambridge, all outcomes outside of event $B$ are irrelevant; these irrelevant outcomes are diagonally shaded in the figure. Intuitively, $\Pr(A \mid B)$ should be the fraction of Cambridge residents that are also MIT students. That is, the answer should be the probability that the person is in set $A \cap B$ (horizontally shaded) divided by the probability that the person is in set $B$. This gives:
the person is a Cambridge resident?
This is a conditional probability question. It can be concisely expressed in a special notation. In general, $\Pr(A \mid B)$ denotes the probability of event $A$, given event $B$. In this example, $\Pr(A \mid B)$ is the probability that the person is an MIT student, given that he or she is a Cambridge resident.
$$\Pr(B \mid B) = \frac{\Pr(B \cap B)}{\Pr(B)} = \frac{\Pr(B)}{\Pr(B)} = 1$$
The Product Rule can be generalized to many events.
Rule 1.2 (Product Rule, general case) Let $A_1, A_2, \dots, A_n$ be events.
$$\Pr(A \cap B) = \Pr(B) \Pr(A \mid B)$$
(We are now using the term "Product Rule" for two separate ideas. One is the definition of $\Pr(A \mid B)$ given above, and the other is the formula for the cardinality of a product of sets. In this lecture, the term always refers to the rule above.) As an example, what is $\Pr(B \mid B)$? That is, what is the probability of event $B$, given that event $B$ happens? Intuitively, this ought to be 1! The Product Rule gives exactly this result:
2.1 A Two-out-of-Three Series
The MIT computer science department's famed D-league hockey team, The Halting Problem, is playing a 2-out-of-3 series. That is, they play games until one team wins a total of two games. The probability that The Halting Problem wins the first game is 1/2. For subsequent games, the probability of winning depends on the outcome of the preceding game; the team is energized by victory and demoralized by defeat. Specifically, if The Halting Problem wins a game, then they have a 2/3 chance of winning the next game. On the other hand, if the team loses, then they have only a 1/3 chance of winning the following game. What is the probability that The Halting Problem wins the 2-out-of-3 series, given that they win the first game?

This problem involves two types of conditioning. First, we are told that the probability of the team winning a game is 2/3, given that they won the preceding game. Second, we are asked the odds of The Halting Problem winning the series, given that they win the first game.
$$\Pr(A \mid B) = \frac{\Pr(A \cap B)}{\Pr(B)}$$
Rearranging terms gives the following rule, which we will regard as the definition of $\Pr(A \mid B)$.
Rule 1.1 (Product Rule, special case) Let A and B be events.
[Figure 2: Tree diagram for the 2-out-of-3 series. Columns: 1st game outcome, 2nd game outcome, 3rd game outcome, outcome, event A: win the series?, event B: win the 1st game?, probability. Outcomes and probabilities: WW 1/3, WLW 1/18, WLL 1/9, LWW 1/9, LWL 1/18, LL 1/3.]
2 Conditional Probability Examples
This section contains five examples of conditional probability problems. In each case, the solution requires only the Product Rule and the four-step method that we applied to the Monty Hall problem.
$$\Pr(A_1 \cap A_2 \cap \dots \cap A_n) = \Pr(A_1) \Pr(A_2 \mid A_1) \Pr(A_3 \mid A_1 \cap A_2) \cdots \Pr(A_n \mid A_1 \cap \dots \cap A_{n-1})$$
This generalization of the Product Rule can be proven from the special case by induction on $n$, the number of sets. We will not give a proof here, however.
Step 1: Find the Sample Space
The sample space for the hockey series is worked out with a tree diagram in Figure 2. Each internal node has two children, one corresponding to a win for The Halting Problem (labeled W) and one corresponding to a loss (labeled L). The sample space consists of six outcomes, since there are six leaves in the tree diagram.
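The tree diagram can be generated by a short recursive procedure, which also makes the remaining steps easy to carry out. The sketch below builds the sample space from the win probabilities stated in the problem, multiplies edge probabilities along each path, and then applies the definition $\Pr(A \mid B) = \Pr(A \cap B) / \Pr(B)$. The function and variable names are illustrative.

```python
from fractions import Fraction


def series_outcomes():
    """Map each outcome (a string such as 'WLW') to its probability."""
    outcomes = {}

    def play(history, prob):
        # The series ends as soon as either side has two wins.
        if history.count("W") == 2 or history.count("L") == 2:
            outcomes[history] = prob
            return
        if not history:
            p_win = Fraction(1, 2)   # first game
        elif history[-1] == "W":
            p_win = Fraction(2, 3)   # energized by victory
        else:
            p_win = Fraction(1, 3)   # demoralized by defeat
        play(history + "W", prob * p_win)
        play(history + "L", prob * (1 - p_win))

    play("", Fraction(1))
    return outcomes


outcomes = series_outcomes()

# Event B: win the first game. Event A: win the series.
pr_B = sum(p for o, p in outcomes.items() if o[0] == "W")
pr_A_and_B = sum(p for o, p in outcomes.items()
                 if o[0] == "W" and o.count("W") == 2)

print(len(outcomes), pr_A_and_B / pr_B)
```

The enumeration finds six outcomes, matching the six leaves of the tree, and gives $\Pr(A \mid B) = (7/18) / (1/2) = 7/9$.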
Massachusetts Institute of Technology 6.042J/18.062J: Mathematics for Computer Science Professor Tom Leighton
Lecture 19 6 Nov 97
Lecture Notes
This lecture concerns the probability of one event, given that some other event happens. This is called a conditional probability. Trying to solve conditional probability problems by intuition can be very difficult. On the other hand, we can chew through these problems with the same four-step method that we used for the Monty Hall problem:

1. Find the sample space.
2. Define events of interest.
3. Compute outcome probabilities.
4. Compute event probabilities.