Welcome to this first course in the Design and Analysis of Algorithms. I imagine many of you are already clear on your reasons for taking this course, but let me begin by justifying the course's existence and giving you several different motivations for learning about algorithms. So what is an algorithm? Well, we're not going to need a precise definition in this course, but essentially, an algorithm is a well-defined set of rules, a recipe, in effect, for solving some kind of computational problem. So, for example, maybe you're given a bunch of numbers and you want to rearrange them into sorted order. Maybe you're given a road network with an origin and a destination, and you want to compute the shortest path from point A to point B. Maybe you're given a bunch of tasks with deadlines, and you want to know whether or not it's feasible to accomplish all of those tasks by their respective deadlines. So, why study algorithms? Well, first of all, understanding the field of algorithms, and also the related field of data structures, is crucial for doing serious work in pretty much any other branch of computer science. That's the precise reason why, here at Stanford University, this is the course that's required for all of the degrees that the computer science department grants. Be it a bachelor's, a master's, or a Ph.D. degree, we insist that you have mastery of the field of algorithms. So what are some examples of uses in the rest of computer science? Well, if you're doing routing in a communication network, that piggybacks on classical shortest-path algorithms. The effectiveness of public-key cryptography really rests on that of number-theoretic algorithms. In, say, computer graphics, you need the computational primitives that are supplied by the study of geometric algorithms. Database indices rely on balanced search-tree data structures, as covered in this course. Computational biology uses dynamic programming algorithms to measure similarity among genomes. And the list goes on and on. A second reason to study algorithms is that they play a key role in modern technological innovation. Obviously, I could give any number of examples here; let me just state one super-obvious one, which is that search engines use a tapestry of algorithms to efficiently compute the relevance of various web pages. The most famous such algorithm, which you may have heard of, is the PageRank algorithm used by Google. In a December 2010 report to the United States White House, the President's Council of Advisors on Science and Technology argued that, "in many areas, performance gains due to improvements in algorithms have vastly exceeded even the dramatic performance gains due to increased processor speed," as you'd be familiar with in the form of Moore's Law. Third, although this is getting significantly outside the scope of this course, algorithms are increasingly being used to provide a novel lens on processes outside of computer science and technology. For example, the study of quantum computation has provided a new and computational viewpoint on quantum mechanics. Price fluctuations in economic markets can be fruitfully viewed as an algorithmic process. And even evolution can be usefully thought of as a surprisingly effective search algorithm. The last two reasons I'm gonna give you might sound a little flippant, but I think there's more than a grain of truth to both of them. Now, I don't know about you, but back when I was a student, my favorite classes were always the challenging classes.
But, after I struggled through them, I somehow felt like I had a few more IQ points than when I started. So, I hope this course provides a similar experience for many of you: that, on the one hand, it's a bit of a struggle -- you find the concepts challenging -- but perhaps you feel just a tinge smarter after we're done. Finally, I hope that by the end of the course, a constant fraction of you will agree with me that designing and analyzing algorithms is simply fun. It's an endeavor that requires a rare blend of creativity and precision. And it can certainly be frustrating at times. But even more than that, it is addictive. So let's now descend from these lofty generalities and get much more concrete. And also, let's remember that we've all been learning and using algorithms since we were little kids. So once upon a time, in roughly 3rd grade or so, you learned how to multiply two numbers. Now, you probably weren't thinking in these terms at the time, but multiplying two numbers is certainly a well-defined computational problem, and that procedure you learned back in 3rd grade or so is indeed an algorithm. So let's just make that a little bit more precise. In this computational problem, we're given as input two numbers, let's say with n digits each. And to make things interesting, why don't you think of n as being really quite large, say in the thousands. Maybe we're implementing an algorithm that's going to be used in a cryptography application, where you need to keep track of really quite large numbers. So if we call the two input numbers X and Y, the problem is simply to compute their product, X times Y. So, a quick digression: I'll certainly be the first to admit that my handwriting is not the greatest. I got a C in penmanship back in elementary school, and I think the teacher was being a little generous. It's an acquired taste, but trust me, you will get used to it. Okay, back to integer multiplication. Now, when we talk about procedures for multiplying two numbers, we're gonna be interested in counting how many steps are required in order to execute the multiplication. So how do we count a step? We'll talk more about this later, but for multiplying two numbers, let's just call a step the addition or multiplication of two single-digit numbers. So let's review the integer multiplication algorithm that we learned back in grade school just by working through a concrete example. Specifically, let's take n = 4, so let's look at two 4-digit numbers. Let's say 5678 and 1234. Now, as you'll recall, the procedure we learned way back when was just to take each digit of the bottom number and multiply it by each digit of the top number, and then to take each of those n partial products and add them up. So, for example, you start with the 4: you multiply it by 8, you get 32; write down the 2, carry the 3. 4 times 7 is 28, add the 3, you get 31; write down the 1, carry the 3, and so on. So that gives you this first partial product, 22712. Then you do a shift -- effectively you put a 0 in the final digit -- and you repeat that procedure using the next digit, the 3. So again, 3 times 8 is 24; write down the 4, carry the 2, and so on. And you can compute the final two partial products using the 2 and the 1. And having computed all of the partial products, you just add them up to get the final product.
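To make the procedure concrete in code, here is a minimal Python sketch of the grade-school algorithm, working digit by digit just like the worked example. The function name and the primitive-operation counter are my own illustration, not something from the lecture, and the counting is only approximate (it lumps the carry additions and the final column additions together in a rough way).

```python
def grade_school_multiply(x, y):
    """A sketch of the grade-school algorithm: multiply non-negative
    integers x and y digit by digit, keeping a rough count of primitive
    operations (single-digit multiplications and additions)."""
    xs = [int(ch) for ch in str(x)][::-1]    # digits of x, least significant first
    ys = [int(ch) for ch in str(y)][::-1]    # digits of y, least significant first
    ops = 0
    partial_products = []
    for shift, d in enumerate(ys):           # one partial product per digit of y
        carry, digits = 0, [0] * shift       # the shift appends trailing zeroes
        for e in xs:
            prod = d * e + carry             # one single-digit multiply plus a carry add
            ops += 2
            digits.append(prod % 10)
            carry = prod // 10
        if carry:
            digits.append(carry)
        partial_products.append(int("".join(map(str, digits[::-1]))))
    total = 0
    for p in partial_products:               # add up the n partial products
        total += p
        ops += len(str(p))                   # roughly one single-digit add per column
    return total, ops

# For the example above, grade_school_multiply(5678, 1234) computes the partial
# products 22712, 170340, 1135600, and 5678000 and returns the product 7006652
# together with the rough operation count.
```

One handy feature of the counter: if you double the number of digits in the two inputs, it roughly quadruples, which is one way to see the quadratic growth discussed next.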
Now, the question I'm interested in is: how much work, how many primitive operations, did we do to multiply these two numbers? And, more generally, how many does it take to multiply two n-digit numbers, as a function of n? Well, just to get a ballpark view of what's going on: we started with two n-digit numbers, and at the end of the day we basically filled out a grid of size roughly n by n, give or take a little bit. So just in ballpark terms, it seems that multiplying two n-digit numbers requires essentially a quadratic number of operations -- a small number of operations to fill in each entry of the grid, and the grid is roughly n by n, so that's roughly n^2 operations. In a little more detail, we can look at each of the partial products separately. To compute, say, the first partial product, what did we do? We took the 4 and multiplied it by each of the n digits on top. We also did some carries, so that affects things a little bit, but in the end we did somewhere between n and 2n primitive operations to compute this first partial product. And that's true, of course, for each of the n partial products. So that's roughly n operations for each of the n partial products, or roughly n^2 operations to compute all of the partial products. Then we have to add them all up at the end, but that takes only an extra constant times n primitive operations to do all of those additions. So, summarizing: overall, the number of primitive operations required to multiply two n-digit numbers using this procedure grows quadratically with the length n. So, for example, if you double the length of the two numbers, you expect to do roughly four times as many primitive operations if you use this iterative algorithm for multiplying two numbers. Now, depending on what type of 3rd grader you were, you might well have accepted this procedure as being the unique, or at least the optimal, way of multiplying two n-digit numbers together. And if you want to be an expert algorithm designer, that kind of obedient attitude is something you're gonna have to learn to discard. Here's a favorite quote of mine. It's from a quite early textbook in the field, The Design and Analysis of Computer Algorithms by Aho, Hopcroft, and Ullman. After highlighting a number of algorithm design techniques, they conclude by saying, "perhaps the most important principle for the good algorithm designer is to refuse to be content." So that's a beautifully accurate quote. I might rephrase it more succinctly by just saying: if you want to be a good algorithm designer, you should adopt the following mantra -- you should always be asking, "can we do better?" This is a particularly appropriate question to ask when you're faced with some kind of naive or obvious algorithm for solving a problem, like the 3rd-grade algorithm for integer multiplication. This will come up over and over again: we'll see a problem, there will be an obvious solution, but with some extra algorithmic ingenuity, by detecting subtle structure in the problem, we'll be able to do significantly, qualitatively better than the naive or obvious solution. So let's apply this cocky attitude to the problem of multiplying two integers. Let's just suppose, as a working hypothesis, that there is some procedure which is better than what we learned back in elementary school. What could it possibly look like?
Well, as someone with some programming experience, you know that there are not only iterative algorithms -- iterative programs, like the one we just outlined for integer multiplication -- but also recursive programs. These are programs that call themselves, typically on an input of a similar form but of smaller size. So, as a working hypothesis, let's imagine that there's some interesting recursive approach to multiplying two integers. What must such an algorithm look like? Well, it must somehow fundamentally involve calling itself on smaller problems -- that is, on smaller numbers, numbers with fewer digits. So what kind of decomposition of numbers could be used to enable this kind of recursive approach? Well, if you think about it, there's a pretty natural way to break a number with a bunch of digits into a couple of numbers with fewer digits: just take the first half of the digits, the first n/2 digits, and regard that as a number in its own right, and take the second half of the digits and regard that as a number in its own right. So perhaps this decomposition will have some use in enabling a recursive attack on how to multiply two integers. So we're gonna investigate that in more detail on this next slide. Given X and Y, each with n digits, we're going to represent X in terms of its first half of digits, A, and its second half of digits, B. Similarly, we will write Y, the second input, in terms of its first half and second half of digits, C and D. And in this expansion, A, B, C, and D are all n/2-digit numbers. I'm just assuming for convenience here that n is even; this would extend to the case where n is odd in the natural way, where you break it into n/2 rounded up and n/2 rounded down. So in our previous example, where X was 5678 and Y was 1234, we would have A being equal to 56, the first two of the four digits of X, and B would be the other two digits, 78; similarly, C would be 12 and D would be 34 in the decomposition of Y. So now, in a purely mechanical way, we can expand the product X times Y in terms of this representation, in terms of these smaller numbers A, B, C, and D. X times Y, by this representation, is just the same thing as (10^(n/2) A + B)(10^(n/2) C + D). And if we expand this and group like terms, we get 10^n AC + 10^(n/2) (AD + BC) -- that's combining two terms from the expansion -- plus BD. And I'm gonna call this expression star (*). Now, what I want you to think about and make sure you understand is that this expansion, which I've called star, immediately suggests a recursive algorithm for multiplying two integers. Now, it's not clear whether that algorithm is going to be fast or slow, or whether this is a good idea or a crackpot idea, but certainly it's a legitimate recursive approach to multiplying two integers. Specifically, the relevant products in star -- namely AC, AD, BC, and BD -- all involve smaller numbers than what we started with. Those are all n/2-digit numbers, whereas our original inputs had n digits. So we can legitimately compute them recursively: we can invoke the same algorithm to compute each of them in a recursive way. Now, once we've invoked our multiplication algorithm recursively four times to compute these four relevant products, we can just compute the expression in star in the obvious way. We take AD and BC and add them together, using the standard grade-school iterative addition algorithm. Then we pad both AC and the result of that addition with a bunch of zeroes -- n zeroes or n/2 zeroes, as appropriate. Now we have the three constituent terms, and we just add them up, again using the grade-school algorithm, to compute the overall expression. So, to keep this intro lecture brisk, I haven't discussed the base case of this recursive algorithm. As hopefully you all know, recursive algorithms do need base cases: at some point, the recursion has gotta stop, otherwise your algorithm is never gonna terminate. So for integer multiplication, the base case is this: if you're passed as input two single-digit numbers, you multiply them in one primitive operation and return the result; if the numbers have two or more digits, you recurse in the way we described here.
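As a rough illustration of what such a recursive algorithm might look like in code, here is a hedged Python sketch of the four-recursive-call approach suggested by the star expansion. The function name is my own, it assumes non-negative integers, the multiplications by powers of 10 stand in for the zero-padding step, Python's built-in addition stands in for the grade-school additions, and the base case is simplified slightly so that either single-digit factor stops the recursion.

```python
def recursive_multiply(x, y):
    """Sketch of the four-recursive-call algorithm suggested by (*).
    Assumes x and y are non-negative integers."""
    if x < 10 or y < 10:                  # base case: a single-digit factor
        return x * y
    half = max(len(str(x)), len(str(y))) // 2
    a, b = divmod(x, 10 ** half)          # X = 10^(n/2) * A + B
    c, d = divmod(y, 10 ** half)          # Y = 10^(n/2) * C + D
    ac = recursive_multiply(a, c)         # four recursive calls on smaller numbers
    ad = recursive_multiply(a, d)
    bc = recursive_multiply(b, c)
    bd = recursive_multiply(b, d)
    # Reassemble via (*); multiplying by 10**(2*half) and 10**half is the
    # zero-padding step, and it also handles odd n correctly.
    return 10 ** (2 * half) * ac + 10 ** half * (ad + bc) + bd
```

For instance, recursive_multiply(5678, 1234) splits the inputs into A = 56, B = 78, C = 12, D = 34 and returns 7006652, agreeing with the grade-school computation.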
So, who knows whether this recursive algorithm is a good idea or not? It's totally not obvious; you should not necessarily even have any intuition about how it compares to the iterative algorithm you learned back in elementary school. So, in lieu of answering that question -- although we will answer it, several lectures hence -- I want to show you a second, still more clever, recursive algorithm for integer multiplication. This goes by the name of Karatsuba multiplication, after its inventor, although really the key point is a trick pointed out by the mathematician Gauss in the early 19th century. So, to explain this algorithm, let me write once again our expansion in terms of the n/2-digit numbers: the product X times Y, which we wrote as 10^n AC + 10^(n/2) (AD + BC) + BD. Now, in the previous approach, in recognition of the four products that we see in this expression, we made four recursive calls. So here's the trick, and this is really what Gauss figured out over 200 years ago: it seems there are four products, but fundamentally, in this expression, there are really only three quantities that we care about. There's AC, the coefficient of the first term; there's AD + BC, the coefficient of the middle term; and there's BD. We care about AD and BC as individual quantities only inasmuch as we care about their sum; we really don't care about them individually. So that motivates the question: perhaps we can somehow uncover these three quantities using only three recursive calls rather than four. In fact we can, and that's Karatsuba multiplication. To be more specific, our first two recursive calls are shared with the previous recursive algorithm: we go ahead and compute AC and BD recursively. The third step is the clever one: we recursively compute the product of (A + B) and (C + D). Now, expanding, what are we going to get when we recursively compute this product? We're going to get AC + BD + AD + BC. And here is Gauss's trick, or observation: take the result of the third recursive call and subtract from it each of the first two. What happens? Well, the AC is going to cancel the AC, the BD is going to cancel the BD, and we'll be left with AD + BC, which is exactly the middle coefficient that we cared about. So we can indeed compute the sum of AD and BC without separately computing each of them, and that's exactly what we wanted. So what does this buy us? This gives us a second recursive algorithm for multiplying two integers, one that makes use of fewer recursive calls -- only three recursive calls, rather than four.
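Here is how those three recursive calls might fit together in code -- again a hedged Python sketch of my own, with an assumed function name, non-negative integers, and Python's built-in arithmetic standing in for the grade-school additions, subtractions, and zero-padding, rather than the lecture's definitive implementation.

```python
def karatsuba(x, y):
    """Sketch of Karatsuba multiplication: three recursive calls plus
    Gauss's trick.  Assumes x and y are non-negative integers."""
    if x < 10 or y < 10:                  # base case: a single-digit factor
        return x * y
    half = max(len(str(x)), len(str(y))) // 2
    a, b = divmod(x, 10 ** half)          # X = 10^(n/2) * A + B
    c, d = divmod(y, 10 ** half)          # Y = 10^(n/2) * C + D
    ac = karatsuba(a, c)                  # recursive call 1
    bd = karatsuba(b, d)                  # recursive call 2
    ab_cd = karatsuba(a + b, c + d)       # recursive call 3: (A + B)(C + D)
    ad_plus_bc = ab_cd - ac - bd          # Gauss's trick recovers the middle coefficient
    return 10 ** (2 * half) * ac + 10 ** half * ad_plus_bc + bd

# Sanity check against the running example:
# karatsuba(5678, 1234) == 5678 * 1234 == 7006652
```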
And as before, there's a little bit of work to do beyond the recursive calls, but not much: you just have to do some padding by zeroes and add up the results. So there are two points I want you to appreciate about all these integer multiplication algorithms. The first point is that the algorithm design space is incredibly rich -- much richer than you might initially have had intuition for. Multiplying two integers: what else could you possibly do other than the elementary-school algorithm? Well, what have we seen? You can do a lot of other things. In particular, the second recursive algorithm looks nothing like what we learned back in grade school. And you'll see this over and over again: with sufficient ingenuity, sufficient cleverness, you can solve problems in ways much different from, and sometimes much better than, the naive, straightforward algorithm. The second takeaway point from this discussion of integer multiplication algorithms is that sometimes you have interesting algorithmic ideas for which it's totally not obvious what kinds of properties, or what kinds of performance guarantees, those algorithms enjoy. So in particular, confronted with these three different integer multiplication algorithms, how can we not wonder which of the three is the best? Which of them requires the fewest primitive operations to multiply two integers? In particular, does one of these novel recursive approaches do better than the algorithm we learned back in elementary school? The answer to that question is totally non-obvious, and it requires non-trivial mathematical analysis to answer. In the upcoming lectures, I'll provide you with the tools not only to precisely understand and answer this question, but more generally to predict the running times of an entire genre of divide-and-conquer algorithms, like Karatsuba multiplication. So, stay tuned.