Math 55 - Fall 2007 - Lecture notes # 11 - Sep 21 (Friday)

   Keep reading Chapter 3 (sections 3.4 though 3.7)
   We will not cover section 3.8 (matrices - topic of Ma 54)

   Goals of these sections: Integer algorithms and number theory, basis for:
             how to generate hash tables 
             how to generate random numbers
             how computer arithmetic works (hardware or software)
             how to do encryption/decryption 
                (keeping your passwords safe!)

   Next goals: Basic properties of primes
               greatest common divisor gcd(a,b)
               division algorithm
               hash tables
               random numbers
               Euclidean algorithm for gcd 

   Primes
   DEF: if a and b are integers, a != 0, say a|b if a divides b, ie.
        exists integer f such that b=a*f; else say a /| b
    EX: 2|2, 2|100, 1| anything, anything | 0, 2 /| 1001, 3 | 111111, 9 | 72252
        (reminder of rules for whether 3|a, 9|b, proofs later)
    Thm: a|b and a|c => a|(b+c)
         Proof: a|b <=> b=a*f1 for some f1, a|c <=> c=a*f2 for some f2,
                so a|b and a|c -> b+c=a*(f1+f2) -> a|b+c
    Thm: a|b => a|b*c; a|b and b|c => a|c
 ASK&WAIT: why?
    DEF a positive integer p>1 is a prime if the only positive integers which
        divide it are 1 and p; else composite
    EX: 2,3,5,7,11,13,... are prime
    Theorem (Fundamental Theoremm of Arithmetic): every positive integer
      has a unique prime factorization, where the factors are written in
      increasing order.
    Proof: wait till we learn induction in Chapter 4
    EX: 100 = 2*2*5*5 = 2^2 * 5^2; 1024 = 2^10
 ASK&WAIT: how many primes are there? Why?

    DEF Suppose a and b are integers, not both 0.
        gcd(a,b) = greatest common divisor of a and b, is 
                 = largest integer d such that d|a, d|b
 ASK&WAIT:  why exclude a=b=0?
 ASK&WAIT:  gcd(6,9)?, gcd(1,101)?, gcd(0,234)?, 
     DEF if gcd(a,b)=1, we say a and b are relatively prime.
 ASK&WAIT suppose a = 2^5  * 3^2  * 5^1
                  b = 2^4  * 3^3  * 5^2 
             what is gcd(a,b)? 
          suppose a = 2^n2 * 3^n3 * 5^n5
                  b = 2^m2 * 3^m3 * 5^m5
             what is gcd(a,b) =? 

     First algorithm for computing gcd(a,b):
        1) factor a = 2^n1 * 3^n2 * 5^n3 * ...
        2) factor b = 2^m1 * 3^m2 * 5^m3 * ...
        3) let  gcd = 2^min(m1,n1) * 3^min(m2,n2) * 5^min(m3,n3) * ...
     Later: a (much faster!) algorithm to compute gcd(a,b), 
            without factoring a and b

    DEF a,b, postive integer, lcm(a,b) = least common multiple
        = smallest positive integer divisible by a and b
 ASK&WAIT   EX: lcm(6,9)?, lcm(1,101)? 
 ASK&WAIT   Suppose a = 2^5  * 3^2  * 5^1
                    b = 2^4  * 3^3  * 5^2 - what is lcm(a,b)? 

     Algorithm for computing lcm(a,b):
        1) factor a = 2^n1 * 3^n2 * 5^n3 * ...
        2) factor b = 2^m1 * 3^m2 * 5^m3 * ...
        3) let  gcd = 2^max(m1,n1) * 3^max(m2,n2) * 5^max(m3,n3) * ...

 ASK&WAIT   What is gcd(a,b)*lcm(a,b)?

    Theorem (division algorithm) given integers a, d>0 (divisor), there is a
        unique q (quotient) and r (remainder) such that 0<=r<d, a=q*d+r
 ASK&WAIT:  is this an algorithm? 

    DEF a integer, d>0, then a mod d = r, remainder after dividing a by d
   Note: in C,C++, this is written a%d
   EX:  7 mod 3 = 1, since 7=1+2*3; 3 mod 7 = 3; anything mod 1 = 0.
 ASK&WAIT:  what is 87813134 mod 1000 
 ASK&WAIT:  what is 27 mod 8 = 11011_2 mod 2^3 

 Application of Division Algorithm: Hashing functions. 
    A "hash table" is a data structure
    where you can store data and search for it (usually) very quickly  (CS61B)
    EX: You want to store records (student ID#, name, grades) in a data base, 
        and look up records quickly given the student ID#, an 8-digit integer. 
        We could have a table of length 10^8,
          array Student[100000000]
        where Student(i) contains the record with the name, grades of student i
        and just look at entry i to find data for student i.
        But this is too large a table, since it is too large to fit in memory, 
        and since there are many fewer students than 10^8. Instead we use a 
        smaller table (say size 10^5 for Berkeley, enough to hold all students, 
        and a little more) and do the following:

          array Student[100000]
          a = f(i)    ... compute address a in table of record of student i
          record = Student[a]

        Hash function f(i) needs to map 8 digit integers to 5 digit integers,
        to look up in table of length 10^5. It should spread the data
        out across the table evenly, to use the whole table.
   
        Simple function to use: f(i) = i mod 10^5 
ASK%WAIT: If you know i as a decimal number, what is i mod 10^5?
        More generally, for a table of length m, we would use f(i) = i mod m
        EX: data for student i = 87654321 stored at address a=f(i)=54321
ASK&WAIT:   what happens if two students i and j have same a=f(i)=f(j)?

 Application: Random number generation, or how "rand" function works in C,C++

    Random number generation means producing a sequence of integers 
    x(1), x(2), ... all in the range [0,N-1], where each x(i) is chosen 
    ``at random''. For example, if N=6, we could roll a die to get each x(i): 
    each value from 0 to N-1 is equally likely, and each x(i) is 
    "independent" or unrelated to all other x(j). We want an algorithm
    to produce such a sequence efficiently.

    Uses: 1) game programs (so game different each time)
          2) many fast algorithms (quicksort)
          3) programs to simulate real world events which occur at random:
               simulate data traffic in new network design
               simulate elevator traffic in new building design
               simulate bits of fluid in a turbulent air flow between
                  a disk head and a disk surface in a new disk design or
                  over an airplane wing in a new airplane design
          4) related idea used for hash functions
   
    We will use a simple algorithm (based on division algorithm) 
    to generate x(i+1) from x(i), so x(i) is not random in sense of 
    rolling dice (since it is easy to predict x(i+1) from x(i), if you
    know the formula used in the algorithm, but it will "look random", 
    and be good enough for purposes described above. 
ASK&WAIT:  Why not use a really random function to generate x(i), 
           e.g. counting ticks on a Geiger counter, or looking at
           certain stock exchange data (eg 3rd lowest digit of volume,
           used in "numbers racket" - illegal gambling)
    The formula is x(n+1) = a*x(n) mod n, 
    This is called a linear congruential method, because the formula
    involves a linear function a*x(n) and a congruence (or mod)
    EX: x(n+1) = 3*x(n) mod 7 yields 1 3 2 6 4 5 1 3 2 6 4 5 ...
    Thm: Any linear congruential random number generator generates
         a periodic sequence, i.e. it eventually repeats the
         same sequence over and over
ASK&WAIT: Why? How long can the random sequence be before it repeats?
    EX: x(n+1) = 4*x(n) mod 7 yields 1 4 2 1 4 2 ...
        So choice of a, n important 
    EX: Bad choice: n=1000, a=541, x(1)=347; 
             only 49 different values of x(i), so period only 49,
             not very random looking
        Better choice: n=997, a=541, x(1)=347; 
             get all 997 possible different numbers before it repeats.
             hard for a human to see a pattern, still easy for a computer
        Good choice: n=2^15-1, a=7^5, x(1)=347; 
             much better, period 2^15-1, looks really random
    For how to pick a,n well see Knuth, "Art of Computer Programming", vol 2


 Application of Division Algorithm:
    Computing the gcd(a,b) using the Euclidean algorithm
    (much cheaper than factoring a and b into prime factors)

         % assume a and b nonnegative, at least one nonzero
         x = a  
         y = b  
         while y != 0
           r = x mod y
           x = y
           y = r
         end while
         return(x)

     EX: gcd(14,10)
           x = a = 14, y = b = 10
           Loop 1: r=14-1*10=4;   Loop 2: r=10-2*4=2;   Loop 3: r=4-2*2=0;
                   x=10;                  x=4;                  x=2;
                   y=4;                   y=2;                  y=0;  return(2)

       Proof of correctness of algorithm:
        two things to prove: 1) that it terminates in a finite 
                                number of steps and 
                             2) that it returns the right answer
         Proof of 1):
            After each pass through the while-loop, x and y get replaced 
              by new values.
            In particular, y gets replaced by r=x mod y
            By the definition of mod, 0 <= r < y, so the new value of y is
              strictly less than the old value of y, and at least 0
            Since y keeps decreasing, and is >= 0, it must eventually hit 0,
              at which point the while loop stops
ASK&WAIT:  What is an upper bound on the number of time we go around the loop?

         Proof of 2):
           We will do this in three steps:
           1) we will show that gcd(x,y) at the end of the loop is the same 
              as gcd(x,y) at the end of the loop, after x and y are updated
           2) when we finally exit the loop, (x,y) = (x,0), and so
              gcd(x,y) = gcd(x,0) = x, which is what we return
           3) Therefore gcd(a,b)  = gcd(x,y) before start of loop
                                  = gcd(x,y) after one pass through loop
                                  = gcd(x,y) after two passes through loop
                                  = ...
                                  = gcd(x,y) after last pass through loop
                                  = gcd(x,0) since y=0 when loop terminates
                                  = x, which is what we return

          To finish proof, need to prove step 1):
          Lemma: gcd(x,y) = gcd(y, x mod y)
          Proof: Let r = x mod y, so x = q*y+r, 0 <= r < y.  We will show that 
                 d|x and d|y if and only if d|y and d|r. Thus x and y
                 have the same set of common divisors as y and r.
                 In particular they must have the same greatest common divisor.
                 First suppose d|x and d|y, then we have to show d|y and d|r.
                    d|y is easy, and since r=x-q*y, d|r too.
                 Second suppose d|y and d|r, then we have to show d|x and d|y.
                    d|y is easy, and since x = q*y+r, d|x too.

   Note: Since gcd(x,y) stays the same after each pass through the loop, we
         call gcd(x,y) a "loop invariant". Finding a loop invariant is a
         common proof technique for proving programs compute the right answer.

  EX: Find integers s and t so that s*10 + t*14 = gcd(10,14) = 2
      More generally, we can always find s and t so that s*a + t*b = gcd(a,b)

ASK&WAIT: Guess s and t 

      More systematically, work forwards through Euclidean algorithm,
         finding integers ax and bx so x = ax*a + bx*b
         and     integers ay and by so y = ay*a + by*b
         at the end of each loop iteration

      x = a = 14; y = b = 10;
      Loop 1: r=14-1*10=4;     Loop 2: r=10-2*4=2;   Loop 3: r=4-2*2=0;
              x=10;                    x=4;                  x=2;
              y=4;                     y=2;                  y=0;  return(2)
      Start of Loop 1: x = a = 1*a + 0*b  y = b = 0*a + 1*b
                             = ax*a+bx*b        =ay*a +by*b
      End of Loop 1:   x = y = 0*a + 1*b  y = r = x-1*y = (1*a+0*b)-1*(0*a+1*b)
                                                        = 1*a -1*b
                             = ax*a+bx*b                =ay*a +by*b
      Start of Loop 2: x and y are same as at end of Loop 1
      End of Loop 2:   x = 1*a -1*b       y = r = x-2*y = (0*a+1*b)-2*(1*a-1*b)
                                                        = -2*a+3*b
                         = ax*a+bx*b                    = ay*a +by*b
      Start of Loop 3: x and y are same as at end of Loop 3
      End of Loop 3:   x = -2*a+3*b       y = 0
                         = ax*a+bx*b                     
      Finally, s=ax=-2 and t=bx=3 satisfy gcd(a,b)=s*a+t*b as desired.

ASK&WAIT:  If x = ax*a + bx*b and y = ay*a + by*b at the beginning of the
           loop body, what are they at the end?
           Notation: use x,y,ax,bx,ay,by to mean values at start of loop,
                         x',y',ax',bx',ay',by' to mean values at end of loop,

 Fact: cost of algorithm is O(log min(x,y)) (proof later)
       much less than factoring!
 Will use gcd later again.