一定愛配于天立教授開放課程食用:課程連結

Data Storage(II)

Fraction in binary

8 bits representation
- Ex (Use the table of binary): 10010101 => 1(001)(0101) => ()(+)=-
On the widely used 64-bit computers，the exponent takes 11 bits，and the mantissa takes 52 bits

The precision is beyond the limitation of mantissa.
- Ex (Use the table of binary): 2 -> 10.101(base two，fixed point) => .10101 => 0(010)(1010) => 2 ## Normalized form
- The first bit of mantissa is 1
- 0's floating-point representation is all 0
Normalization:
- Ex (Use the table of binary): 00100011 => 0(010)(0011) => .0011 => .1100 => 0(000)(1100)
IEEE normalized form
- The left-most bit in mantissa is always 1 (Ex: .0101 -> 1.0101)
- Standard normalized form is (s)(eee)(mmmm) => () 1.mmmm
  - Ex(Use the table of binary): 01100011 => (0)(110)(0011) => 1.0011

Ex (Use the table of excess):

4 + + = (01111000 + 00111000) + 00111000 -> make the exponent the same (01111000 + 01110000) + 01110000 = 4 (the result is wrong)
4 + + = 01111000 + (00111000 + 00111000) = 4

Images:
1. GIF: 256 colors, dictionary encoding
2. JPEG: Lossy / lossless encoding
Audios:
1. MP3: Lossy encoding
Videos:
1. MPEG: Lossy encoding

The reason of compressing data is to remove redundency
To correct the communication error => we add redundancy
Error detection => instead of correcting errors, it can only check if the errors occurs
- Applications(detection):
  1. ID numbers
  2. ISBN
  3. Parity code
Error correcting => Can correct errors to some degree

Taiwan ID:
1. Convert the first English letter into a number(xy):
2. = x + 9y
3. = i = 1 + 2 ... +8
4. Check code => = 10 - (( + ) mod 10)
ISBN-10 ISBN　template: 0-273-75139
1. Compute S = 0 10 + 2 9 +7 8 + 3 7 + 7 6 + 5 5 + 1 4 + 3 3 + 9 2 = 193
2. M = S mod 11 = 6
3. N = 11 - M =5
  - If N = 10 the check code is X
  - If N = 11 the check code is 0
  - Otherwise, the check code is the number N
4. The ISBN code 0-273-75139-5
  - Parity Bits
    1. Making the quantity of 1s odd
    2. The technique is used in communication and RAID
An error-correcting code(ECC)
1. (3, 1) repetition code (can correct 1-bit errors) -> seperate the data into groups with 1 bit in each group, and add two bits to authenticate the data
Hamming distance
1. Comparing two binary data， hamming distance is the amount of different bits.
Error correction with hamming distance
1. Maximizing hamming distance along the symbols
  - Sample code book:
  Ex: received 010100 * Hammig distance:
  According to the chart, we will correct 010100 to 011100(D)
Generating (7, 4)Hamming code