CS2107 Introduction to Information Security

CS2107 Introduction to Information Security

Topic 0 Overview

Topic 1 Encryption

Attack Model

Classical Ciphers

Substitution Cipher

Permutation Cipher

One Time Pad

Modern Ciphers

Symmetric vs Asymmetric Key Cryptography

Block Cipher

Stream Cipher

Examples of Attacks

Triple DES & Meet-in-the-middle

Padding Oracle Attack

Cryptography Pitfalls

Topic 2 Authentication Credential

Topic 3 Autheticity (Data Origin)

Public Key Cryptography

Classroom RSA

Cryptographic Hash

Keyed Hash

Application of Unkeyed Hash

Application of Keyed Hash

Example of Attacks

Birthday Attack

Pitfalls

Topic 4 Part 1 Public Key infrastructure

Topic 4 Part 2 Channel Security

Topic 5 Network Security

Topic 6 Web Security

Topic 7 & 8 Secure Programming

Topic 9 Access Control

Access Control Model

Specifying Access Rights

Intermediate Control

Tutorial Terminologies

Topic 0 Overview

C-I-A triad

Confidentiality	Integrity	Availability
Anonymity, Privacy, Covert Channel	Non-Repudiation, Authenticity	People who need access to data can get it.

Other

Accountability
Traitor-Tracing
Plausible deniability

Topic 1 Encryption

Attack Model

Attackere's Goal	Description
Total Break	The attacker wants to find the key.
Partial Break	The attacker may want to decrypt a ciphertext but not interested in knowing the secret key, or the attacker may simply want to extract some information about the plaintext.
Distinguishability	With some non-negligible probability more than $\frac{1}{2}$ , the attacker can correctly distinguish the ciphertexts of a given plaintext from the ciphertext of another given plaintext.

If attacker can ahieve total break, the attacker also can achieve partial break and disringuishability.
We want to design a secure system that can prevent attacker from achieving the weakest goal (distinguishability).

Attacker's Capability	Description
Ciphertext only attack	The attacker is given a collection of ciphertext. The attacker may know some properties of the plaintext. (The attacker can't choose the plaintext.)
Known plaintext attack	The attacker is given a collection of plaintext and their corresponding ciphertext. (The attacker can't choose the plaintext.)
Chosen plaintext attack (CPA)	The attacker has access to an oracle. The attacker can choose and feed any plaintext to the oracle and obtain the corresponding ciphertext (all encrypted with the same key). The attacker can access the oracle many times, as long as within the attacker's compute power. He can see the ciphertext and then choose the next input. We call this black-box an encryption oracle.
Chosen ciphertext attack (CCA2)	Same as chosen plaintext attack, but here, the attacker chooses the ciphertext and the black-box outputs the plaintext. We call the black-box a decryption oracle.

There are practical scenarios where the attacker has access to a weaker form of decryption oracle. Example: Padding Oracle
If a cipher can defend against decryption oracle, then the cipher can defend against all other weaker forms.

Classical Ciphers

Substitution Cipher

Encryption

Plaintext	Key	Ciphertext
$X = x_{1} x_{2} x_{3} \dots x_{n}$	$S$ (a substitution table, 1-1 onto function)	$E_{s} (X) = S (x_{1}) S (x_{2}) S (x_{3}) \dots S (x_{n})$

Decryption:

Ciphertext	Key	Plaintext
$C = c_{1} c_{2} c_{3} \dots c_{n}$	$S$	$D_{s} (C) = S^{- 1} (c_{1}) S^{- 1} (c_{2}) S^{- 1} (c_{3}) \dots S^{- 1} (c_{n})$

Attack:

Exhaustive Search
- Consider a substitution cipher with table size 27.
- Then the size of key space is
  $27!$ , which needs approximately
  $2^{94}$ loops.
- Infeasible using current compute power.
Ciphertext only attack (💥)
- Given the ciphertext and the attacker knows the plaintext is in English letter, the attacker can use exhaustive search.
- But there are more efficient method: frequency analysis attack
- The frequency of letters used in English is not uniform.
- Given a sufficiently long ciphertext, the attacker may guess the plaintext by mapping frequent characters in the ciphertext to the frequent character in English.
Known plaintext attack (💥)
- Given a plaintext and the corresponding ciphertext, the attacker can figure out the replacement of the characters.
- With sufficiently long ciphertext, the full table can be found.
- Example: X = hello_world, C = hnllqpoqilb, then we know
a b c d e f g h i j k l m n o p q r s t u v w x y z _

b n h l q i o p

Remark

Performing substitution twice using two tables does not increase difficulty of attack. It simply reduces to one table.

Permutation Cipher

Encryption

Description: The plaintext is grouped into blocks of

t

characters, then apply a secret permutation to each block by shuffling the characters.

Key: The key is the permutation

p

and the block size

t

. We can write

p = (p_{1}, p_{2}, p_{3}, \dots, p_{t})

which indicate the character at position

i

is shifted to position

p_{i}

Decryption

Attack

Known plaintext attack (💥)
- Given a plaintext and the corresponding ciphertext, the attacker may figure out the key.
- Example: plaintext = aabbbbababaa, ciphertext = babaabbbabaa
- Image Not Showing Possible Reasons
  The image was uploaded to a note which you don't have access to
  The note which the image was originally uploaded to has been deleted
  Learn More →
- So, the key will be
  $t = 4$ ,
  $p = (4, 2, 1, 3)$
Ciphertext only attack (💥)
- similar to Substitution Cipher

Remark

Performing permutation twice using two permutations does not increase difficulty of attack. It simply reduces to one permutation.
By using both Substitution and Permutation Ciphers, the attacks become more difficult.

One Time Pad

Encryption

Plaintext	Key	Ciphertext
$x_{1}, x_{2}, \dots, x_{n}$	$k_{1}, k_{2}, \dots, k_{n}$	$(x_{1} \oplus k_{1}), (x_{2} \oplus k_{2}), \dots, (x_{n} \oplus k_{n})$

Decryption

Ciphertext	Key	Plaintext
$c_{1}, c_{2}, \dots, c_{n}$	$k_{1}, k_{2}, \dots, k_{n}$	$(c_{1} \oplus k_{1}), (c_{2} \oplus k_{2}), \dots, (c_{n} \oplus k_{n})$

Remark

The key cannot be re-used. That is, a key can only be used once. (For S and P, a same key is being used to encrypt multiple plaintexts.)
So, a 1 GB plaintext requires a 1 GB key to encrypt.
The long key renders one-time-pad impractical.
Attacker can derive the key if given the ciphertext and corresponding plaintext, but such key is useless, since it will not be used any more.
It is shown that one-time-pad leaks no information of the plaintext. (CS4236)
It is unbreakable.

Modern Ciphers

Designs of modern ciphers take into considerations of known-plaintext-attack, frequency analysis and other known attacks (except CC2).

We can quantify the security of an encryption scheme by the length of the key.

Example:

DES (Data Encryption Standard, 1977)
- The key length is 56 bits. Hence, the exhaustive search needs to loop for
  $2^{56}$ times in the worst case.
- Too short.
RC4 (Rivest's Cipher 4, 1987)
- Broken in some adoptions
AES (Advanced Encryption Standard, 2001)
- The block length is 128 bits. The key length can be 128, 192 or 256 bits.
- Currently, no known attacks on AES.

Symmetric vs Asymmetric Key Cryptography

Symmetric key cryptography
- uses the same key for both encryption and decryption
- used often in bulk encryption
- need a secure channel to establish the secret key for any two entities
- implemented as either block ciphers or stream ciphers
- Example: DES, RC4, AES
Asymmetric key cryptography
- a.k.a Public key cryptography (PKC)
- uses two different key for encryption and decryption
- used often in data authentication
- only need a secure broadcast channel to distribute the public key
- Example: RSA, ElGamal, Paillier, Post-Quantum cryptography
- More details: Public Key Cryptography

Block Cipher

A block cipher has fixed size input and output.
DES and AES are block cipher.
AES is designed for 128 bits (16 bytes) input and output.
For large plaintext, it will be divided into bloacks, then apply block cipher.
The method of extending encryption from a single block to multiple blocks is called mode-of-operation.
Mode of operation in this note: ECB, CBC, CTR, GCM

ECB (Electronic Code Book)

Plaintext is divided into blocks.
Block cipher is applied to each block, using same key.
But ECB leaks information.
Since blocks are encrypted using the same key, same blocks will be encrypted into same ciphertext.

CBC (Cipher Block Chaining)

Plaintext is divided into blocks.
The first block will XOR with the IV.
Next block will XOR with the previous encrypted block.
Encryption:
Initialization Vector (IV) is an arbitrary value chosen during encryption, and is different in different encryption.
IV will be included/sent with the ciphertext.
$y_{0} = IV$ ,
$c_{1} = E_{k} (x_{1} \oplus y_{0})$ ,
$c_{i} = E_{k} (x_{i} \oplus c_{i - 1})$ ,
$i > 1$
CBC also leaks information, similar reason with ECB.
Decryption:

CTR (Counter Mode)

Similar to CBC.
But, now IV is incrementing.
Encryption:

GCM

Details omitted in this course.
An "Authenticated-Encryption".
Ciphertext consists of an extra tag for authentication.
It is secure in the presence of decryption oracle.

Stream Cipher

Encryption:
1. Given plaintext,
  $m$ and secret key, randomly generate IV.
2. From secret key and IV, generate a long pseudorandom sequence,
  $K$ .
3. Ciphertext will made up of IV and
  $(K \oplus m)$ .
Decryption:
1. Given the key and the ciphertext with the IV.
2. Extracts the IV from the ciphertext.
3. From the key and IV, generates the long sequence.
4. Performs xor to get the plaintext.

Examples of Attacks

Triple DES & Meet-in-the-middle

2DES

If we want to improve DES, we can encrypt the plaintext twice or more, using different keys.
Let's consider this double encryption under known plaintext attack. Then the attacker has a plaintext and the corresponding ciphertext, and wants to find the two secret keys.
Using exhaustive search,
$2^{56 + 56}$ loops required, so the key-strength seems to increased to 112.
But it can be less with meet-in-the-middle attack.

Meet-in-the-middle Attack

Known plaintext attack.
Assume attacker has a pair of
$(m, c)$ plaintext and the corresponding ciphertext. Note:
$c = {DES}_{k 1} ({DES}_{k 2} (m))$
Process:
- Take the plaintext
  $m$ , encrypt it using all possible keys.
- Let
  $C$ be the set of all possible ciphertexts.
- Take the corresponding ciphertext
  $c$ , decrypt it using all possible keys.
- Let
  $M$ be the set of all possible plaintext.
- Find common element in
  $C$ and
  $M$ .
- The key used to encrpyt
  $m$ that results in the common element
  $X$ , will be
  $k_{1}$ .
- The key used to decrypt
  $c$ that results in the common element
  $X$ , will be
  $k_{2}$ .
For 2DES, the number of "guesses" are reduced to
$2^{56} + 2^{56} = 2^{56 + 1}$ .
In general, for
$k$ -bit keys, it reduces the number of crypto operations to
$2^{k + 1}$ using approx
$2^{k + 1}$ units of storage space.

3DES

a.k.a 3DES, TDES, TDEA, 2TDES, 3TDES
Since DES and 2DES are not secure enough, we use 3DES.
There are 2 versions:
- a)
  $E_{k_{1}} (E_{k_{2}} (E_{k_{1}} (p)))$
- b)
  $E_{k_{1}} (D_{k_{2}} (E_{k_{1}} (p)))$
Both versions are believed to have same level of security.
Version (b) can be more convenient.

Padding Oracle Attack

The block size of AES is 128 bits (16 bytes).
Suppose the length og the plaintext is 25 bytes, it will be fitted into two blocks.
The remaining 7 bytes will be padded with some values.
There are many ways to pad, but the number of padded bits must be encoded together, so the receiver will know the length of the plaintext.

PKCS#7

PKCS#7 is a padding standard.
Suppose the block size is 8 bytes, and the last block has 5 bytes, which mean 3 extra bytes needed.

If the last block has 8 bytes, an extra bloack is added where each byte is 08.

Padding Oracle Attack

Attacker has: ciphertext
$(iv, c)$ , where
$c$ is encrypted using key
$k$ .
Attacker's goal: obtain the plaintext.
A padding oracle will accept any cipher text, and output whether the plaintext is in correct padding format.

Assumption:
- Attacker has access to the padding oracle.
- Attacker knows how many bytes is padded.
AES CBC mode is not secure against padding oracle attack.
Process:
- Assume the block size is 8 bytes,
  $c$ is 1 block.
- Attacker wants to find out
  $x_{5}$ .
1. Calculate
  $iv'$ from
  $iv$ by xor with the last 4 bytes (see image), where
  $t$ is some value.
  - Why choose
    $07$ for last 3 bits:
    - Originally,
      $(v_{8} \oplus d_{k} (c_{8})) = 03$
    - Since we want
      $x_{5}$ and we know last 3 bits are
      $03$
    - We want to somehow make it x1 x2 x3 x4 04 04 04 04
    - We notice
      $(07 \oplus v_{8}) \oplus d_{k} (c_{8}) = 07 \oplus (v_{8} \oplus d_{k} (c_{8})) = 07 \oplus 03 = 04$
    - So choose
      $07$
2. Feeds the padding oracle with
  $(iv', c)$ .
3. If output YES, then
  $(t \oplus x_{5})$ must be
  $04$ , and we got the correct
  $t$ .
  - $(t \oplus v_{5}) \oplus d_{k} (c_{5}) = t \oplus (v_{5} \oplus d_{k} (c_{5})) = t \oplus x_{5}$
  - Since we choose
    $07$ ,
    $(t \oplus x_{5})$ must be
    $04$
  - Hence,
    $x_{5} = t \oplus 04$
4. If output NO, repeat steps 1-4 with different
  $t$ .

Remark

We can easily extend the algorithm to find the full plaintext.
It is easy to determine the initial padding length. (see tutorial question)
CTR mode also vulnerable to padding oracle.
Schemes that are secure against decryption oracle are also known as IND-CCA2 secure.
GCM, or other “authenticated encryption”, is believed to be IND-CCA2 secure, and thus secure against padding oracle attack.

Cryptography Pitfalls

Wrong choices of IV
Reusing one-time-pad
Predictable secret generation
Designing your own cipher
Reliance on Obscurity: Kerckhoffs’s Principle

A system should be secure even if everything about the system, except the secret key, is public knowledge.

Topic 2 Authentication Credential

Topic 3 Autheticity (Data Origin)

Public Key Cryptography

Public key scheme (PKS) uses different keys for encryption and decryption.
There is no need to clearly indicate which key is for encryption, which key is for decryption.
Both keys can be used to perform either encryption or decryption, but not both.
Public key is shared with everyone, private key is kept secret by user.
Public key can be used for encryption or decryption, depends on the context. (same goes for private key)
Without PKS, every individual would need to keep a symmetric secret key pair with everyone else.
- Keys needed for symmetric-key setting:
  $\frac{n (n - 1)}{2} = O (n^{2})$
- Keys needed for asymmetric-key setting:
  $2 n = O (n)$
Example: RSA, ElGamal, Paillier

Classroom RSA

We call it “classroom” because the variant used in practice is different, with padding and special considerations in choosing the primes, etc. Also called “textbook” RSA.

Setup:

Owner randomly chooses 2 large primes
$p$ ,
$q$ and computes
$n = p q$
Owner randomly chooses an encryption exponent
$e$ s.t.
$gcd (e, (p - 1) (q - 1)) = 1$
Owner finds the decryption exponent
$d$ where
$d \times e mod (p - 1) (q - 1) = 1$
Owner publishes
$(n, e)$ as public key, and safe-keeps
$d$ as the private key

Encryption:

Public key:
$(n, e)$
Given message
$m$ , ciphertext
$c = m^{e} mod n$

Decryption:

Private key:
$d$ , Decryption key:
$(n, d)$
Given ciphertext
$c$ , message
$m = c^{d} mod n$

Remarks:

Note that for any positive
$m < n$ , and any pair of public/private keys,
$Decrypt(Encrypt(m)) = m$
$(m^{e})^{d} mod n = m$
Can we compute
$p, q, d, e$ and
$modulo$ efficiently?
- To find random prime
  $p, q$ :
  - Randomly pick a number
    $p$
  - If
    $p$ is a prime, output
    $p$ and halts. Otherwise, repeat step 1-2.
  - The probability that the randomly chosen
    $p$ is prime is non-negligible. This is by the well-known "Prime Number Theorem". Probability of a randomly chosen 1024-bit number being a prime is around ln(21024)~ (1/710). So likely takes about 710 trials to get a 1024-bit prime number. Fast but too slow for real-time application.
  - There is fast algorithm to determine whether a number is a prime.
- Choose a small
  $e$ so that encryption can be done very fast. It turns out that the value of
  $e$ won’t affect the security. But it can’t be too small due to some attack. It is common to set
  $e = 65537$ .
- The value of
  $d$ can be efficiently computed from
  $e$ and
  $n$ using the extended Euclidean algorithm.
- Using Chinese Reminder Theorem, one can speedup decryption about 2x faster. Public key doesn’t need to be large (but not too small). To achieve speedup in encryption, some scheme choose a fixed small public key such as
  $65537$ , which is
  $2^{16} - 1$
We can use the decryption key
$d$ to encrypt, and encryption key
$e$ to decrypt. In other words, we can swap the role of
$d$ and
$e$ , so that anyone in the public can decypt, but only the owner can encrypt. This property is useful in desgining signature scheme.

Issues:

Security:
- Same as symmetric-key encryption, some forms of IV is required so that encryption of the same plaintext at different time would give different ciphertext.
- Classroom RSA has an interesting “homomorphic” property. These properties are useful in applications, but they also lead to attacks. Such properties can be destroyed using padding.
- The standard Public-Key Cryptography Standards (PKCS)#1, add “optimal padding”. https://en.wikipedia.org/wiki/PKCS_1
- RSA and "discrete log based" PKC will be broken with Quantum computer.
Performance:
- RSA is signigicantly slower than AES.
- A 128-bit AES has the same key strength as a 3072-bit RSA (NIST recommendation).
- If we want to encrypt a large file
  $F$ , it would be very slow to directly use RSA on
  $F$ . Instead, it is typically carried out in this way:
  1. Chooses a random AES 128-bit key
    $k$
  2. Encrypts
    $k$ using PKC to get
    $y$
  3. Encrypts
    $F$ using AES with
    $k$ as the key to get
    $C$ .
  4. The final ciphertext will be
    $(y, C)$

Cryptographic Hash

A hash is a function that takes in an arbitrarily long message as input and outputs a fixed-size digest.

Security Requirements:

Preimage resistant
- Given a digest
  $d$ , it is difficult to find a
  $m$ s.t.
  $h (m) = d$
Second-preimage resistant
- Given
  $m_{1}$ , it is difficult to find a second preimage
  $m_{1} \neq m_{2}$ s.t.
  $h (m_{1}) = h (m_{2})$
Collision-resistant
- It is difficult to find two different messages
  $m_{1} \neq m_{2}$ that "hashes" into the same digest, i.e.
  $h (m_{1}) = h (m_{2})$

Popular Hash:

Broken: SHA-0, SHA-1, MD5
Partially broken: SHA-2
No known attack: SHA-3

Keyed Hash

A keyed-hash is a function that takes an arbitrarily long message and a secret key as input, and outputs a fixed-size MAC.

Security Requirements:

Without knowing the key, it is difficult to forge the MAC

Popular Keyed-Hash (MAC):

CBC-MAC
HMAC

CBC-MAC

Based on AES operated under CBC mode.

HMAC

Baed on SHA.

|| means concatenation

Application of Unkeyed Hash

How do we know if a file we downloaded is authentic?

Unkeyed Hash for Integrity Protection

Let
$F$ be the original file.
We obtain the digest
$h (F)$ from the secure channel.
We then obtain the file
$F^{'}$ , whose origin claims that the file is
$F$ .
We can compute and compare the digests
$h (F)$ and
$h (F^{'})$ :
- If
  $h (F) = h (F^{'})$ , then
  $F = F^{'}$ (with high confidence)
- If
  $h (F) \neq h (F^{'})$ , then the file integrity is compromised

Assumptions:

There is a secure channel to send a short piece of information (digest).

Application of Keyed Hash

In the previous example of using unkeyed hash, we assume that there is a secure channel to send the digest. What if we don't have a secure channel to deliver the digest?

In such scenarios, we can protect the digest with the help of some secrets.

In the symmetric key setting, it is called the mac.
In the public key setting, it is called the digital signature.

Note that there is no issue on confidentiality. In fact, the data F can be sent in clear.
Typically, the mac is appended to F. Hence, mac is also called the authentication tag, or authentication code.

Signature

Asymmetric-key version of MAC.

Here, the owner uses the private key to generate the signature. The public can use the public key to verify the signature. So, anyone can verify the authenticity of the data, but only the person who know the private key can generate the signature.

Procedure:

The original file
$F$ is first hashed using a hash function.
The hash value is then passed through the signature function together with the private/signature key
$k_{pri}$ to produce signature
$s$ .
$s$ is then appended to
$F$ and sent to the recipient via an insecure channel.
The receipient then passes the received signature
$s^{'}$ and the public/verification key
$k_{pub}$ into the verification function.
The verification function will produce a hash value.
The file
$F^{'}$ is then pass through a hash function to return a hash value.
The hash values are compared to see if the file has been modified, or if the digital signature is valid.

Notice that in this setting, the private key is used to encrypt, while the public key is used to decrypt.

Signature scheme achieves Non-repudiation.

Why hash the file before signing?

For greater efficiency.
Since hash value is a unique representation of the data, it is sufficient to sign the hash instead of original file.

Popular Signature scheme:

RSA based: RSASSA-PSS, RSASSA-PKCS1
Discrete log based: Digital Signature Algorithm (DSA)

Example of Attacks

Birthday Attack

From Wikipedia:

The birthday paradox is a veridical paradox: it seems wrong at first glance but is, in fact, true. While it may seem surprising that only
$23$ individuals are required to reach a
$50 %$ probability of a shared birthday, this result is made more intuitive by considering that the birthday comparisons will be made between every possible pair of individuals. With
$23$ individuals, there are ⁠
$\frac{23 \times 22}{2} ⁠ = 253$ pairs to consider, far more than half the number of days in a year.

This can be applied to the hash function. Suppose we have

M

messages, and each message is tagged with a value randomly chosen from

1, 2, 3, \dots, T

Then the probability that there is a pair of messages tagged with the same value is approximately:

Prob(collision) = 1 - e^{(- \frac{M^{2}}{2 T})}

In particular, when

M > 1.17 \sqrt{T}

, then

Prob(collision) > 1 - e^{(\frac{1.17}{2})} > 0.5

Suppose the hash gives 128-bit digest. Then

T = 2^{128}

. By choosing

M = 1.17 (2^{64})

, with probability more than 0.5, birthday attack succeed.

For the birthday attack variant, we are looking at calculating the probability of a random set of elements sharing at least one element with another set of distinct elements, with both sets being subsets of an even larger set. Below is an useful theorem:

Let
$S$ be a set of
$k$ distinct elements where each element is an
$n$ bits binary string. Let us independently and randomly select
$m$
$n$ -bit binary strings and put them in the set
$T$ . The probability that
$S$ has non-empty intersection with
$T$ is more than
$1 - {2.7}^{- k m 2^{- n}}$

Straightforward method to find collision

Birthday attack

Suppose H() is a hash with k-bit digest.

Find collision:

Randomly pick two messages m₁, m₂.
If H(m1) = H(m2), then output m₁, m₂ and halt.
Repeat step 1 to 3.

Suppose H() is a hash with k-bit digest.

Constructs a set S of M = ⎡1.17* 2^k/2⎤ unique randomly chosen messages.
Compute the digest of each message m in S.
Check whether there are two messages in S having the same digest. If so, output m₁, m₂. Otherwise, output “Fail”.

The expected number of rounds taken by the above algorithm is 2^k and thus the expected number of hashes is more than 2^k. If k = 128, as analyzed in tutorial one, this is computationally infeasible.

When k = 128, one round would take ~2⁶⁴ hashes, significantly lower than before. And this has >50% chance will succeeds.

Pitfalls

Using encryption for the purpose of authentication

Encryption is designed to provide confidentiality. It does not guarantee integrity and authenticity.

For example, a company that communicates instructions via SMS does not provide integrity and authenticity if it simply encrypts its message. A sucure design could use MAC instead of encryption.

Simply adding MAC to the instructions is not sufficiently secure. It is still vulnerable to "replay attack". For example, an adversary can intercept one of the messages, and simply repeatedly send the message to receiver, which result in some command or instruction being repeatedly executed. To prevent replay attack, a cryptographic "noice" is required, which is a random or preudo-random number issued that prevents old communication from being reused.

Topic 4 Part 1 Public Key infrastructure

Topic 4 Part 2 Channel Security

Topic 5 Network Security

Topic 6 Web Security

Topic 7 & 8 Secure Programming

Topic 9 Access Control

Access Control Model

Access control setup security perimeter/boundary. Design of the boudary is guided by:
- Principle of least privilege
  - access rights that are not required to complete the role will not be assigned
- Compartmentalization
  - limits the access to information to persons or other entities on a need-to-know basis to perform certain tasks
- Defense in depth / Swiss Cheese Model
- Segregation of duties
  - breaks down processes so that no single person is responsible for every stage in a process

Separation between policy and mechanism:

A principal (or subject) wants to access an object with some operation. The reference monitor either grants or denies the access.
Principals: humas users
Subjects: entity in the system the operate on behalf of the principals

Types of accesses:
- Observe
  - reading a file
- Alter:
  - writing a file
  - deleting a file
  - changing properties
- Action:
  - executing a program
Access rights:
- Discretionary access control
  - owner of the object decides the rights
- Mandatory access control
  - system-wide policy decides the rights
  - strict rules that everyone must follow

Specifying Access Rights

r: read, w: write, x: execute, s: execute as owner, o: owner

Access Control Matrix

Example:

	my.c	mysh.sh	sudo	a.txt
root	{r,w}	{r,x}	{r,s,o}	{r,w}
Alice	{r,w}	{r,x,o}	{r,s}	{r,w,o}
Bob	{r,w,o}	{}	{r,s}	{}

Pros: can specify the access right for all pairs of principals and objects
Cons: the table would be very large, and thus different to manage

Access Control List

The access control list stores the access rights to an object as a list.

Example:

my.c	-> (root, {r,w})	-> (Alice, {r,w})	-> (Bob, {r,w,o})
mysh.sh	-> (root, {r,x})	-> (Alice, {r,x,o})
sudo	-> (root, {r,s,o})	-> (Alice, {r,s})	-> (Bob, {r,s})
a.txt	-> (root, {r,w})	-> (Alice, {r,w,o})

Pros: easy to find out all subjects that can access an object
Cons: difficult to find out all objects that a subject can access

Capabilities

A subject is given a list of capabilities, where each capability is the access rights to an object.

Example:

root	-> (my.c, {r,w})	-> (mysh.sh, {r,x})	-> (sudo, {r,s,o})	-> (a.txt, {r,w})
Alice	-> (my.c, {r,w})	-> (mysh.sh, {r,x,o})	-> (sudo, {r,s})	-> (a.txt, {r,w,o})
Bob	-> (my.c, {r,w,o})	-> (sudo, {r,s})

Pros: easy to find out all objects that a subject can access
Cons: difficult to find out all subjects that can assess an object

Notes

UNIX file system employs ACL
The size of lists for ACL and Capabilities are still too large
Another approach is to group the subjects and objects and define access rights on the defined groups. So, we need intermediate control

Intermediate Control

It is not practical for an owner to specific each single entries in the access control matrix. So, we need some ways to simplify the representation. One method is to "group" the subjects/objects and define the access rights on the group. This is called intermediate control.

Recalls that UNIX employs ACL, but instead of specifying all subjects, UNIX's ACL only specifies the rights for owner, group and world/others.

Role-based access control

The grouping can be determined by the "role" of the subject.

A role associates with a collection of procedures. In order to carry out these procedures, access rights to certain objects are required.

To design the access right of a role, we should follow the least privilege principle.

Privileges

The term privilege is sometime used to describe access right. Privilege can also be viewed as an intermediate control. It can be represented by a number, e.g. 1,2,3. if a subject can access an object, another subject with higher privilege can also access the object.

Protection rings

In OS, "privilege" is often called protection rings. They are the same but with different name.

Each object (data) and subject (process) is assigned a number. Whether a subject can access an object is determined by their respective assigned number.

Object with smaller number are more important. If a process is assigned a number

i

, we say that the process runs in ring

i

. We call processes with lower ring number as having "higher privilege".

Unix has only 2 rings, superuser and user.

Two well-known models:

Bell-LaPadula: Confidentiality

no read up
- A subject does not have read access to object in higher level. This prevent a lower level from getting info in the higher level.
no write down
- A subject does not have append-right to object in lower level. This prevents a malicious insider from passing information to lower levels. (e.g. a clerk working in the highly classified department is forbidden to gossip with other staff)

Biba: Integrity

no write up
- A subject does not has “write” access to objects in higher level. This prevent a malicious subject from poisoning upper level data, and thus ensure that a process will not get compromised by lower level subjects.
no read down
- A subject does not has read access to objects lower level. This prevents a subject from reading data poisoned by lower level subjects.

Tutorial Terminologies

Terminologies	Meaning
Cryptology
Cryptanalysis
Cryptography
NSA
NIST
cryptography backdoor
Key Escrow
Decryption order
Graphical Passwords
covert channel
end-to-end encryption
Single Sign-On
retinal vs iris scan
Forward Secrecy
Insider threat

CS2107 Introduction to Information Security

Topic 0 Overview

Topic 1 Encryption

Attack Model

Classical Ciphers

Substitution Cipher

Encryption

Decryption:

Attack:

Remark

Permutation Cipher

Encryption

Decryption

Attack

Remark

One Time Pad

Encryption

Decryption

Remark

Modern Ciphers

Symmetric vs Asymmetric Key Cryptography

Block Cipher

ECB (Electronic Code Book)

CBC (Cipher Block Chaining)

CTR (Counter Mode)

GCM

Stream Cipher

Examples of Attacks

Triple DES & Meet-in-the-middle

2DES

Meet-in-the-middle Attack

3DES

Padding Oracle Attack

PKCS#7

Padding Oracle Attack

Remark

Cryptography Pitfalls

Topic 2 Authentication Credential

Topic 3 Autheticity (Data Origin)

Public Key Cryptography

Classroom RSA

Cryptographic Hash

Keyed Hash

CBC-MAC

HMAC

Application of Unkeyed Hash

Application of Keyed Hash

Signature

Example of Attacks

Birthday Attack

Pitfalls

Topic 4 Part 1 Public Key infrastructure

Topic 4 Part 2 Channel Security

Topic 5 Network Security

Topic 6 Web Security

Topic 7 & 8 Secure Programming

Topic 9 Access Control

Access Control Model

Specifying Access Rights

Access Control Matrix

Access Control List

Capabilities

Notes

Intermediate Control

Role-based access control

Privileges

Protection rings

Tutorial Terminologies

Read more

Japanese Beginner Particles

LAJ1201 Notes

CS2107 Qustions

CS2106 Questions