"In a rough and ready sort of way it seems to me fair to think of the semantics as being what we want to say and the syntax as how we have to say it."
Christopher Strachey, a pioneer of programming languages, in his seminal paper Fundamental Concepts in Programming Languages
Recall that we started with an example of a formal language given by
num ::= 1 | num + 1
exp ::= num | exp + exp | exp * exp
Of course, this language was intended to be a language for arithmetic. We build "numbers" by starting at 1 and doing +1 if we want to get "one more". And we build expressions by doing "addition" + and "multiplication" *.
Actually, in the following, I may slip into the shorter form
exp ::= 1 | exp + exp | exp * exp
The longer form has the advantage that it has a "type" num for numbers, whereas the shorter one leaves it up to interpretation which expressions should be considered as numbers. We will agree that numbers are represented by expressions that are bracketed to the left, as in (((1+1)+1)+1)+1. While the longer version with its separate num and exp is more principled, the shorter version will be more convenient to present in the following.
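As an aside, such a grammar can be transcribed directly into a datatype of an implementation language. Here is a minimal sketch in Haskell; the choice of Haskell and the names Exp, One, Add, Mul are illustration only, not part of the lecture.

data Exp = One            -- the expression 1
         | Add Exp Exp    -- exp + exp
         | Mul Exp Exp    -- exp * exp

The longer grammar would correspond to two mutually recursive datatypes, one for num and one for exp.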
In any case, note that on their own, these grammars do not tell us that the languages are about arithmetic (this is why I put "numbers", "one more", "addition", "multiplication" in quotes). A grammar only defines a formal language, which is just the set of terms that can be formed according to its rules. For example, we can use these rules to deduce that
(1+1)+1 is a num and, similarly, that ((1+1)+1)+(1+1) is an exp. (Exercise: Write down the corresponding deductions/derivations.)
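In the Haskell sketch above, these deductions correspond to the observation that the two terms can be written down as well-typed values (again, the names t1 and t2 are illustration only):

data Exp = One | Add Exp Exp | Mul Exp Exp   -- as in the sketch above

-- the term (1+1)+1
t1 :: Exp
t1 = Add (Add One One) One

-- the term ((1+1)+1)+(1+1)
t2 :: Exp
t2 = Add t1 (Add One One)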
In other words, on their own, these terms do not have meaning. We say that they are just syntax. We may think of the term (1+1)+1 as 3. But for the moment this is just how we think about the formal language. It is not part of the definition. (And this is important, because we want to be able to use the formal language to make machines, which cannot think, perform arithmetic.)
Next we added equations
X + ( Y + Z ) = ( X + Y ) + Z
X * 1 = X
X * ( Y + Z ) = X * Y + X * Z
X * ( Y * Z ) = ( X * Y ) * Z
X + Y = Y + X
X * Y = Y * X
We have seen how to use the equations to compute, for example, what we think of as 3+2=5. Formally, we compute
((1+1)+1)+(1+1) = (((1+1)+1)+1)+1
applying the rule of associativity for + once. We have also seen that 2+3=5 needs two applications of associativity. Therefore, in some sense, 3+2 is different from 2+3. But in another sense, both are the same because they both evaluate to 5.
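For the record, the intermediate steps, which are not spelled out above, can be reconstructed as follows. For 3+2, one application of X + ( Y + Z ) = ( X + Y ) + Z suffices:
((1+1)+1)+(1+1) = (((1+1)+1)+1)+1
For 2+3, we need two applications:
(1+1)+((1+1)+1) = ((1+1)+(1+1))+1 = (((1+1)+1)+1)+1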
Remark: Note that the variables X, Y, Z are not part of our language of arithmetic. They are merely placeholders for the terms (i.e. expressions) of the language.
How can we make sense of these different notions of equality?
And can we make sense of the following two questions:
Are the equations true? (soundness of the equations)
Did we write down enough equations? (completeness of the equations)
Of course, we learned in high-school maths that the equations are true. But, so far, we are just talking about a formal language. So if we can write
X * 1 = X
then why not
X + 1 = X ?
The big idea that is missing so far is how to give meaning to a language. We also say: What is the semantics of our formal language, what is the semantics of the syntax?
Meaning comes from talking about something that everybody has agreed upon or understands. For us, there are two possibilities: We all understand (at least to some degree that should suffice for this course)
discrete mathematics
computing machines
In the first case we speak of mathematical or denotational semantics, in the second case of operational semantics. These are two different but equivalent ways of explicating the meaning of a programming language.
Let us write exp for the set of expressions generated by the grammar above. Let ℕ be the set of natural numbers together with their usual addition and multiplication.
The meaning function, or semantics, is a function
⟦−⟧ : exp → ℕ
Its definition involves three different languages and we use, just for now, three different colours to help keep them apart: the formal language exp, the mathematical language in which we write the definition, and the natural numbers.
We can now define the meaning function by induction on expressions (where e and e' stand for arbitrary expressions):
⟦1⟧ = 1
⟦e + e'⟧ = ⟦e⟧ + ⟦e'⟧
⟦e * e'⟧ = ⟦e⟧ * ⟦e'⟧
To understand this definition, remember what you learned about induction on natural numbers in discrete mathematics. For natural numbers, we have two operations, the constant "0" and the successor "+1". This is similar to the first two cases in the definition of exp. But expressions also have two binary operations, + and *. This just means that an inductive definition over expressions also needs to have two cases for those. (For more details see the lecture on induction.)
Remark: Note that the definition of the function ⟦−⟧ uses the symbols 1, + and * in two different ways: inside the brackets they are symbols of the formal language, while on the right-hand sides they denote the familiar number and operations on natural numbers.
To get a feeling for how such an inductive definition works, we do a little computation where we inductively/recursively decompose terms until we can apply the base case and finally add up in the natural numbers. (Exercise: fill in the dots in the equational reasoning below.)
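For instance, for the term (1+1)+1, such a computation runs as follows (a sketch of the kind of equational reasoning meant above):
⟦(1+1)+1⟧ = ⟦1+1⟧ + ⟦1⟧ = (⟦1⟧ + ⟦1⟧) + ⟦1⟧ = (1 + 1) + 1 = 3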
We can also read the definition of the meaning function ⟦−⟧ as a program, namely as an interpreter.
Here, blue is the programming language for which we write the interpreter, black is the programming language in which we write the interpreter, and red is the language of the machine on which the interpreter is running.
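To make this concrete, here is a minimal sketch of such an interpreter in Haskell; the choice of language and the names are illustration only, not part of the lecture.

-- Abstract syntax of  exp ::= 1 | exp + exp | exp * exp
data Exp = One | Add Exp Exp | Mul Exp Exp

-- The meaning function: one clause per rule of the grammar.
-- The + and * on the left live in the interpreted language (blue),
-- the + and * on the right are the arithmetic of the meta-language.
eval :: Exp -> Integer
eval One       = 1
eval (Add e f) = eval e + eval f
eval (Mul e f) = eval e * eval f

For example, eval (Add (Add One One) (Add One One)) returns 4.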
In the operational interpretation of the equations, we read them from left to right as rules for rewriting expressions. Since the interpreter is running on the same machine that is adding the numbers, we do not need to distinguish black and red anymore and replace black by red.
Let us illustrate this with the example above again: reading the associativity rule X + ( Y + Z ) = ( X + Y ) + Z from left to right, the term ((1+1)+1)+(1+1) rewrites in one step to (((1+1)+1)+1)+1, that is, to the numeral we think of as 5.
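A rewriting interpreter can also be sketched in Haskell; for brevity, the sketch below implements only the associativity rule for +, and all names are illustration only.

data Exp = One | Add Exp Exp | Mul Exp Exp

-- One rewriting step: apply X + (Y + Z) = (X + Y) + Z from left to right,
-- at the outermost position where it matches.
step :: Exp -> Maybe Exp
step (Add x (Add y z)) = Just (Add (Add x y) z)
step (Add x y) =
  case step x of
    Just x' -> Just (Add x' y)
    Nothing -> Add x <$> step y
step _ = Nothing

-- Rewrite until no rule applies; for terms built from 1 and + this
-- yields the left-bracketed numeral.
normalize :: Exp -> Exp
normalize e = maybe e normalize (step e)

Note that a rewriting interpreter uses the equations in one direction only: an equation such as X + Y = Y + X, read as a rewrite rule, could be applied forever.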
Having put in the work of making precise the meaning of our formal language, we can now also make precise our two questions from the beginning of the lecture.
First recall the equations above. We now replace = by ≈, writing e ≈ e' if the equation e = e' can be derived from the equations. This keeps plain syntactic equality of terms apart from equality "modulo the equations".
A consequence of the equations above is that the syntax "modulo the equations" agrees exactly with the semantics.
For example, while 3+2 and 2+3 are different terms, modulo the equations we have 3+2 ≈ 2+3, which matches the equality 3+2 = 2+3 that we know to be true for natural numbers.
In programming languages jargon, this agreement between syntax and semantics is formulated by saying that the equations are sound and complete.
But we still need to say what precisely we mean by equations being sound and complete: the equations are sound if e ≈ e' implies ⟦e⟧ = ⟦e'⟧, and complete if ⟦e⟧ = ⟦e'⟧ implies e ≈ e'.
Exercise: Convince yourself that the equations are sound.
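One way to gain confidence, short of a proof, is to test the equations on sample instantiations, in the style of the earlier Haskell sketches (again illustration only):

data Exp = One | Add Exp Exp | Mul Exp Exp

eval :: Exp -> Integer
eval One       = 1
eval (Add e f) = eval e + eval f
eval (Mul e f) = eval e * eval f

-- Sample expressions to instantiate the placeholders X, Y, Z.
samples :: [Exp]
samples = [One, Add One One, Mul (Add One One) One]

-- Check distributivity  X * (Y + Z) = X * Y + X * Z  on all samples.
distributive :: Bool
distributive =
  and [ eval (Mul x (Add y z)) == eval (Add (Mul x y) (Mul x z))
      | x <- samples, y <- samples, z <- samples ]

The other equations can be checked in the same way; soundness for all instantiations, of course, requires a proof by induction.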
Remember how we came up with the language and the equations. We started from our knowledge of natural numbers and defined the formal language accordingly. Then we discovered the equations by needing them for computations/proofs. For example, we needed associativity to get 3+2=5. After adding multiplication, we also needed distributivity. And then we added some more equations that we knew to be true. But how do we know that we have enough? How can we be sure that we can now do all computations in our language with this small number of equations? Or can we do with fewer equations?
Activity: Discuss to what extent the following answers the questions from the beginning of the lecture.
If we need to bootstrap meaning from nothing, the best we can do is to find ways to directly represent the structure of interest.
For example, we can define numbers by saying that 0 is a number and, if n is a number, then so is n+1. Then we can give meaning to the expressions exp ::= 1 | exp + exp | exp * exp by mapping them into numbers.
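A minimal sketch of this in Haskell, with numbers built from Zero and Succ as just described (names again illustration only):

-- 0 is a number, and if n is a number then so is its successor.
data Nat = Zero | Succ Nat

plus :: Nat -> Nat -> Nat
plus Zero     n = n
plus (Succ m) n = Succ (plus m n)

times :: Nat -> Nat -> Nat
times Zero     _ = Zero
times (Succ m) n = plus n (times m n)

data Exp = One | Add Exp Exp | Mul Exp Exp

-- Meaning by mapping expressions into numbers.
meaning :: Exp -> Nat
meaning One       = Succ Zero
meaning (Add e f) = plus  (meaning e) (meaning f)
meaning (Mul e f) = times (meaning e) (meaning f)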
Alternatively, we can give meaning to expressions by axiomatising their structure, that is, by writing out the equations for + and *.
Computations then capture meaning, because computations rely on the same equations that axiomatise the structure.