Meaning in Syntax

$\newcommand{\sem}[1]{[\![#1]\!]}$ # Meaning in Syntax ## Learning Outcomes Students will understand how to describe meaning syntactically by using the notion of a (computable) equivalence relation. (This lecture will not focus on computability, but we will come back to this issue in the following.) ## Introduction and Summary ### Meaning via Interpretation into a Known Domain As discussed in [a previous lecture](https://hackmd.io/@alexhkurz/BJkd1qSDS), we often describe the meaning of a language (or more general of any kind of formalism) with the help of a meaning function $$\sem{-} : \cal L \to \cal D$$ where $\cal L$ is a "language" that needs to be explained and $\cal D$ is the domain of interpretation. **Examples:** One example from natural languages we encountered in the course was in our digression on Descartes who famously said in $\mathcal L=$ Latin "cogito ergo sum" which we understand by interpreting it in $\cal D=$ English by "I think therefore I am". Here are some more examples. $\cal L$ | $\cal D$ |:---:|:---:| German | English Java | Java Virtual Machine C++ | Assembly Arithmetic | Mathematical Numbers **Exercise:** Make your own examples. Explain the caclulator you implemented in terms of the table above. ### Meaning via a Computable Equivalence Relation To take up the example of natural language translation again: How did you learn your first language? Certainly not by translating it into another language. Or how does a [thesaurus](https://www.thesaurus.com/browse/small) work? Or, for example, if you wanted to look up a work like [epistemology](https://www.merriam-webster.com/dictionary/epistemology) in a dictionay a translation in another language would probably not help you. Instead you get the information that epistemology is equivalent to the study or a theory of the nature and grounds of knowledge especially with reference to its limits and validity. **Remark:** Note that when I write "is equivalent to" above, I am referring to an [equivalence relation](). We will denote equivalence relations by various symbols such as $=$ or $\approx$ or $\equiv$. In this lecture, we show that we can discribe a meaning function $$\sem{-} : \cal L \to \cal D$$ without having to rely on an a priori understanding of a known domain $\cal D$, if we can describe the equivalence relation $t\equiv t'$ defined as $$ t\equiv t' \ \stackrel{\mathrm{def}}{\Longleftrightarrow} \ \sem{t}=\sem{t'}$$ in a computable way. **Notation:** We used before $\approx$ as a symbol for equivalence in our concrete examples, but it is also quite common to use $\equiv$ as a symbol for an equivalence relation. We tend to use $\equiv$ if we think of the equivalence relation as specified by a meaning function. But this is only a difference in point of view. As we will see in this lecture, every meaning functions gives an equivalence relation and, conversely, every equivalence relation defines a meaning functions. ## Meaning in Syntax In [a previous lecture](https://hackmd.io/@alexhkurz/BJkd1qSDS) we gave meaning to a formal language by mapping it to a structure we were familiar with, in our case the natural numbers we know from mathematics or the natural numbers as implementd on a computing machine. Is there way of capturing the meaning that resides in the mapping $\sem{-}$ by staying completely on the syntactic side? Obviously impossible? The answer seems to be yes, when we think eg about sending DNA into outer space. Even if a superintelligence would discover our DNA, they would not be able to learn much about us from it. The meaning of DNA relies on a sophisticated environment surrounding it. But maths could be different. I believe we have some evidence that the maths is the same throughout the universe. So there is some hope to represent maths in some objective way ... I am speculating here and we can come back to the question later ... The speculations of the previous paragraph are relevant to our endeavour of making machines perform meaningful tasks: How do we make machines perform meaningful tasks, if the machines cannot understand the meaning of what they should do? The basic idea is that we make the machines follow rules that are clearly specified. But then the question is: How do we make sure that the rules capture what we mean? One big idea tackling this question is our topic here. To simplify the big question, let us concentrate on one example: Can we capture the maths of numbers by pure syntax? [^first] Let us [recall](https://hackmd.io/@alexhkurz/BJkd1qSDS) the formal language [^second] exp ::= 1 | exp + exp | exp * exp and its semantics $$\sem{-} :\mathcal L \to \mathbb N$$ which maps an expression to the corresponding number. For example, $(1+1)+1$ and $1+(1+1)$ are mapped the same number 3, but are different expressions. We will now revisit our discussion of [syntax, semantics, soundness, complete](https://hackmd.io/@alexhkurz/BJkd1qSDS) in the light of what we learned about [equivalence relations](https://hackmd.io/@alexhkurz/SJ1cc-dDr). The crux of the matter is the following **Exercise:** - Let $f:A\to B$ be a function. Show that $$a\equiv a' \ \stackrel{\rm def}{=} \ f(a)=f(a')$$ defines an equivalence relation, the ***equivalence relation induced by*** $f$. - Show that if $f$ is onto, then the induced function $$A/{\equiv}\to B$$ is a bijection. Why is it important to know that we get a bijection? Because this says that there is a perfect correspondence (one-to-one and onto, also known as an ***isomorphism***) between $A/{\equiv}$ and $B$. In other words, up to naming conventions, $A/{\equiv}$ are exactly the same $B$ sets. We now apply this to the meaning-function $$\sem{-} :\mathcal L \to \mathbb N$$ and we see that the set $\mathcal L/{\equiv}$ of equivalence classes is, up to renaming of the elements, the same as $\mathbb N$. In other words, if we are able to describe $\equiv$ by a set of rules, then we have captured the semantics $\mathbb N$ by pure syntax, amenable to computations by a machine. So can we describe $\equiv$ by a set of rules? Yes, we can! Because ... the equivalence relation induced by $\sem{-}$ is given by the familiar equations \begin{align} X + ( Y + Z ) & \approx ( X + Y ) + Z \\ X \cdot 1 &\approx X \\ X \cdot ( Y + Z ) & \approx X \cdot Y + X \cdot Z \\ X \cdot ( Y \cdot Z ) & \approx ( X \cdot Y ) \cdot Z \\ X + Y & \approx Y + X \\ X \cdot Y & \approx Y \cdot X \end{align} **Question:** Are all equations above needed? Is there one missing? **Remark:** In the [discrete maths lecture](https://hackmd.io/@alexhkurz/SJ1cc-dDr) we not only learned about equivalence relations but emphasised equivalence relations that are the equivalence closure $\stackrel{\ast}{\leftrightarrow}$ of a "one-step-computation" relation $\to$. During the next lectures, we will investigate when such a relation can be used to "rewrite to unique normal form". This is important: If every equivalence class has a unique normal form, then this normal form can be used to represent every element in the class. For example, write $n$ for the normal form of the equivalence class of elements $e$ such that $\sem{e}=3$. We then can identify $n$ with $3$, or, in other words, we can consider $n$ just as a different notation, or encoding, of 3 itself. Thus, as a slogan, what was syntax has become semantics. We have found meaning in syntax. [^first]: Of course, everybody who has ever used a calculator knows that the answer is yes. But let us forget for a moment that we know the answer in this example. Let us start thinking from scratch and discover a big idea that also applies to much more difficult examples. [^second]: I slightly simplified the grammar by putting everything in one line. It is safe to ignore this difference for the current discussion.