---
tags: FYP
---
# Ochmanski's Theorem
## Preliminary
***Definition 1.1***
Let $M$ be a monoid and $X \subseteq M$. The ***left quotient*** of $X$ by an element $u \in M$ is denoted as $X/u := \{m \in M : um \in X\}$.
Let $\equiv_X$ be an equivalence class where $u \equiv_X v$ iff $X/u = X/v$. We call $\equiv_X$ as ***left syntactic relation*** of $X$
Alternatively, $u \equiv_X v$ iff $um \in X$ if and only if $vm \in X$ for all $m$.
The number of distinct left quotients of $X$ is the same as the size of the set of equivalence classes $\equiv_X$.
***Definition 1.2***
Let $M$ be a monoid and $X \subseteq M$. The ***$k$-syntactic quotient*** of $X$ by $P = (p_1, \dots, p_k) \subseteq M^k$ is $X // P = \{(q_0, \dots, q_k) : q_0p_1q_1 \dots p_kq_k \in X, q_i \in M\}$.
We can form an equivalence relation $(x_1, \dots, x_k) \equiv_{X,k} (y_1, \dots, y_k)$ if $X // (x_1, \dots, x_k) = Y // (y_1, \dots, y_k)$
***Definition 2***
Let $M = M(\Sigma, I)$ be a trace monoid. Let $L \subseteq \Sigma^*$ be a language. Denote the ***closure*** of the language $L$ with respect to $I$ as $f_I(L) = \bigcup [L]_I$.
***Definition 3***
Let $M = M(\Sigma, I)$ be a trace monoid. Let $T \subseteq M$ be a trace language. Denote $Lex(T)$ as a set of strings over $\Sigma^*$, where each string is used to denote each trace in $T$, and the string is lexicographically the smallest. Formally, $Lex(T) = \{ w \in \bigcup T : \forall w' \in [w], w \leq w'\}$
***Definition 4***
Let $(\Sigma, I)$ be a concurrent alphabet. For any string $w \in \Sigma^*$, denote $I(w) := \{w' : (w, w') \in I\}$, i.e. set of strings that are independent from $w$.
***Definition 5***
Let $(\Sigma, I)$ be a concurrent alphabet, and $L \subseteq \Sigma^*$. For any $[uv] \in [L]$, we say $rank_{L, I}(u, v) \leq n$ if there exists $w \in L$ such that:
1. $w = v_0 u_1 v_1 \dots u_n v_n$
2. $[v_0 \dots v_n] = [v]$ and $[u_1 \dots u_n] = [u]$
3. $(v_i, u_j) \in I$ for all $i < j$
Define $rank_{L, I}(u, v) = \min \{ n : rank_{L, I}(u, v) \leq n\}$ and $Rank_I(L) = \max \{rank_{L, I}(u, v) : [uv] \in [L]\}$.
One interpretation of $rank_{L, I}(u, v)$ is what is the minimum components needed to break down $[uv] \in [L]$ such that it forms a word $w \in L$.
***Definition 6***
A rational expression $R \in \text{REX}(\Sigma^*)$ is star-connected if it only have star operation over connected languages. Formally,
1. For any $w \in \Sigma^*$, the expression $w$ is star-connected
2. For any star-connected rational expression $R_1$ and $R_2$, $R_1 \cup R_2$ and $R_1 \cdot R_2$ are also star-connected.
3. For any star-connected rational expression $R$, $R^*$ is also rational expression if and only if $L(R)$ is connected.
---
# Plan
We will prove the following three statements are equivalent. Let $M = M(\Sigma, I)$ be a trace monoid, and $T \subseteq M$ be a trace language.
1. $T \in \text{REC}(M)$
2. $Lex(T) \in \text{REG}(\Sigma^*)$
3. There exists $R \in REX(\Sigma^*)$ such that $R$ is star connected and $[L(R)] = T$.
- (1) implies (2) is proven in ***Theorem 2***.
- (2) implies (3) is shown by ***Corollary 3.1***.
- (3) implies (1) is shown by ***???***
Note: I think it should be equivalent to what we have proven during the meeting, i.e. suppose $L \subseteq M$ such that $L$ is recognizable, then the three statements are equivalent:
1. $[L] \in REC(M)$
2. $Lex([L]) \in REG(\Sigma^*)$
3. There exists $R \in REX(\Sigma^*)$ such that $R$ is star connected and $[L(R)] = [L]$.
## Theorem 1
Let $M$ be a monoid and $\approx$ be a congruence on $M$. Let $X \subseteq M_\approx$ be a subset of equivalence classes.
$$ X \in \text{REC}(M_\approx) \iff \bigcup X \in \text{REC}(M) $$
*Proof*. It is sufficient to prove that the number of left quotient of $X$ is equal to $\bigcup X$.
Denote $\equiv_{\bigcup X}$ and $\equiv_X$ as the left syntactic relation of $\bigcup X$ w.r.t. the monoid $M$ and $X$ w.r.t. the monoid $M_\approx$ respectively.
We shall prove the following:
$$ [w_1] \equiv_X [w_2] \iff w_1 \equiv_{\bigcup X} w_2 $$
($\Rightarrow$)
- Given $[w_1], [w_2] \in X$ such that $[w_1] \equiv_X [w_2]$
- Take any $z \in (\bigcup X)/w_1$
- $w_1 z \in \bigcup X$ by definition
- $[w_1z] \in X$
- $[w_1][z] \in X$
- $[z] \in X/[w_1]$
- $[z] \in X/[w_2]$ as $[w_1] \equiv_X [w_2]$
- $[w_2][z] \in X$
- $[w_2 z] \in X$
- $w_2 z \in \bigcup X$
- $z \in (\bigcup X) / w_2$
- Therefore $(\bigcup X)/w_1 = (\bigcup X)/w_2$, hence $w_1 \equiv_{\bigcup X} w_2$
($\Leftarrow$)
- Given $w_1, w_2 \in \bigcup X$ such that $w_1 \equiv_{\bigcup X} w_2$
- Take any $[z] \in X/[w_1]$
- $[w_1][z] \in X$
- $[w_1 z] \in X$
- $w_1 z \in \bigcup X$
- $w_2 z \in \bigcup X$ as $w_1 \equiv_{\bigcup X} w_2$
- $[w_2 z] \in X$
- $[w_2][z] \in X$
- $[z] \in X/[w_2]$
- Therefore, $X/[w_1] = X/[w_2]$, hence $[w_1] \equiv_X [w_2]$
As there is one-to-one correspondence from an element in classes of $\equiv_X$ to classes of $\equiv_{\bigcup X}$, both have the same number of left quotient.
If a subset of a monoid is recognizable, then the number of left quotient is finite. This completes the proof.
***Corollary***
Let $M = M(\Sigma, I)$ be a trace monoid, and $L \subseteq \Sigma^*$ be a language. Then
$$ [L] \in \text{REC}(M) \iff f_I(L) = \bigcup [L] \in \text{REC}(\Sigma^*) $$
## Theorem 2
Let $M = M(\Sigma, I)$ be a trace monoid and $T \subseteq M$ be a trace language.
$$ T \in \text{REC}(M) \implies Lex(T) \in \text{REG}(\Sigma^*) $$
*Proof*. Note that $Lex(T) = Lex(M) \cap \bigcup T$. We know that $\bigcup T$ is regular by Theorem 1. Now we need to prove that $Lex(M)$ is also regular.
$$Lex(M) = \Sigma^* - \{\Sigma^*bI(a)a\Sigma^* : (a, b) \not\in I \text{ and } a < b\}$$
We can see that $I(a) = \{ w \in \Sigma^* : (w, a) \in I \} = \{ c \in \Sigma : (c, a) \in I \}^*$ is regular. Therefore, $Lex(M)$ is regular.
Hence, we conclude that $Lex(T)$ is also a regular language.
## Theorem 3
Let $M = M(\Sigma, I)$ be a trace monoid and $T \subseteq M$ be a trace language.
$$ Lex(T) \in \text{REG}(\Sigma^*) \implies \exists R \in \text{REX}(\Sigma^*), R \text{ is star-connected and } L(R) = Lex(T) $$
*Proof*. Suppose not, there exists a word $w$ and $ww$ in $L(R)$ such that $w$ is a disconnected word. However, $ww$ cannot be lexicographically smallest, as we can swap some smallest letter from second $w$ to the first $w$.
### Corollary 3.1
$$ Lex(T) \in \text{REG}(\Sigma^*) \implies \exists R \in \text{REX}(\Sigma^*), R \text{ is star-connected and } [L(R)] = T $$
## Lemma 3
Let $(\Sigma^*, I)$ be a concurrent alphabet. Let $u, v, w \in \Sigma^*$. $[w] = [uv]$ if and only if $w = v_0 u_1 v_1 \dots u_n v_n$, where $[u] = [u_1 \dots u_n]$, $[v] = [v_0 \dots v_n]$, and $(v_i, u_j) \in I$ for $i < j$.
## Theorem 4 (Hachiguchi)
Let $(\Sigma, I)$ be a concurrent alphabet. Suppose $L \subseteq \Sigma^*$ and $L \in \text{REG}(\Sigma^*)$.
If $Rank_I(L)$ is finite, then $f_I(L) \in \text{REG}(\Sigma^*)$.
*Proof*. It is sufficient to prove that the number of left quotient of $f_I(L)$ is finite.
We will try to find a way to represent $f_I(L)/u$, for some $u \in \Sigma^*$.
Let $v \in f_I(L) / u$, then $uv \in f_I(L)$. From ***Lemma 3***, we know that there exists $w \in L$ such that:
1. $w = v_0 u_1 v_1 \dots u_n v_n$
2. $[u] = [u_1 \dots u_n]$, $[v] = [v_0 \dots v_n]$
3. $(v_i, u_j) \in I$ for $i < j$
We can represent $v$ as $n$-syntactic quotient of $(u_1 \dots u_n)$. Let $\text{CON}$ be a concatenation function of tuples, i.e. $\text{CON}((w_1, w_2, \dots, w_n)) = w_1 \dots w_n$. Then for any $v \in f_I(L) / u$, surely $v \in f_I[\text{CON}(L // (u_1, \dots, u_n))]$, where $[u] = [u_1 \dots u_n]$. However, we need to ensure that all elements $(v_0, \dots, v_n) \in L // (u_1, \dots, u_n)$ must satisfy (3).
Hence, we intersect it with $I(x_1, \dots, x_k) = I(x_1 \dots x_k) \times I(x_2 \dots x_k) \dots \times I(x_k) \times \Sigma^*$, i.e. a tuple whose first element is independent with all, the second element is independent with second word onwards, etc.
Therefore, $f_I(L) / u = \bigcup \{f_I[\text{CON}(L // (u_1, \dots, u_n) \cap I(u_1, \dots, u_n))] : [u_1 \dots u_n] = [u]\}$.
Since the family $\{I(x) : x \in \Sigma^*\}$ is finite, so does $\{I(x_1, \dots, x_n) : x_i \in \Sigma^*\}$.
Since $L$ is regular, $\equiv_{L, k}$ has finite equivalence classes. Therefore, the family $\{L//(u_1, \dots, u_n)\}$ is finite as $Rank_I(L)$ is finite. Therefore, the family $\{f_I(L) / u\}$ is finite.
Hence, $f_I(L)$ is recognizable.
## Theorem 5
Let $M = M(\Sigma, I)$ be a trace monoid, and $T \subseteq M$ be a trace language.
$$ \exists R \in \text{REX}(\Sigma^*), R \text{ is star-connected and } [L(R)] = T \implies T \in \text{REC}(M) $$
*Proof.* We will prove this by using structural induction over definition of star-connected rational expression.
**Base Case**: For every word $w \in \Sigma^*$, suppose $R = w$. Therefore $[L(R)] \in \text{REC}(M)$, as the size of $[L(R)]$ is finite.
**Inductive Case**.
(Union) Suppose $R_1$ and $R_2$ are star-connected. By inductive step, both $[L(R_1)]$ and $[L(R_2)]$ are recognizable. Note that $[L(R_1 \cup R_2)] = [L(R_1)] \cup [L(R_2)]$. As recognizability is closed under union, we conclude that $[L(R_1 \cup R_2)] \in \text{REC}(M)$
(Concatenation) Suppose $R_1$ and $R_2$ are star-connected. By inductive step, both $[L(R_1)]$ and $[L(R_2)]$ are recognizable.
Denote $L_1 = \bigcup [L(R_1)]$ and $L_2 = \bigcup [L(R_2)]$. Both $L_1$ and $L_2$ are regular languages. As regular languages are closed under concatenation, therefore $L_1 \cdot L_2$ is a regular language.
Take any $[uv] \in [L_1L_2]$. We can find $w \in L_1L_2$ such that $w$ is factorization of $u$ and $v$. As $w \in L_1L_2$, we can break down $w$ as $w_1w_2$, where $w_1 \in L_1$ and $w_2 \in L_2$.
Note that $[uv] = [w_1w_2]$. By Levi's lemma, there exists $x_1, x_2, x_3, x_4$ such that $(x_2, x_3) \in I$ and $u = x_1 x_2, v = x_3 x_4, w_1 = x_1 x_3, w_2 = x_2 x_4$. Hence, $Rank_I(L_1L_2) \leq 2$. By Hachiguchi, $f_I(L_1 L_2) = \bigcup (L(R_1) L(R_2))$ is regular. Therefore, $[L(R_1)L(R_2)]$ is recognizable.
(Star) ???
---
## Appendix
We will prove that $Lex(M) = \Sigma^* - \{\Sigma^*bI(a)a \Sigma^* : (a, b) \not\in I \text { and } a < b\}$ is true.
It is sufficient to prove that all strings that are not minimal (lexicographically smallest) can be represented as $\Sigma^*bI(a)a\Sigma^*$ for some $(a, b) \not\in I$ and $a < b$, and that representation is sufficient to contain all minimal string.
Equivalently, we can prove the following:
Let $w$ be a word. $w$ is minimal if and only if for all factorizations $w = xbyaz$, where $x, y, z \in \Sigma^*$, $a, b \in \Sigma$, $(a, b) \in I$ and $a < b$, there is a letter of $y$ that does not commute with $a$.
*Proof*. If $w$ is minimal, then the consequent is true. Otherwise, we can swap $a$ and $b$ to get smaller string.
Let $w$ be a word such that for all factorizations $w = xbyaz$ where $x, y, z \in \Sigma^*$, $a, b \in \Sigma$, $a < b$, and $(a, b) \in I$, there is a letter in $y$ that does not commute with $a$.
Suppose $w$ is not minimal, i.e. there is an equivalent word $v$ such that $v < w$. Hence, there must be some symbols $a$ and $b$ such that $a < b$ and $v = x'av'$ and $w = x'bw'$, where $x', v', w' \in \Sigma^*$. We know that there must be symbol $a$ in $w'$, and $(a, b) \in I$.
We can rewrite $w$ as $w = x'byaz$ where $|y|_a = 0$, i.e. $a$ does not appear in $y$. This violates the assumption that all factorizations of $w$ satisfies the condition.
## References
- The Book of Traces Chapter 6
- Partial Commutation and Traces, Theorem 7