The note of Formal Language and the Theorem of Computation

tags: `NCTU`, `Formal Language`, `Theorem of Computation`

Chap 4 Decidability

4.1 Decidable Language

For Regular Language

為了要證明

A_{D F A}

是 decidable，
先說明 acceptance problems for

D F A s

( 在一個特定的

D F A

上檢測特定字串

w

是否 accept )，可以以 language 的方式來呈現，即

A_{D F A} = {< B, w > | B

is a

D F A

that accepts input string

w}

所以上述問題等同於判斷

< B, w >\in A_{D F A}

是否成立。

回到

A_{D F A}

是否 decidable 的問題，其若且唯若

< B, w >

是否 decidable 。

Decidable 指的是是否存在一個
$T M$ 可以決定
這個語言，且不存在 infinite loop。
當存在一個演算法可以判斷這個問題即表示找到此
$T M$

上述的說明可以推廣成 whether the laguage of a finite automaton is empty(

E_{D F A}

) and whether two finite automaton are equivalent(

E Q_{D F A}

)是不是 decidable

$A_{D F A}$
$A_{N F A}$
$E_{D F A}$
$E Q_{D F A}$
結論：如果能夠找到一個演算法，能夠使得 acceptance problem 可解，即代表此問題為 Turing recognizable，且因為找到的演算法不具 infinite loop 所以又稱為 Turing decidable

演算法舉例在課本有

For Contex-Free Language

證明方法和 Regular 一樣，皆是找出一個演算法。

其中的 proof idea 我覺得比較有趣的地方是，當判斷一個

w

是否屬於一個

C F G

的時候，若從

S

開始展開，過程中如果出現

w

即代表

w \in C F G

，這個方法看似沒問題，但在考慮

w \notin C F G

的時候上述方式卻會進入 infinite loop，那麼就代表

A_{C F G}

是 Turing recognizable 而不是 decidable？

解法，先將

C F G

轉換成 CNF，根據 CNF 的性質，長度為 n 的

w

，最多只需要

2 n - 1

個 steps 就能找到，依照這個特性設計的演算法就不會陷入 infinite loop 的窘境，故

A_{C F G}

是 decidable

$A_{C F G}$
$E_{C F G}$
Every CFL is decidable (not
$A l l_{C F G}$ )

$E Q_{C F G}$ is not decidable

4.2 Undecidability

電腦看似強大到每種 problem 都可解，但是仍然存在 unsolvable problem，舉例來說，去設計一個演算法來判斷一個演算法是否正確，像是課堂作業要求學生實作排序演算法，老師助教們需要去判別學生所實作的 sort algorithm 是不是在 any inputs 之下都會正確運作。

此種問題可以看成是一個 language，即

$A_{T M} = {< M, w > | M$ is a
$T M$ and
$M$ accepts
$w}$

而

A_{T M}

僅僅只有 Turing-recognizable。

那是因為目前不存在有比

T M

更強大的計算模型去包括

T M

本身 ( 像是

D F A

就可以用

T M

去 simulate ) 或是設計出有效率的方式能找到推演方法（像是

C F G

存在

C N F

去避免進入 infinite loop ），所以當輸入一個

T M

會 infinite loop 的

w

負責模擬

T M

的

T M^{'}

可能會進入 infinite loop。

Key point: 是否存在
$T M$ 不接受的 inputs?
- i.e. Language 的數量是否比
  $T M$ 能表示的數量還要多？

T M

的數量可以用所有 alphabet 的組合來表示 ( i.e.

\sum^{*}

)，
Language 的數量可以用

\sum^{*}

的 power set 來表示 ( i.e.

P (\sum^{*})

)

其中

\sum^{*}

是 countable infinite set，但

P (\sum^{*})

是 uncountable infinite set，所以 Language 的數量遠大於

T M

的數量。

證明在課本有，使用對角線證明。
用
$T M$ 來模擬的話，類似邏輯謬論（一位理髮師只幫不剪自己頭髮的人理頭髮，試問理髮師會不會理自己的頭髮）

Undecidable Language

Undecidable 具有一個特殊的性質，即

A language is decidable iff it is Turing-recognizable and co-Turing-recognizable

Image Not Showing Possible Reasons
The image file may be corrupted
The server hosting the image is unavailable
The image path is incorrect
The image format is not supported
Learn More →

其中我覺得關鍵是 Every string

w

is either in

A

\overset{―}{A}

，
因為在 simulate

T M

上的

T M

中所接受的 input 是 Language ，every language

L

is not either in

T M

\overset{―}{T M}

，
不像在 simulate

C F L

D F A

中的

T M

接受的 input 是 string (

\sum^{*}

)。

最後我們可以用上述性質來證明

\overset{―}{A_{T M}}

不是 decidable，因為

A_{T M}

不是 decidable

Ch5

$E_{T M}$ is undecidable
- Use the knowledge of
  $A_{T M}$ is undecidable
- Let
  $S$ is the
  $T M$ simulate
  $A_{T M}$ , and
  $R$ is the
  $T M$ simulate
  $E_{T M}$ .
- Construct
  $M_{1}$ which accept when
  $i n p u t = w$ and reject otherwise, so that
- $S$ = 1. Run
  $R$ on
  $M_{1}$
  2. If
  $R$ accepts then
  $r e j e c t$ ; if R rejects then
  $a c c e p t$ .
$R E G U L A R_{T M}$ is undecidable
- Use the knowledge of
  $A_{T M}$ is undecidable
- Let
  $S$ is the
  $T M$ simulate
  $A_{T M}$ , and
  $R$ is the
  $T M$ simulate
  $R E G U L A R_{T M}$ .
- Construct
  $M_{2}$ which accept strings in
  ${0^{n} 1^{n} | n \geq 0}$ if
  $M$ does not accept
  $w$ , and to recognize regular language
  $\sum^{*}$ if
  $M$ accepts
  $w$ , so that
- $S$ = 1. Run
  $R$ on
  $M 2$
  2. If
  $R$ accepts then
  $a c c e p t$ ; if R rejects then
  $r e j e c t$ .
$E Q_{T M}$ is undecidable
- Use the knowledge of
  $E_{T M}$ is undecidable
- Let
  $S$ is the
  $T M$ simulate
  $E_{T M}$ , and
  $R$ is the
  $T M$ simulate
  $E Q_{T M}$ , and
  $M_{1}$ is a
  $T M$ that rejects all inputs.
- $S$ = 1. Run
  $R$ on input
  $< M, M_{1} >$
  2. if
  $R$ accepts then
  $a c c e p t$ ; if
  $R$ rejects then
  $r e j e c t$

Linear Bounded Automaton

是

T M

的一種變形，多了 tape 長度的限制，舉例來說，當 input length 為 n 的時候，tape ( memory ) 長度只需要 n 的線性大小即可，因得其名。

$L B A$ 的 configuration 數量為
$q n g^{n}$ where
$T M$ has
$q$ states and
$g$ symbols

上述性質可以使得

A_{L B A}

是 decidable！

因為最多只需要經過

q n g^{n}

個 steps 就能得知是否

a c c e p t

( i.e.

L B A

never have infinite loop )

$A_{L B A}$ is decidable
$E_{L B A}$ is undecidable
- Proof idea is similar to
  $E_{T M}$
- 建立一個
  $L B A$ accepts if input 是
  $M$ accepts
  $w$ 的 computation history, otherwise reject all inputs.
- Let
  $S$ is the
  $T M$ simulate
  $A_{T M}$ , and
  $R$ is the
  $T M$ simulate
  $E_{L B A}$ .
- $S$ =
  $a c c e p t$ if
  $R$ rejects;
  
  $r e j e c t$ if
  $R$ accepts.

$L B A$ 如何 check computation history 的步驟課本有

$A L L_{C F G}$ is undecidable
- 建立一個
  $C F G$ 在
  $M$ accepts
  $w$ 時會產生除了該 computation history 之外的所有字串，而
  $M$ does not accept
  $w$ 時則產生所有字串。
- Let
  $S$ is the
  $T M$ simulate
  $A_{T M}$ , and
  $R$ is the
  $T M$ simulate
  $A L L_{C F G}$ .
- $S$
  $a c c e t p$ if
  $R$ rejects;
  
  $r e j e c t$ if
  $R$ accepts.

$C F G$ ( i.e.
$P D A$ ) 如何 check 不生成特定 computation history 的步驟課本有

$E Q_{C F G}$ is undecidable
- Let
  $S$ is the
  $T M$ simulate
  $A L L_{C F G}$ , and
  $R$ is the
  $T M$ simulate
  $E Q_{C F G}$ .
- Let
  $G_{0}$ 生成
  $\sum^{*}$
- $S$
  $a c c e p t$ if
  $R (G, G_{0})$ accepts;
  
  $r e j e c t$ if
  $R (G, G_{0})$ rejects.

A_{T M} \leq_{m} \overset{―}{X}

hints that

\overset{―}{X}

is undecidable and

X

is not turing-recognizable.

Note that
${\overset{―}{A}}_{T M} \leq_{m} \overset{―}{\overset{―}{X}} \Rightarrow {\overset{―}{A}}_{T M} \leq_{m} X$ , and
${\overset{―}{A}}_{T M}$ is not turing-recognizable, so is X.

The note of Formal Language and the Theorem of Computation

tags: NCTU, Formal Language, Theorem of Computation

Chap 4 Decidability

4.1 Decidable Language

For Regular Language

For Contex-Free Language

4.2 Undecidability

Undecidable Language

Ch5

Linear Bounded Automaton

Read more

NCTU Machine Learning

LINUX 核心設計 課程用書筆記

Play in a Sandbox!?

2020 Machine Learning Homework 7

tags: `NCTU`, `Formal Language`, `Theorem of Computation`

LINUX 核心設計課程用書筆記