演算法 - 荊宇泰 (2022 Fall)

tags: `NYCU-2022-Fall`

Class info.

課程資訊

One midterm exam, one final exam (70% of final score).

At most 3 programming assignments, some homework (reading assignments), and some quizs.

Date

9/16

課程介紹，期中要多努力，期末難

9/23

In Computer Science, computer algorithms defined under the Random Access Machine (RAM) model. RAM, Random Access Machine, there are finite number of instructions, \(+, -, *, ...\).
These instructions are basic enough (assembly language of X86).

在隨機存取圖靈機上，多了一個特殊的指針磁帶，大小是對數空間，字母是二進位單字(0和1)。圖靈機有一個特殊的狀態(state)，當指到這個狀態而指針磁帶的數字(二進位)是’p’時，圖靈機會將工作磁帶上面的指針移動到輸入的第p個符號。
這特性讓圖靈機可以直接讀取輸入的特定字母，而不需要花時間去處理整條輸入。這對使用少於線性時間的複雜度類來說，是必要的(因爲處理整個輸入的時間是線性時間)

Correctness of algorithm doesn’t imply the correctness of the program.

Calculate the number of operations required and present the time required as a function of the input size.

Worst case, each time you have a key which is smallest among the previous sorted list, and thus you have to compare all.
Generally follow the worst case convention.

Program to find the second smallest element in an array

Θ-Notation, asymptotic tight bound

Given a function \(g(n), Θ(g(n))\) is the set of functions, \(Θ(g(n)) = \{f (n) \ | \ ∃ \ positive \ constants \ c_1, c_2, and \ n_0 \ s.t. \ 0 ≤ c_1 · g(n) ≤ f (n) ≤ c_2 · g(n), ∀n > n_0\}\)

9/30

\(\theta(n^2) \ \newcommand*\xor{\oplus} \ \rm{O}(n^2) \rightarrow \theta(n^2)\)
兩集合取交集

Insertion Sort
stable
Selection Sort
stable(老師說的"當=時就交換就能簡單變成stable"，但查到的都unstable)

Binary Search

// g++ cpp-binary-search.cpp -o a.out -std=c++11
#include <iostream>
#include <vector>
#include <algorithm>
using namespace std;

int binary_search(const vector<int> &data, int key) {
    int low = 0;
    int high = data.size()-1;
    while (low <= high) {
        int mid = int((low + high) / 2);
        if (key == data[mid])
            return mid;
        else if (key > data[mid])
            low = mid + 1;
        else
            high = mid - 1;
    }
    return -1;
}

int main() {
    vector<int> data = {1, 9, 2, 7, 4, 10, 3, 8, 5, 6};
    int key = 7;
    
    sort(data.begin(), data.end());
    
    for (auto &i : data)
        cout << i << " ";
    cout << "\n";

    int ret = binary_search(data, key);
    if (ret == -1)
        cout << "找不到\n";
    else
        cout << "找到索引值" << ret << "\n";
}

Merge Sort
stable
Divide and Conquer
Divide and Conquer

Homework assignment: Prove the master theorm in the book
(check between simplified master theorem) \(LaTex \rightarrow Overleaf\)

Quick Sort
unstable
only Divide
Quick Sort Pseudocode

QUICKSORT(A, p, r)
    if p < r
        q = PARTITION(A, p, r)
        QUICKSORT(A, p, q - 1)//老師投影片寫q而非q-1
        QUICKSORT(A, q + 1, r)
PARTITION(A, p, r)
    x = A[r]
    i = p - 1
    for j = p to r - 1
        if A[j] <= x
            i = i + 1
            exchange A[i] with A[j]
    exchange A[i+1] with A[r]
    return i + 1

Quick Sort provement
為了避免recursion太多導致負擔太大
可以設定當subproblem的長度小於x時使用insertion sort
以及為了減少worst case的發生 (T(n)=T(n-1)+n) => O(n^2))
使用Randomized Quick Sort隨機切分成兩個subproblem
例如T(n)=T(n/4)+T(3n/4)+n => O(nlogn)
但這方法仍可能有worst case

Analysis of Quick Sort
用Random Variable

10/7

Heap
Complete binary tree:
Full binary tree + leaf全靠左

Heap Sort
unstable

Counting Sort
stable

Radix Sort
stable
針對一堆整數，從最小位數到最小位數做Counting Sort
為啥這樣有效?
prove by induction:
induction on digit(位數)
假設後面i位都排好，考慮i+1位時，若兩筆資料大小不一樣則排序完成，否則兩筆資料一樣，則根據Counting Sort是stable的特性，後面i位數也都會是排好的。

Largest Gap in an array \(O(N)\)
非max-gap,關於min-gap/max-gap在Other Problem

# A python 3 program to find largest gap between
# two elements in an array.

# function to solve the given problem
def solve(a, n):

	min1 = a[0]
	max1 = a[0]

	# finding maximum and minimum of an array
	for i in range ( n):
	
		if (a[i] > max1):
			max1 = a[i]
		if (a[i] < min1):
			min1 = a[i]
	
	return abs(min1 - max1)

# Driver code
if __name__ == "__main__":

	arr = [ -1, 2, 3, 4, -10 ]
	size = len(arr)
	print("Largest gap is : " ,solve(arr, size))

# This code is contributed by chitranayal

Homework assignment: Leetcode K closet pair in \(\rm{O}(nlogn)\)
Using C/C++ as language

10/14

Selection Problem

若要找第i大的數字，使用sorting來解的話會保證O(nlogn)，為何是big-O而非big-omega?因為這個問題可能有非sorting的解法。

找最大、最小數是theta(n)因為必定要跟n-1個數字比較
但是同時找最大最小只需要3ceiling(n/2)而非2n-2
不過老師說他不知道怎做的不是重點

Homework assignment: Prove un-used nodes always more than used nodes

CLRS answer

Tree
- Binary search tree
- High balanced tree
  - AVL tree
  - Red-Black tree
  - 2-3 tree or 2-3-4 tree

期中考11月第一周 (11/4)

考題範圍:

10/21

2-3-4 tree vs red black tree
One disadvantage for using 2-3-4 tree is that there will have lots of nodes which are unused.
Waste memory, comparing with red balck tree.
Concatenable queue
This data structure can execute concatenate() and split() in O(logn)

Advanced Design and analysis techniques
- Dynamic Programming
- Greedy Approach
- Amortized Analysis

Development of the dynamic programming algorithm is broken into
a sequence of 4 steps.

Characterize the structure of an optimal solution.
Recursively define the value of an optimal solution.
Compute the value of an optimal solution in the bottom-up fashion.
Construct an optimal solution from computed information.

Dynamic Programming
- Assembly Line Scheduling
- Rod Cutting Problem
- Matrix Chain Multiplication
- Longest Common Subsequence
- Optimal Polygon Triangulation

Random Variable

The hiring problem
機率、期望值

11/11

Greedy Algorithm

Huffman Code

Amortized Analysis

Homework 03 Table Expansion and Contraction proof

原文書 17.4 Dynamic tables

some info.
some powerpoint

11/18

Union and Find

Homework assignment

Binomial Heap / Fibonacci Heap

期末整理

期末考 (12/30)

考題範圍:

螢光的有考

部份參考解答

Greedy algorithm

Dynamic programming: A sequence of choices, carefully make choice.
Greedy approach: Make choice that looks the best at the moment, and it leads to optimal solution.