2022q1 Homework1 (quiz1)

# 2022q1 Homework1 (quiz1) contributed by < `Wallmountain` > ## Q1 Leetcode 題目 : [TwoSum](https://leetcode.com/problems/two-sum/) ### Structure ```cpp struct hlist_node { struct hlist_node *next, **pprev; }; struct hlist_head { struct hlist_node *first; }; typedef struct { int bits; struct hlist_head *ht; } map_t; struct hash_key { int key; void *data; struct hlist_node node; }; ``` :::danger `pprev` 的存在不是為了讓程式碼「更簡潔」，有其他更關鍵的作用。 :notes: jserv ::: - ~~利用指標的指標實作 ```pprev```，讓程式碼更簡潔~~ - 在 hlist_head 中不需要指標指向前一個 node, 所以將hash table 的 structure 分成 ```hlist_node``` 和 ```hlist_head```, 若 ```hlist_node``` 中使用指標 ```struct hlist_node *prev``` , 則在刪除 node 時，需要使用 branch 去判斷 ```prev``` 是否有指向前一個 node, 故選擇使用指標的指標消除 branch - ```data``` 的 type 為 ```void *``` 能夠讓 hash table 去 fit 不同 type 的資料 ### container_of ```cpp #define container_of(ptr, type, member) \ ({ \ void *__mptr = (void *) (ptr); \ ((type *) (__mptr - offsetof(type, member))); \ }) ``` - 使用 ```container_of``` 從指標取得 ```struct hash_key``` 的 ```data``` 和 ```key``` - ```offsetof``` 用來取得子物件在物件中的 offset (in bytes) ### map_init ```cpp #define MAP_HASH_SIZE(bits) (1 << bits) map_t *map_init(int bits) { map_t *map = malloc(sizeof(map_t)); if (!map) return NULL; map->bits = bits; map->ht = malloc(sizeof(struct hlist_head) * MAP_HASH_SIZE(map->bits)); if (map->ht) { for (int i = 0; i < MAP_HASH_SIZE(map->bits); i++) (map->ht)[i].first = NULL; } else { free(map); map = NULL; } return map; } ``` - hash table 大小 : 2^bit^ ### map_deinit ```cpp void map_deinit(map_t *map) { if (!map) return; for (int i = 0; i < MAP_HASH_SIZE(map->bits); i++) { struct hlist_head *head = &map->ht[i]; for (struct hlist_node *p = head->first; p;) { struct hash_key *kn = container_of(p, struct hash_key, node); struct hlist_node *n = p; p = p->next; if (!n->pprev) /* unhashed */ goto bail; struct hlist_node *next = n->next, **pprev = n->pprev; *pprev = next; if (next) next->pprev = pprev; n->next = NULL, n->pprev = NULL; bail: free(kn->data); free(kn); } } free(map); } ``` - 走訪 map 中所有的 ```hlist_head``` 和 ```hlist_node``` - 每走訪到一個 node, 若 ```n->next``` 存在，便將 ```n->next->pprev``` 指向 next node 本身 - 每走訪到一個 node, free 當前的節點 ```graphviz digraph { rankdir = LR; node [shape=record]; subgraph cluster { first; label = "heads[i]"; } a [label="<body>hlist_node a|{pprev|<n>next}"]; b [label="<body>hlist_node b|{pprev|<n>next}"]; c [label="<body>hlist_node c|{pprev|<n>next}"]; node [shape=none]; none [label=""]; edge [style="invis"]; first -> a -> b -> c; edge [style=""]; first -> a:body; a:n -> b:body; b:n -> c:body; c:n -> none; edge [color="blue"]; a:p -> first; b:p -> a:n; c:p -> b:n; } ``` ```graphviz digraph { rankdir = LR; node [shape=record]; subgraph cluster { first; label = "heads[i]"; } a [label="<body>hlist_node a|{pprev|<n>next}"]; b [label="<body>hlist_node b|{pprev|<n>next}"]; c [label="<body>hlist_node c|{pprev|<n>next}"]; node [shape=none]; none [label=""]; edge [style="invis"]; first -> a -> b -> c; edge [style=""]; first -> a a:n -> NULL b:n -> c:body; c:n -> none; edge [color="blue"]; a:p -> NULL b:p -> b:body; c:p -> b:n; } ``` ```graphviz digraph { rankdir = LR; node [shape=record]; subgraph cluster { first; label = "heads[i]"; } b [label="<body>hlist_node b|{pprev|<n>next}"]; c [label="<body>hlist_node c|{pprev|<n>next}"]; node [shape=none]; none [label=""]; edge [style="invis"]; b -> c; edge [style=""]; first -> NULL b:n -> c:body; c:n -> none; edge [color="blue"]; b:p -> b:body; c:p -> b:n; } ``` ### map_get ```cpp #define GOLDEN_RATIO_32 0x61C88647 static inline unsigned int hash(unsigned int val, unsigned int bits) { /* High bits are more random, so use them. */ return (val * GOLDEN_RATIO_32) >> (32 - bits); } static struct hash_key *find_key(map_t *map, int key) { struct hlist_head *head = &(map->ht)[hash(key, map->bits)]; for (struct hlist_node *p = head->first; p; p = p->next) { struct hash_key *kn = container_of(p, struct hash_key, node); if (kn->key == key) return kn; } return NULL; } void *map_get(map_t *map, int key) { struct hash_key *kn = find_key(map, key); return kn ? kn->data : NULL; } ``` - ```hash``` 為 hash function ### map_add ```cpp void map_add(map_t *map, int key, void *data) { struct hash_key *kn = find_key(map, key); if (kn) return; kn = malloc(sizeof(struct hash_key)); kn->key = key, kn->data = data; struct hlist_head *h = &map->ht[hash(key, map->bits)]; struct hlist_node *n = &kn->node, *first = h->first; AAA; if (first) first->pprev = &n->next; h->first = n; BBB; } ``` - 若 data 不在 hash table 中, 則新增 node 加入 hash table ```cpp struct hlist_head *h = &map->ht[hash(key, map->bits)]; struct hlist_node *n = &kn->node, *first = h->first; AAA; if (first) first->pprev = &n->next; h->first = n; BBB; ``` - h 為data對應要存入的hash table bucket - 當要在 hash table 中加入 node 時，因為只有指向頭的指標，故這邊為 insert node into head ```graphviz digraph { rankdir = LR; node [shape=record]; subgraph cluster { first; label = "heads[i]"; } a [label="<body>hlist_node a|{pprev|<n>next}"]; b [label="<body>hlist_node b|{pprev|<n>next}"]; c [label="<body>hlist_node c|{pprev|<n>next}"]; node [shape=none]; none [label=""]; edge [style="invis"]; first -> a -> b; edge [style=""]; first -> c:body; a:n -> b:body; b:n -> none; edge [color="blue"]; a:p -> c:n; b:p -> a:n; } ``` - 以上程式碼分別還少以下兩個操作將 ```node->next``` 指向 ```head->next``` ```graphviz digraph { rankdir = LR; node [shape=record]; subgraph cluster { first; label = "heads[i]"; } a [label="<body>hlist_node a|{pprev|<n>next}"]; b [label="<body>hlist_node b|{pprev|<n>next}"]; c [label="<body>hlist_node c|{pprev|<n>next}"]; node [shape=none]; none [label=""]; edge [style="invis"]; first -> c -> a -> b; edge [style=""]; first -> c:body; a:n -> b:body; b:n -> none; c:n -> a:body; edge [color="blue"]; a:p -> c:n; b:p -> a:n; } ``` 將```node->pprev``` 指向 ```head->first``` ```graphviz digraph { rankdir = LR; node [shape=record]; subgraph cluster { first; label = "heads[i]"; } a [label="<body>hlist_node a|{pprev|<n>next}"]; b [label="<body>hlist_node b|{pprev|<n>next}"]; c [label="<body>hlist_node c|{pprev|<n>next}"]; node [shape=none]; none [label=""]; edge [style="invis"]; first -> c -> a -> b; edge [style=""]; first -> c:body; a:n -> b:body; b:n -> none; c:n -> a:body; edge [color="blue"]; a:p -> c:n; b:p -> a:n; c:p -> first } ``` - AAA = ```n->next = first```, BBB = ```n->pprev = &h->first``` ### twoSum ```cpp int *twoSum(int *nums, int numsSize, int target, int *returnSize) { map_t *map = map_init(10); *returnSize = 0; int *ret = malloc(sizeof(int) * 2); if (!ret) goto bail; for (int i = 0; i < numsSize; i++) { int *p = map_get(map, target - nums[i]); if (p) { /* found */ ret[0] = i, ret[1] = *p; *returnSize = 2; break; } p = malloc(sizeof(int)); *p = i; map_add(map, nums[i], p); } bail: map_deinit(map); return ret; } ``` 若 ```target - nums[i]``` 存在 map 中, 則代表找到解 ```cpp int *p = map_get(map, target - nums[i]); if (p) { /* found */ ret[0] = i, ret[1] = *p; *returnSize = 2; break; } ``` 否則, 將 ```nums[i]``` 加入 map 中 ```cpp p = malloc(sizeof(int)); *p = i; map_add(map, nums[i], p); ``` ### 延伸題目 2 :::info 研讀 Linux 核心原始程式碼 include/linux/hashtable.h 及對應的文件 How does the kernel implements Hashtables?，解釋 hash table 的設計和實作手法，並留意到 tools/include/linux/hash.h 的 GOLDEN_RATIO_PRIME，探討其實作考量 ::: :::success TODO : 持續更新 ::: ## Q2 Leetcode 題目 : [Remove Duplicates from Sorted List II](https://leetcode.com/problems/remove-duplicates-from-sorted-list-ii/) - 原程式碼 ```cpp #include <stddef.h> struct ListNode { int val; struct ListNode *next; }; struct ListNode *deleteDuplicates(struct ListNode *head) { if (!head) return NULL; if (head->next && head->val == head->next->val) { /* Remove all duplicate numbers */ while (head->next && head->val == head->next->val) head = head->next; return deleteDuplicates(head->next); } head->next = deleteDuplicates(head->next); return head; } ``` ### 延伸問題 1 :::info 嘗試避免遞迴，寫出同樣作用的程式碼 ::: - 使用指標的指標, 讓 remove node 不需要 prev node, 也不需要在一開始判斷是否 ```head==NULL```,讓程式碼更簡潔 - 用外層的while 迴圈代替遞迴 ```cpp struct ListNode *deleteDuplicates(struct ListNode *head) { struct ListNode **node = &head; while (*node) { /* Remove all duplicate numbers */ if ((*node)->next && (*node)->val == (*node)->next->val) { while((*node)->next && (*node)->val == (*node)->next->val) { *node = (*node)->next; } *node = (*node)->next; } else node = &((*node)->next); } return head; } ``` > 思考如何寫出更精簡的程式碼，一樣是 indirect pointer 的應用 > :notes: jserv ### 延伸問題 2 :::info 以類似 Linux 核心的 circular doubly-linked list 改寫，撰寫遞迴和迭代 (iterative) 的程式碼 ::: #### 遞迴 ```cpp ``` #### 迭代 ```cpp ``` ## Q3 Leetcode 題目 : [LRU Cache](https://leetcode.com/problems/lru-cache/)