2016q3 Homework2 (phonebook-concurrent)

# 2016q3 Homework2 (phonebook-concurrent) contribute by <`bobsonlin`> ## 前言 ## 理解程式碼先從main.c作為理解程式的開頭吧！ ```C= #ifndef OPT FILE *fp; int i = 0; char line[MAX_LAST_NAME_SIZE]; #else struct timespec mid; #endif ``` * 優化版本由於是讓4個thread各自append整個word.txt，append的方式不同，所以只有未優化版本需要fp, line[MAX_LAST_NAME_SIZE] ```C= #ifndef OPT /* check file opening */ fp = fopen(DICT_FILE, "r"); if (!fp) { printf("cannot open the file\n"); return -1; } #else #include "file.c" #include "debug.h" #include <fcntl.h> #define ALIGN_FILE "align.txt" file_align(DICT_FILE, ALIGN_FILE, MAX_LAST_NAME_SIZE); //why the file must be aligned ? int fd = open(ALIGN_FILE, O_RDONLY | O_NONBLOCK); off_t fs = fsize( ALIGN_FILE); #endif ``` * 在讀word.txt前，我們必須將word.txt做對齊，原因是若沒將word.txt裡的每個字串做長度的對齊，在將字串投射至memory時，我們會不知道該如何移動pointer才會指到下一個字串。 * file_align(), fsize()是此程式作者自己寫的function，定義在file.c * 我們用open()來得到word.txt的file descriptor，是為了將來的mmap() ```C== char *map = mmap(NULL, fs, PROT_READ, MAP_SHARED, fd, 0); assert(map && "mmap error"); /* allocate at beginning */ entry *entry_pool = (entry *) malloc(sizeof(entry) * fs / MAX_LAST_NAME_SIZE); assert(entry_pool && "entry_pool error"); pthread_setconcurrency(THREAD_NUM + 1); pthread_t *tid = (pthread_t *) malloc(sizeof( pthread_t) * THREAD_NUM); append_a **app = (append_a **) malloc(sizeof(append_a *) * THREAD_NUM); ``` * 接下來第一步就是將word.txt mapping至記憶體，回傳值map是字典檔mapping後的記憶體起始位址(可以利用此pointer來search字典檔裡的每個lastname) * entry_pool的概念是，一個lastname會有一個entry，entry_pool就是建了一大長串的entry來裝word.txt裡的每個lastname。 * pthread_setconcurrency告知系統需要的concurrency level([詳見參考資料](https://linux.die.net/man/3/pthread_setconcurrency)) * app[i]是指第i個thread在word.txt裡所建的append list ```C== for (int i = 0; i < THREAD_NUM; i++) app[i] = new_append_a(map + MAX_LAST_NAME_SIZE * i, map + fs, i, THREAD_NUM, entry_pool + i); clock_gettime(CLOCK_REALTIME, &mid); for (int i = 0; i < THREAD_NUM; i++) pthread_create( &tid[i], NULL, (void *) &append, (void *) app[i]); for (int i = 0; i < THREAD_NUM; i++) pthread_join(tid[i], NULL); ``` * 第一個for loop是在初始化每個app[i] * 第二、三個for loop的結果是讓4個thread各自append字典檔裡的lastname 到此處我們需先停一下，解釋現在的狀況： * 執行完第一個loop後，情況如下圖所示 ![](https://i.imgur.com/eZKAuP6.jpg) * 接著4個thread要各自執行append()，所以必須先了解append的作法 ```C== void append(void *arg) { struct timespec start, end; double cpu_time; clock_gettime( CLOCK_REALTIME, &start); append_a *app = (append_a *) arg; int count = 0; entry *j = app->entryStart; for (char *i = app->ptr; i < app->eptr; i += MAX_LAST_NAME_SIZE * app->nthread, j += app->nthread,count++) { app->pLast->pNext = j; app->pLast = app->pLast->pNext; app->pLast->lastName = i; dprintf("thread %d append string = %s\n", app->tid, app->pLast->lastName); app->pLast->pNext = NULL; } clock_gettime(CLOCK_REALTIME, &end); cpu_time = diff_in_second(start, end); dprintf("thread take %lf sec, count %d\n", cpu_time, count); pthread_exit(NULL); } ``` 此函式的for loop內在做兩件事 1. 在經過mapping的memory區域中(就是map指向的memory區域)，找到自己thread要的對應位置的lastname，將之存入entry內。 2. 串接每個entry，使之形成一個list 當4個thread append完字典檔裡的所有字後，會執行下方的程式 ```C== entry *etmp; pHead = pHead->pNext; for (int i = 0; i < THREAD_NUM; i++) { if (i == 0) { pHead = app[i]->pHead->pNext; dprintf("Connect %d head string %s %p\n", i, app[i]->pHead->pNext->lastName, app[i]->ptr); } else { etmp->pNext = app[i]->pHead->pNext; dprintf("Connect %d head string %s %p\n", i, app[i]->pHead->pNext->lastName, app[i]->ptr); } etmp = app[i]->pLast; dprintf("Connect %d tail string %s %p\n", i, app[i]->pLast->lastName, app[i]->ptr); dprintf("round %d\n", i); } ``` 此程式目的是將4個thread所建立的append list串接起來，形成一個大list，以供之後的findname使用之後的程式碼就沒做太多修改，findname的整體方式也沒有變(還是有小修改，詳見[video](https://www.youtube.com/watch?v=sWtWslsr4Ac))，就是在entry_pool裡的逐一比對每個entry內的lastname ## 程式碼錯誤之處此程式碼有一個很明顯的錯誤，在main.c的第105行程式裡: ps.為了完整性，我在此附上整個片斷的程式 ```C== for (int i = 0; i < THREAD_NUM; i++) { if (i == 0) { pHead = app[i]->pHead->pNext; dprintf("Connect %d head string %s %p\n", i, app[i]->pHead->pNext->lastName, app[i]->ptr); } else { etmp->pNext = app[i]->pHead->pNext; dprintf("Connect %d head string %s %p\n", i, app[i]->pHead->pNext->lastName, app[i]->ptr); } etmp = app[i]->pLast; dprintf("Connect %d tail string %s %p\n", i, app[i]->pLast->lastName, app[i]->ptr); dprintf("round %d\n", i); } ``` 此處很明顯看到，pHead被assign給app[0]的第二個entry的位址(跳過了第一個entry)，所以之後的findName函式是無法找到word.txt裡的第一個lastname。除此之外，觀察程式中的etmp可以發現，thread 1所建立的append list的尾端式連接到thread 2的append list的第二個entry，同理thread 3, thread 4。所以透過findName是找不到word.txt的第二個lastname。解決方法為 ```C== for (int i = 0; i < THREAD_NUM; i++) { if (i == 0) { pHead = entry_pool; dprintf("Connect %d head string %s %p\n", i, app[i]->pHead->pNext->lastName, app[i]->ptr); } else { etmp->pNext = app[i]->pHead; dprintf("Connect %d head string %s %p\n", i, app[i]->pHead->pNext->lastName, app[i]->ptr); } etmp = app[i]->pLast; dprintf("Connect %d tail string %s %p\n", i, app[i]->pLast->lastName, app[i]->ptr); dprintf("round %d\n", i); } ``` ## 驗證程式碼 thread 2: ![](https://i.imgur.com/ddLqzpP.png) thread 4: ![](https://i.imgur.com/5OF7RYp.png) thread 6: ![](https://i.imgur.com/NjAZOcB.png) thread 8: ![](https://i.imgur.com/oDuVgZe.png) 由上方的結果發現，兩個thread的效果最好 ## 進度 1.修改fgets讓append()執行時間縮短 -- 已完成[10/2]，記錄在[A01: phonebook](https://hackmd.io/JwRgxghgTADMBsBaAJgDhlRAWKB2ArIqiBDNiWLgKYjBSj5A) >> 記得附上超連結，以利日後追蹤 [name=jserv] >>已附上！ [name=林伯陞] 2.理解[phonebook-concurrent](https://github.com/sysprog21/phonebook-concurrent)的程式 - 已完成 [10/4] 3.程式驗證，附上結果的圖表 - 已完成[10/4] >> 這樣的圖表沒有比較價值，重新製作！針對 append() 並調整 Y 軸和測試次數，一如 compute-pi 和 clz 的作法 [name=jserv] 4.理解thread變多但速度變慢的原因，以及sync/ansyc >> 拿出 perf 來分析! [name=jserv] ## 疑問點 ## 參考資料 1. [吳彥寬同學的HackMD](https://hackmd.io/MYNgnCDsCsYEwFoAmInQQFngMwQQwCM5dI88BmAU3LnIEYlsAGIA) ###### tags: `bobsonlin`

Syntax	Example	Reference
# Header	Header	基本排版
- Unordered List	Unordered List
1. Ordered List	Ordered List
- [ ] Todo List	Todo List
> Blockquote	Blockquote
Bold font	Bold font
Italics font	Italics font
~~Strikethrough~~	~~Strikethrough~~
19^th^	19^th
H~2~O	H₂O
++Inserted text++	Inserted text
==Marked text==	Marked text
[link text](https:// "title")	Link
![image alt](https:// "title")	Image
`Code`	`Code`	在筆記中貼入程式碼
```javascript var i = 0; ```	`var i = 0;`
:smile:		Emoji list
{%youtube youtube_id %}	Externals
$L^aT_eX$	L^aT_eX
:::info This is a alert area. :::	This is a alert area.