# Chp 4 Threads
###### tags: `作業系統`
[TOC]

## Overview
![](https://i.imgur.com/5TkaGSv.png =80%x)

### Motivation (reasons to be completed)
1. Process creation:
    - heavy-weight
2. Thread creation:
    - light-weight

>[color=#00a000]
>
>context switch, memory allocate
>[name=Omnom]
>[color=#FF00FF]Creating a process requires allocating a new block of memory for it, which also makes passing data between processes more cumbersome. In contrast, different threads still share the same memory space (the main process).
>[name=Neko]
>[color=#000000]Increases the degree of parallelism in processing.
>[name=Peter]

* simplify code, increase efficiency
* <font color=#ff0000>**Exam**</font> Two cases in which multithreading does **not** provide better performance than a single-threaded solution:
    1. sequential program: e.g. calculating an individual tax return
    2. shell program: e.g. C-shell or Korn shell
* <font color=#ff0000>**Exam**</font> Multiple kernel threads provide better performance than a single-threaded solution:
    - on a **page fault**, another kernel thread can be switched in to use the interleaving time in a useful manner
    - a single-threaded process is not capable of performing useful work while a page fault is being serviced

### Benefits
* **Responsiveness** – allows continued execution if part of the process is blocked
* **Resource Sharing** – thread sharing is easier than shared memory or message passing
* **Economy** – cheaper than process creation; thread switching has lower overhead than context switching
* **Scalability** – the process can take advantage of **multiprocessor** architectures

## Multicore Programming
### Issues of Programming
1. Dividing activities
2. Balance
3. Data splitting
4. Data dependency
5. Testing and debugging

### Parallelism vs. Concurrency <font color=#ff0000>**Exam**</font>
* Parallelism: perform more than one task **simultaneously**; at least two threads are executing at the same instant, e.g. on a multicore processor
* Concurrency: support more than one task making progress; two or more tasks can start, run, and complete in overlapping time periods, e.g. multitasking on a single-core processor

![](https://i.imgur.com/xZqplyR.png =80%x)

>[color=#00A000]
>
>[Parallelism VS Concurrency](https://medium.com/mr-efacani-teatime/concurrency與parallelism的不同之處-1b212a020e30)
>[name=Omnom]

* Types
    1. Data parallelism
        + distribute **data** across multiple cores, e.g. the same task on multiple pieces of data (each core performs the same operation, just on different data)
    2. Task parallelism
        + distribute **tasks (threads)** across multiple cores, e.g. multiple different tasks on the same data (each core/thread performs a different task, possibly using the same data)

> [color=#000000] A nice example I came across:
>150 exam papers with 15 questions each are split among three TAs for grading. With **data parallelism**, each TA grades 50 whole papers; with **task parallelism**, each TA grades only the first, middle, or last 5 questions of all 150 papers. At first glance the workload per TA looks the same, but in the latter approach each TA becomes highly familiar with their 5 questions, so the grading finishes quickly; in the former, every TA has to grade all 15 questions and cannot specialize, so the performance gain is smaller. [Reference](https://cg2010studio.com/2011/10/05/data-parallelism-task-parallelism/)
>[name=Peter Peter]

![](https://i.imgur.com/77uz9PC.png =50%x)

### Amdahl’s Law
$S$: serial portion, $1-S$: parallel portion, $N$: number of processing cores
:::info
<font color=#000000>$speedup \leq \frac{1}{S+\frac{1-S}{N}}$</font>
:::
+ as $N \rightarrow \infty$, $speedup \rightarrow 1/S$
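+ worked example (illustrative numbers, not from the slides): if 75% of an application is parallelizable, then $S = 0.25$, and on $N = 4$ cores the bound gives $speedup \leq \frac{1}{0.25 + 0.75/4} \approx 2.29$; even as $N \rightarrow \infty$ the speedup can never exceed $1/S = 4$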
## Multithreading Models
* user_thread-to-kernel_thread mappings
1. Many-to-One (few systems use it)
    - many user-level threads mapped to a single kernel thread
    - Pro
        1. thread management is done by the thread library in user space, so it is efficient
        2. allows the developer to create as many user threads as she wishes
    - Con
        1. the entire process will block if a thread makes a blocking system call
        2. only one thread can access the kernel at a time → threads cannot run in parallel
    ![](https://i.imgur.com/DDIpUdf.png =60%x)
2. One-to-One
    - creating a user-level thread creates a kernel thread
    - Pro
        1. provides more concurrency
        2. allows multiple threads to run in parallel
    - Con
        1. a large number of kernel threads may burden system performance
    ![](https://i.imgur.com/tuMFjag.png =60%x)
3. Many-to-Many
    - Pro
        1. has the advantages of both Many-to-One & One-to-One
    - Con
        1. in practice it is difficult to implement
    ![](https://i.imgur.com/5Ih6Miu.png =60%x)
4. Two-level (variation of Many-to-Many)
    ![](https://i.imgur.com/KhouR9c.png =60%x)

<font color=#ff0000>**Exam**</font>
* number of kernel threads < number of processors:
    - some of the processors would remain idle
* number of kernel threads == number of processors:
    - all of the processors might be utilized simultaneously
* number of kernel threads > number of processors:
    - a blocked kernel thread can be swapped out in favor of another kernel thread that is ready to execute

## Thread Libraries (to be completed)
### Pthread
* **Specification**, not implementation

>[color=#FF00FF]It is called a "specification" because Pthreads is the thread API defined by POSIX (the Portable Operating System Interface). Any library that conforms to the POSIX specification can run on any system that supports POSIX; how the library is implemented internally does not really matter as long as its behavior conforms. For example, which multithreading model the library uses is not specified.
>POSIX was introduced because Unix had splintered into too many branches: there were many Unix-like operating systems, but programs could not always be ported between them. POSIX was created to solve that problem.
>[name=Neko]
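Below is a minimal Pthreads sketch (not from the course slides; the summing task and names such as `runner` are illustrative) showing the typical `pthread_create` / `pthread_join` pattern:

```c
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>

/* Illustrative worker: sums the integers 1..n passed in by the creator. */
static void *runner(void *param) {
    int n = *(int *)param;
    long *sum = malloc(sizeof(long));   /* result handed back to the joiner */
    *sum = 0;
    for (int i = 1; i <= n; i++)
        *sum += i;
    pthread_exit(sum);                  /* equivalent to returning sum */
}

int main(void) {
    pthread_t tid;          /* thread identifier */
    pthread_attr_t attr;    /* thread attributes (defaults here) */
    int n = 100;
    long *result;

    pthread_attr_init(&attr);
    pthread_create(&tid, &attr, runner, &n);   /* create the new thread */
    pthread_join(tid, (void **)&result);       /* wait and collect its result */

    printf("sum of 1..%d = %ld\n", n, *result);
    free(result);
    return 0;
}
```

Compile with `gcc -pthread`. Because all threads of a process share the same address space, passing a pointer to `n` is enough to hand data to the new thread.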
## Implicit Threading
* Concept: transfer the creation and management of threads from application developers to compilers and run-time libraries.
* Note: requires application developers to identify **tasks** (not threads) that can run in parallel

### Thread Pools
* Mechanism: create a number of threads in a pool
* Note: threads are **created in advance**, before requests arrive
* Pro
    1. servicing a request with an existing thread is often faster than creating a new thread
    2. the thread pool limits the number of threads that exist at any one time → bounds the number of concurrent threads
    3. separates the task to be performed from the mechanics of creating it → flexibility in how tasks are run

### OpenMP
* **API** for programs written in C, C++, or FORTRAN that provides support for parallel programming in shared-memory environments
* Mechanism: identifies parallel regions – blocks of code that can run in parallel

### Grand Central Dispatch
* Mechanism:
    1. identification of parallel sections (blocks)
    2. blocks are placed in a dispatch queue
    3. a block is assigned to an available thread in the thread pool when it is removed from the queue
* Types of dispatch queues
    1. serial
        + blocks are removed one at a time, in FIFO order
        + the per-process serial queue is called the main queue
    2. concurrent
        + blocks are removed in FIFO order, but several may run at once

## Threading Issues
### Signal Handling (to be completed)
* Procedure:
    1. A signal is generated by the occurrence of a particular event.
    2. The signal is delivered to a process.
    3. Once delivered, the signal must be handled.
* default signal handler
    1. run by the **kernel**
    2. can be overridden by a **user-defined signal handler**

### Thread Cancellation (to be completed)
Note: actual cancellation depends on thread state
1. **Asynchronous** cancellation terminates the target thread **immediately**
    - side effects: data may be left inconsistent, affecting information used by other threads

>[color=#00a000]
>
>Additional note (from the instructor):
>The target thread may be in the middle of unfinished work; for example, it may hold data shared with other threads. ~~Cancelling the target thread arbitrarily may leave data inconsistent ("affecting the correctness of another thread's information?").~~
>Terminating it abruptly may prevent its work from being handed over correctly, causing errors.
>[name=Omnom]
>
2. **Deferred** cancellation allows the target thread to **periodically check** whether it should be cancelled (see the sketch at the end of this chapter)

### Thread-Local Storage (TLS)
* Uniqueness: each thread has its own copy of the data
* TLS vs. local variables
    - visibility: across function invocations vs. only during a single function invocation

### Scheduler Activations
The Many-to-Many and Two-level models require communication between the kernel and the thread library
![](https://i.imgur.com/a2rIyo4.png =40%x)
Each LWP is attached to a kernel thread

<font color=#ff0000>**Exam**</font>
* **lightweight process (LWP)**:
    + an intermediate **data structure** between user and kernel threads (to the user process it acts as a ~~virtual process~~ virtual processor)
    + user threads must be mapped onto kernel threads in order to be scheduled
>[color=#FF00FF]I'm not sure whether the description above means the same thing I do, but here is a supplement.
>We can think of an LWP as a virtual processor that runs the process; inside the process are user threads. One LWP can serve several threads of the same process, and those threads are scheduled on the LWP (that is, the thread library decides how they run).
>As mentioned in class, the OS does not know how many user threads exist, and they are not its concern. What the OS does is schedule kernel threads, and every LWP is attached to a kernel thread.
>Each level schedules a different kind of object.
>[name=Neko]
* **upcalls**: a communication mechanism from the kernel to the **upcall handler** in the thread library
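As a companion to the **Thread Cancellation** section above (referenced there), here is a minimal, illustrative Pthreads sketch of deferred cancellation; the empty work loop is a placeholder:

```c
#include <pthread.h>
#include <stdio.h>
#include <unistd.h>

/* Worker that opts in to deferred cancellation and polls for it. */
static void *worker(void *arg) {
    int oldtype;
    (void)arg;
    /* Deferred is the default cancellation type, but make it explicit. */
    pthread_setcanceltype(PTHREAD_CANCEL_DEFERRED, &oldtype);

    for (;;) {
        /* ... do one unit of work (placeholder) ... */

        /* Safe point: honor a pending cancellation request only here,
           after the current unit of work is finished. */
        pthread_testcancel();
    }
    return NULL;   /* not reached */
}

int main(void) {
    pthread_t tid;

    pthread_create(&tid, NULL, worker, NULL);
    sleep(1);                 /* let the worker run for a while */
    pthread_cancel(tid);      /* request, not force, cancellation */
    pthread_join(tid, NULL);  /* returns once the worker reaches a cancellation point */

    puts("worker cancelled at a safe point");
    return 0;
}
```

With asynchronous cancellation (`PTHREAD_CANCEL_ASYNCHRONOUS`) the thread could instead be terminated in the middle of updating shared data, which is exactly the hazard noted in that section.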