開發紀錄 (1): 環境架設

# 開發紀錄 (1): 環境架設 contributed by < [`charliechiou`](https://github.com/charliechiou) > < [`EricccTaiwan`](https://github.com/EricccTaiwan) > ## 關閉 E-core 電腦的 cpu 版本如果有 12-th 後[大小核](https://www.intel.com.tw/content/www/tw/zh/support/articles/000091896/processors.html)，會影響到實驗結果，需要把小核關掉 - 查看大小核 (P/E core) ```shell $ lstopo ``` ![image](https://hackmd.io/_uploads/S14Vs6oyll.png) [二：Intel系列主機板如何關閉CPU部分核心（即E-core）？](https://www.asus.com/hk/support/faq/1054283/#222) ![image](https://hackmd.io/_uploads/HkxB7BA1ex.png=30%x) :::spoiler ```$ $ uname -r 6.14.0-16-generic $ lsb_release -a No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 25.04 Release: 25.04 Codename: plucky ``` ```shell $ neofetch $ lstopo ``` **關閉 E-core 前** - CPU: 12th Gen Intel i9-12900K (24) @ 5.100GHz ![image](https://hackmd.io/_uploads/SyIUqkh1ge.png=30%x) ![image](https://hackmd.io/_uploads/r1aWUAoygg.png=10%x) **關閉 E-core 後** - CPU: 12th Gen Intel i9-12900K (16) @ 5.100GHz ![image](https://hackmd.io/_uploads/r1WKMSAkxg.png=30%x) ![image](https://hackmd.io/_uploads/B1pkQSC1ee.png=30%x) ::: ## 版本及 sched_ext 支援 ### 核心版本要求 [sched_ext is supported by the upstream kernel starting from version 6.12](https://arighi.blogspot.com/search?updated-max=2025-05-01T00%3A17%3A00%2B02%3A00&max-results=1)，因此核心版本要求 6.12+，至於 6.13、6.14 的核心版本在 scx 上的表現是否有差異? [Tejun Heo](https://github.com/htejun) (scx 的 maintainer) 給了以下回覆， > There are new features introduced which may improve performance in some cases (e.g. queued_wakeup support) but for the most part, the kernel versions wouldn't cause noticeable differences. All schedulers should work fine across the kernel versions. [name=Tejun Heo] ### 開發環境的核心及 Ubuntu 版本 > 我們將 server 從 Ubuntu 24.04 $\to$ 24.10 $\to$ 25.04 一路向上升級; 如果空間有限，也可用舊版 Ubuntu 搭配 kernel v6.12+ 作為實驗環境 (e.g. Ubuntu 24.04 w/ kernel v6.12) ```shell OS: Ubuntu 25.04 x86_64 Kernel: 6.14.0-16-generic ``` #### 方法一 : 升級至 Ubuntu 25.04 ```shell $ sudo apt update $ sudo apt upgrade $ sudo apt dist-upgrade $ sudo apt install update-manager-core ``` 目前最新的 LTS 為 Ubuntu 24.04，因此需調整升級設定以對應此版本： ```shell $ sudo vim /etc/update-manager/release-upgrades ``` 把最後一列的 `Prompt=lts` 改成 `Prompt=normal`， ```diff [DEFAULT] # Default prompting and upgrade behavior, valid options: # # never - Never check for, or allow upgrading to, a new release. # normal - Check to see if a new release is available. If more than one new # release is found, the release upgrader will attempt to upgrade to # the supported release that immediately succeeds the # currently-running release. # lts - Check to see if a new LTS release is available. The upgrader # will attempt to upgrade to the first LTS release available after # the currently-running one. Note that if this option is used and # the currently-running release is not itself an LTS release the # upgrader will assume prompt was meant to be normal. - Prompt=lts + Prompt=normal ``` 接著執行 ```shell $ sudo do-release-upgrade -d # 開始升級 Ubuntu ``` 升級後檢查是否升級成功 ```shell $ lsb_release -a ``` 由於每次升級僅能跨一個版本，因此若從 24.04 升級至 25.04，需先升級至 24.10，再進一步升級至 25.04，共需進行兩次升級。 #### 方法二 : 僅下載 kernel v6.12 先下載對應的核心檔案 ```shell $ wget https://cdn.kernel.org/pub/linux/kernel/v6.x/linux-6.12.tar.xz $ tar -xf linux-6.12.tar.xz $ cd linux-6.12 ``` 接著把原本作業系統中的設定檔複製並編譯 ```shell $ cp /boot/config-$(uname -r) .config $ make menuconfig ``` 編譯核心(可能會花上很多時間) ```shell $ make -j$(nproc) $ sudo make modules_install $ sudo make install ``` 最後更新 grub 並重開機。 ```shell $ sudo update-grub $ sudo reboot ``` 確認升級後的核心版本 ```shell $ uname -r # 要出現 6.12 ``` ### 確認支援 sched_ext 確認目前核心是否支援 sched_ext ```shell $ ls /sys/kernel/ | grep sched_ext ``` 確認目前 sched_ext 的狀態 ```shell $ cat /sys/kernel/sched_ext/state ``` 以 `scx_rustland` 為例（後續會提到如何編譯 `scx_rustland` 並執行）執行 `scx_rustland` 前後輸出結果如下： ```shell $ # Before attached scx_rustland $ cat /sys/kernel/sched_ext/state disabled $ # After attached scx_rustland $ cat /sys/kernel/sched_ext/state enabled ``` ### 下載 sched-ext (簡稱 scx) > [sched-ext/scx: README](https://github.com/sched-ext/scx?tab=readme-ov-file#sched_ext-schedulers-and-tools) 可以參考 [Build & Install](https://github.com/sched-ext/scx?tab=readme-ov-file#build--install)，下方以 Ubuntu/Debian 環境為例 #### 1. 下載 `meson` > Note: Many distros only have earlier versions of meson, in that case just clone the meson repo and call `meson.py` > e.g. `/path/to/meson/repo/meson.py compile -C build` . > > Alternatively, use pip e.g. `pip install meson` or `pip install meson --break-system-packages` (if needed). #### 2. 下載 dependencies ```shell $ sudo apt install build-essential libssl-dev llvm lld libelf-dev meson cargo rustc clang llvm cmake pkg-config protobuf-compiler ``` #### 3. Static linking against libbpf (preferred) > 這個方式，C 和 Rust 都可以編譯完成 ```shell $ cd ~/scx $ meson setup build --prefix ~ $ # 以下每次更新 code 完都需要重跑 (編譯前記得存檔) $ meson compile -C build $ sudo meson install -C build $ # $ meson setup --wipe build ## re-config 才需要或是 build 失敗 ``` > meson always uses a separate build directory. Running the following commands in the root of the tree builds and installs all schedulers under `~/bin`. 執行 meson compile 後會把執行檔及中間的建構檔存在對應的資料夾中，若要把可執行的檔案安裝下來則需要使用 meson install 並透過 `-C` 切換到 build 的資料夾中做安裝，而安裝的位置則是前述用 setup 設定的位置 (i.e., `~`)。接著執行 `sudo ~/bin/<schedule name>` 便可執行對應的 scheduler。 >詳細的安裝步驟可以查看 `meson.build` 的腳本，其中在 `if enable_rust` 的段落中 >```rust > cargo_cmd = [cargo, 'build', '--manifest-path=@INPUT@', '--target->dir=@OUTDIR@', cargo_build_args] >``` >會把 rust 的排程器建構在 `@OUTDIR@` ，而 `@OUTDIR@` 則是由 meson 傳入的 target。因此也可以在 target 資料夾中的 `release-fast` 中找到我們的 rust 排程器。另一方面 c 語言所建構的排程器則會安置在 build 資料夾底下。 ## 工具介紹 ### scxtop ```shell $ sudo LC_ALL=C scxtop # 執行 scxtop ``` 會出現下方的 TUI 界面，按下 `q` 即可離開 ![image](https://hackmd.io/_uploads/SJ-5FJyelx.png) 執行 scx 排程器的過程中，按下 `a` 可以儲存 trace 檔案，接著打開 [Perfetto UI](https://ui.perfetto.dev/#!/viewer) ，點選左側的 `Open trace file` 開啟剛剛生成的檔案，便可看到下圖的 CPU 排程結果。 ![image](https://hackmd.io/_uploads/SkAXuJtxge.png) 若不想使用到 `scxtop` 提供的 TUI 介面，也可以利用以下指令生成檔名為 test 的二進制檔 ```shell $ sudo scxtop trace --trace-ms 5000 --output-file test ``` ### Perfetto ![image](https://hackmd.io/_uploads/B1_uY1Felg.png) 詳細的測試分析，可見[開發紀錄 (2)： FIFO/RR scheduler](https://hackmd.io/@cce-underdogs/linux-exp2)，這邊只介紹幾個重要的參數，同時也是我們踩過的地雷， - [Wall duration](https://blog.csdn.net/f2006116/article/details/107581327) (ms) - 代表一個 Process 的總共持續時間 - Wall Avg duration (ms) - :::warning [代表一個 Process 在被中斷前的平均持續時間，而不是平均的 time slice。](https://hackmd.io/@cce-underdogs/prob-1#Clarification) (這點很重要!!!) ::: - Occurences - Process 執行期間的中斷次數  ## Ref * [Linux 核心專題: CPU 排程器 by vax-r](https://hackmd.io/@sysprog/BJdyxvxX0) * [Linux 核心專題: sched_ext 研究 by otteryc](https://hackmd.io/@sysprog/H1u6D9LI0) * [arighi's blog - Ubuntu 25.04 is now sched_ext ready](https://arighi.blogspot.com/) * [sched-ext/scx](https://github.com/sched-ext/scx) * [eBPF 隨筆（七）：sched_ext](https://medium.com/@ianchen0119/ebpf-%E9%9A%A8%E7%AD%86-%E4%B8%83-sched-ext-f7b60ea28976) * [Perfetto Docs](https://perfetto.dev/docs/) * [課程討論區相關文章](https://www.facebook.com/share/p/19MQee4kSX/)