Convlusion Cl Codes vstore2 and vload2 function __kernel void default_function_kernel0(__global float* restrict A, __global float* restrict W, __global float* restrict B) { float B_local[64]; __local float Apad_shared[8]; __local float W_shared[128]; float Apad_shared_local[8]; float W_shared_local[8]; for (int ff_c_init = 0; ff_c_init < 4; ++ff_c_init) {
12/14/2021Simulation Platform: https://drive.google.com/file/d/16xIMCTcmbXTVWEEe6Ru0KnbB37kOYD_j/view?usp=sharing Architecture Overview Manual Password Ubuntu password: caslabgpu
4/9/2021REQUIREMENTS The processor core operating speed is targeting at least 75 MHz for post-synthesized netlist. Its instruction set shall have at least 45 instructions, including branch, I/O instructions. IM + DM cannot be larger than 320KB The silicon area of the CPU+ICache+DCache+IM+DM shall be confined within 110 mm2 in total EPU shall be synthesized and constrained within 3 mm2 The kernel of the chip shall be less than 120 mm2 The read/write access time of an off-chip not-synthesized memory, usually DRAM, is 60ns The main memory only has one read/write port with bit width of 32
12/18/2020標題: 前瞻 GPGPU XXXXXXX (待定) 目標: 完成 GPGPU IP Design 內容 目標介紹時可以先簡介目前市面上現有技術 (Ex: NVIDIA, AMD),再來介紹我們要達成的目標以及實際上要如何達成,不用介紹到很 detail ,但大方向和如何實作要解釋的合理且看得懂,多附圖說明 orz 現有技術介紹很多文件可以參考,整理排版一下複製貼上就好 能加的幹話可以加多一點 目標介紹篇幅要長一點,一個項目約 3~5 頁 word 請大家在 ==12/7== 前給我初版
12/16/2020or
By clicking below, you agree to our terms of service.
New to HackMD? Sign up