## Paper Writing ## Introduction ## Background 1. Graph Computing (Where paralleism is coming in graph from) CSR potential problem -- Discuss fundamentals of graph and its application algorithm like page rank, single source shortest path and etc briefly --Highlight non-amortization of memory access time in graph computation 3. GPU Architecture (Describe stream multiprocessor in detail) 4. GPU Execution Model 5. Microarchitectural Limitation of GPU for Graph Processing Load --Highlighting memory coalescing, uncoalescing and load imbalance. --Enlist inefficiencies of vertex-centric graph parallel computing in GPU --Underutilization of SIMD (Load Imbalance and branch divergence issues) --Discuss "Scalarized vertex-centric parallel graph computing" method from "Graph waving architecture paper" in detail 7. Enlist few bottlenecks in performance imporvement -- Amdhal's law (Law of diminishing return): Illustrate it in current context of graph computation on GPU. And explain upper performance improvement limit from proposed method ## Related Work (Literature Survey) 1. Discuss utilization of SIMD unit reported by other authors for Graph computation ## Charaacterization section ## Proposed Architecture ## Table , PTS assembly code, DID library