# Master’s Thesis Proposal Edit Proposal on Overleaf: https://www.overleaf.com/2497778233nqxqdzkbmbqb Github repository: https://github.com/walzph/gpu-accelerated-medecom ### What is DNA methylation? - robust mehtods exist for detection and quantification of methylation data - mirrors functional state of a cell - characteristic methylation profile (methylome) - occur at CpG dinucleotides - cell-type specific patterns - influenced by - individual genetic constitution, gender, age, environment - important for comparative studies ### Why MeDeCom ? - can decompose raw methylome data into latend **DNA methylation components (LMCs)** - computational framework using: - regularized non-negative matrix factorization - biologically motivated regularizer - favors LMC with per-CpG values close to one or zero (i.e. methylated or unmethylated) - key element for accurate estimation - still works if only mixtures of different cell types / no purified references available - Matrix D represents CpG sites #### Resources > Website: http://public.genetik.uni-sb.de/medecom/ > Paper: https://genomebiology.biomedcentral.com/articles/10.1186/s13059-017-1182-6 > Current implementation based on R: https://github.com/lutsik/MeDeCom > **Are some parts already implemented in C++ ?** #### Internal Links: [[Matrix Facorization]] [[Singular Value Decomposition]] [[s13059-017-1182-6.pdf]] ## Optimization Approaches ### Matrix Factorization (fine-grained) GPU acceleration of MF: https://dorukkilitcioglu.github.io/2018/12/21/cu2rec.html Cu2Rec: https://github.com/nickgreenquist/cu2rec ### Run orchestration (coarse-grained)