#### What is the problem/reward signal/cost function ![](https://i.imgur.com/IiQ8LLU.png) --- #### What is the problem/reward signal/cost function ![](https://i.imgur.com/l1eG2jw.png =700x) --- #### What is the problem/reward signal/cost function ![](https://i.imgur.com/mAvmstY.png) ![](https://i.imgur.com/WtawZwl.png) --- #### What is the problem/reward signal/cost function with optimal control...differentiate trough the environment==policy grandient. ![](https://i.imgur.com/x9UodN9.png) some ref: https://michaelrzhang.github.io/model-based-rl <!-- .element: style="font-size: 10px;" --> --- ![](https://i.imgur.com/820uGJK.png) ![](https://i.imgur.com/bqj824y.png =550x) --- #### The optimal transport reprogramming data current results are ongoing... ![](https://i.imgur.com/jlcXcTd.png) [Schiebinger, Geoffrey, et al. "Optimal-transport analysis of single-cell gene expression identifies developmental trajectories in reprogramming." Cell 176.4 (2019): 928-943.](https://reader.elsevier.com/reader/sd/pii/S009286741930039X?token=CADB22D39F63F7E3D3F505F1D76D096E5DEFD0E16BDA93D3EB225C5FA46BD541A24CD091BDEB877D92918250A8861027&originRegion=us-east-1&originCreation=20220420200023) <!-- .element: style="font-size: 10px;" --> --- ![](https://i.imgur.com/F3GWT8Y.png =200x) #### What we needed to build to get to here... - build a simulator (fortunatly we sit on the shoulders of giants) - to be differentiable (for policy fradient) - use an oracle for the reward (a neural network classfier) - to be fast during simulation (very important!) - gym wrapping (to use already built RL policies) - bioinformatics (eh) <!-- .element: style="font-size: 23px;" --> Note: all of those makes a huge forest. none of the path is well known. time for sitting down and meditate. --- #### The Simulator ![](https://i.imgur.com/2v9JwpY.png =500x) Note: we have a very good idea of how the system behaves in response to actions. do we? --- #### The input needed to do control ![](https://i.imgur.com/wZ838Nc.png) --- The underlying dynamics: ![](https://i.imgur.com/6Og4B7r.png) - A fixed GRN how genes interact with each other - Production rate $P_i$ is a non linear function of the previous state. - Decay rate - Noise (stochastic Wiener processes) --- ![](https://i.imgur.com/WpOh6jC.png =300x) - We control the master regulator at t=0, our decision variable is the initial concetration $x_0$ = a, (actually a subset) - Loss function is on the last timestep $\ell(x_T, y)$ - Loss function is the loss of a classifier, the target state is not known and not even unique. <!-- .element: style="font-size: 23px;" --> --- ## Optimal Control approach - Single shooting - Jax Simulator --- ## Reinforcement Learning - Requires no gradient throught the environment. - $\gamma=0$, bandit problem - Will be extended to control intermediate steps --- #### Examples of control in our daily life: - the morning coffe and sometimes the evening coffe - sigaretes - alcohol - pills - ...for more [wiki Bioethics](https://en.wikipedia.org/wiki/Bioethics#:~:text=Bioethics%20is%20the%20study%20of,as%20environment%20and%20well%2Dbeing.) --- #### Examples of control in science **Yasmeen Hitti** uses **lights** to control the plant grow shape and directionality. **Micheal Levin** uses **bioelectricity** to change body parts of symbiotic bacteria. --- #### Some biologycal facts... ![](https://i.imgur.com/HbrMP3x.png) - all life originated from a single cell ancestor, A, T, G, C is the same in all species. - upregulation and downregualtion is a continous and non linear process. - because this microworld is so complex and difficult to make it hardware base some chose to work at the macrolevel but essentially: - **all life is encoded in the genome along with the epigenome** (sperm cell meets the egg cell) <!-- .element: style="font-size: 23px;" --> --- ### Contributions and Acknowledgements: \ **Pierre-Luc Bacon**: Supervision. **Payel Das**: Supervision and funding. **Manuel Del Verme**: For rewriting with me so many times the simulator, gym wrapping and daily support. **Simon Dufort-Labbé**: Adding JAX, fast simulator. **Youssef Mroueh**: Suggesting the optimal transport data. **David Kanaa**: Brainstorming with me about the math behind the simulator. **Ryan Jhon Cubero**: gave the right reccomendations for papers at the beginnning one year ago. **Rainer Kelz**: supervising my thesis and allowing me to have this project as my master thesis. <!-- .element: style="font-size: 23px; text-align:left;" --> \ I sincerely thank every single person who helped this project become so possible and I appreciate any small contribution which was made and it is yet to come. <!-- .element: style="font-size: 23px;" --> --- Thank you all for your attention. Time for discussion.
{"metaMigratedAt":"2023-06-16T23:31:00.625Z","metaMigratedFrom":"YAML","title":"Cell State Control","breaks":true,"slideOptions":"{\"spotlight\":{\"enabled\":false}}","contributors":"[{\"id\":\"f76b4bc5-114c-445f-a24d-bc3cfc8414e9\",\"add\":9014,\"del\":4975},{\"id\":\"c24dc5d4-41ff-41ab-b5ac-7a3f3df74495\",\"add\":894,\"del\":1}]"}
    233 views