# First meeting M
## Three topics
1- explainable recommenders
2- current top k recommenders
3- off-line policy evaulation techniques for top k recommenders
* What they are doing in mercury lab?
* Is there something that we can do to assist them there?
been a while since supervised. -> send him the link
RLRS
introductions, then tell him what we want, find PhD students
teaches Automata Computability and Complexity
AD: Smart algos for smart decision
RL, we need the specific algorithms
Making general algorithms
safe RL interactions with environment, avoid things goinog wrong
* more careful in exploration
* or train based on offline data
* real problems, getting data is hard or the amount of effort would be hard
* decision support system
* mercury lab: hiring PhD students, problems that booking.com is interested in.
* booking has a lot of data
* non-stationary
* Diverse but focused: Causality
* not classifying users (not our job), but using their data
Propensity score matching:
Student few years ago:
* you need the old policy, no exploration
* off-policy for exploration
* top-k stuff
* PhD students are still being hired, so we don't know who does anything, so let's wait for now
* This algorithm looks like one, but there's something missing
* Evaluation: problem: always interfering with the system, so you need a system that is decent enough (?)
* RS simulator
* problem we wanna solve, solution ?, steps to take, and test it using a simulator <- best way to go forward, meet a few weeks from now
https://github.com/google-research/recsim is it any good? does rs, and built by people who care about RL
* allows testing
**Not** building simulators on our own
Questions for next meeting
* Fail if didn't do the right methodology, not if nothing comes out of the research
* Milestones are nice, shows progression:
* 1.
* 2.
Good to have a plan.
* Applications in deep learning in mercury lab
* could look at modeling or algorithm part
* lot of magic there still
* maybe as an add-on: compare our project to DRL