meeting 22/12/2021

# meeting 22/12/2021 Overview: Multi arm bandit prblem Group items: similar to multi arm bandit problem boltzmann exploration -> exploration strategy generalize context example: type of user * given info arm does not work * though need prior info about user even if user is part of cluster of users -> need to explore but context helps explore to generalize to other users is it a business trip -> yes/no filtering flag: tells you the context amount of data is not the problem variable says something about: * how much someone is spending * could be per every euro in a bucket * might just be a few * could be less than and more than 100 euro * recommendations will be less targeted if its RS it's fine Try recsim * also dynamic, sequential sequential aspect of the recommendation recommend, watch, upate parameters, recommend as you interact over time, you get a better idea we could find coefficients recommend one time Direction: get something, start playing aorund with it and get an idea PhD start: will be easier to discuss with them how to transfer things between contexts all users are different, but we want to help the exploration from other user profiles collaborative filtering as basic techniques, look into othe rtechniques how to deal with big action spaces, you can show lots to people * score every movie, show top 10 * narrow down on what profiles they are balance certainty with potential fail for IG someone wants to spend lot of money, but we miss out on it by showing inexpensive hotels upper confidence bound takes max from this optimum ... certainty * if we don't konw what's going on, we explore * if we can select k of them * some function over movie space * if we keep sticking to one genre of movies, the user won't see other movies ever not our job to find the categories we want some simple categories **transfer across contexts** different ranking algorithms interleave, and compare bassed on what user chose **focus on sequential** context of the user, update the coefficients of the context based on the user if you explore too much don't build everything perfectly, that makes it hard to get ## Tasks > Get something, start playing around with it and get an idea 1. RecSim 2. LensKit Start reading documentation and apply them? or freeflow focus on sequential transfer across contexts