---
tags: Meeting
---

# Trivago

> https://recsys.trivago.cloud/challenge/dataset/

## Session

![](https://i.imgur.com/HdkkMyx.png)

* All sessions here are recorded for logged-in users, so a single user can have session records on different days
* The split date is decided by the competition organizers
* X marks a triggered click
* ? marks the clicked item to predict
* validation group => public leaderboard
* confirmation group => private leaderboard

## Example

```
Query 1:
impressions = [100, 101, 102, 103, 104, 105]
clicked_item_id = 102
submission = [101, 103, 104, 102, 105, 100]
reciprocal rank = 0.25
```

```
Query 2:
impressions = [101, 103, 104, 100, 105]
clicked_item_id = 105
submission = [103, 105, 101, 100, 104]
reciprocal rank = 0.5

mrr = (0.25 + 0.5) / 2 = 0.375
```

Evaluation: MRR@25

> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/NN%20impressions_features_input.ipynb

## Features

Session actions

> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/click_out%20accommodations.ipynb

* user_id: identifier of the user
* session_id: identifier of each session
* timestamp: UNIX timestamp for the time of the interaction
* step: step in the sequence of actions within the session

![](https://i.imgur.com/5yId49f.png)

## Observations

### User sessions that click on more than one hotel

> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/same_clickout_user_more_clickouts.ipynb

* About 44% of users

![](https://i.imgur.com/aoriVyt.png)

### Number of sessions per user

> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/sessions_per_user.ipynb

* 163,609 of the 948,041 users have more than one session, i.e. about 17% of users have more than one session
* On average, 1.2 sessions per user

![](https://i.imgur.com/pWrzrLP.png)
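
Going back to the Example section: the reciprocal-rank arithmetic there can be checked with a small sketch. This is not the organizers' scoring script. The function names `reciprocal_rank` and `mean_reciprocal_rank` are made up for illustration, and scoring 0 for a clicked item that is missing from the submission is an assumption. No cutoff at 25 is applied here.

```python
def reciprocal_rank(submission, clicked_item_id):
    """Return 1 / (1-based position of the clicked item in the submission).

    Assumption: a clicked item that is absent from the submission scores 0.0.
    """
    for position, item_id in enumerate(submission, start=1):
        if item_id == clicked_item_id:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(queries):
    """Average the reciprocal rank over (submission, clicked_item_id) pairs."""
    ranks = [reciprocal_rank(submission, clicked) for submission, clicked in queries]
    return sum(ranks) / len(ranks) if ranks else 0.0


# The two queries from the Example section.
queries = [
    ([101, 103, 104, 102, 105, 100], 102),  # clicked item at position 4 -> 0.25
    ([103, 105, 101, 100, 104], 105),       # clicked item at position 2 -> 0.5
]

print(mean_reciprocal_rank(queries))  # 0.375
```

Each query contributes only the position of its single clicked item, so reordering the other items in a submission does not change the score.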