---
# System prepended metadata

title: Trivago
tags: [Meeting]

---

# Trivago
> https://recsys.trivago.cloud/challenge/dataset/


## Session
![](https://i.imgur.com/HdkkMyx.png)

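* Sessions here are recorded only for logged-in users, so a single user can have sessions on different days
* The split date is decided by the competition organizers
* `X` marks a click that was triggered
* `?` marks the clicked item to predict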
* validation group => public leaderboard
* confirmation group => private leaderboard

## Example
```
Query 1:
impressions = [100, 101, 102, 103, 104, 105]
clicked_item_id = 102
submission = [101, 103, 104, 102, 105, 100]
reciprocal rank = 0.25
```

```
Query 2:
impressions = [101, 103, 104, 100, 105]
clicked_item_id = 105
submission = [103, 105, 101, 100, 104]
reciprocal rank = 0.5
mrr = (0.25 + 0.5) / 2 = 0.375
```
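
The two queries above can be reproduced with a short sketch. The function names `reciprocal_rank` and `mean_reciprocal_rank` are illustrative and not taken from the challenge code. The sketch assumes an item missing from the submission scores 0.

```python
def reciprocal_rank(submission, clicked_item_id):
    """Return 1 / rank of the clicked item in the submission (0.0 if absent)."""
    for rank, item_id in enumerate(submission, start=1):
        if item_id == clicked_item_id:
            return 1.0 / rank
    return 0.0  # assumption: a missing item contributes nothing


def mean_reciprocal_rank(queries):
    """Average the reciprocal rank over (submission, clicked_item_id) pairs."""
    ranks = [reciprocal_rank(sub, clicked) for sub, clicked in queries]
    return sum(ranks) / len(ranks)


queries = [
    ([101, 103, 104, 102, 105, 100], 102),  # Query 1: clicked item at rank 4
    ([103, 105, 101, 100, 104], 105),       # Query 2: clicked item at rank 2
]
print(mean_reciprocal_rank(queries))  # 0.375
```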

Evaluation: MRR@25
> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/NN%20impressions_features_input.ipynb

## Features
Session actions
> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/click_out%20accommodations.ipynb

* user_id: identifier of the user
* session_id: identifier of each session
* timestamp: UNIX timestamp for the time of the interaction
* step: step in the sequence of actions within the session
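
As a rough illustration of how these four fields fit together, the sketch below models one row and restores the action order inside a session. The class name `SessionAction`, the field types, the sample rows, and the assumption that `timestamp` is in seconds are all hypothetical and not confirmed by the dataset description.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class SessionAction:
    user_id: str      # identifier of the user (type assumed)
    session_id: str   # identifier of the session (type assumed)
    timestamp: int    # UNIX timestamp, assumed to be in seconds
    step: int         # position of the action within the session

    @property
    def time_utc(self):
        """Interaction time as a timezone-aware UTC datetime."""
        return datetime.fromtimestamp(self.timestamp, tz=timezone.utc)


def order_session(actions):
    """Sort the actions of one session by their step number."""
    return sorted(actions, key=lambda action: action.step)


# Made-up rows for illustration only.
rows = [
    SessionAction("u1", "s1", 1541037600, 2),
    SessionAction("u1", "s1", 1541037540, 1),
]
ordered = order_session(rows)
print([action.step for action in ordered])  # [1, 2]
print(ordered[0].time_utc.isoformat())
```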


![](https://i.imgur.com/5yId49f.png)


## Observations
### User session clicks on more than one hotel
> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/same_clickout_user_more_clickouts.ipynb

* This applies to about 44% of users

![](https://i.imgur.com/aoriVyt.png)


### Number of sessions per user
> https://github.com/keyblade95/recsys2019/blob/master/visualize_data/sessions_per_user.ipynb

* 163,609 of the 948,041 users in total have more than one session, so about 17% of users have more than one session
* On average, each user has about 1.2 sessions
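
These statistics could be computed along the following lines. This is a minimal sketch, not the linked notebook's code: it assumes the log has been reduced to `(user_id, session_id)` pairs, and the helper names and sample rows are made up. The last line only rechecks the percentage from the two counts reported above.

```python
from collections import defaultdict


def sessions_per_user(pairs):
    """Map each user_id to its number of distinct session_ids."""
    sessions = defaultdict(set)
    for user_id, session_id in pairs:
        sessions[user_id].add(session_id)
    return {user: len(ids) for user, ids in sessions.items()}


def summarize(counts):
    """Return (share of users with more than one session, mean sessions per user)."""
    n_users = len(counts)
    multi = sum(1 for count in counts.values() if count > 1)
    return multi / n_users, sum(counts.values()) / n_users


# Made-up rows for illustration; repeated pairs stand for several actions
# within the same session.
pairs = [("u1", "s1"), ("u1", "s1"), ("u1", "s2"), ("u2", "s3"), ("u3", "s4")]
share, mean = summarize(sessions_per_user(pairs))
print(share, mean)

# Recheck of the reported share: 163609 of 948041 users.
print(round(163609 / 948041 * 100, 1))  # 17.3
```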

![](https://i.imgur.com/pWrzrLP.png)


