了解資料
read data
path = 'data/'
df = pd.read_csv(path + 'train.csv')
features (training set only)
profile pic:⽤戶有沒有頭貼# check feature 'profile pic'
print(set(df['profile pic'])) # value: 0/1
print(df.loc[(df['profile pic'] == 0) & (df['fake'] == 0)]) # count: 2
print(df.loc[(df['profile pic'] == 0) & (df['fake'] == 1)]) # count: 170
print(df.loc[(df['profile pic'] == 1) & (df['fake'] == 0)]) # count: 286