All Features (29/11/2021) Iterative

# All Features (29/11/2021) Iterative ## Iteration 1 - Original keypoints ### Overall score Training accuracy 0.8929524765107294 +/- 0.013995257489632991 Cross-validation accuracy: 0.6831016694808427 +/- 0.026483745356732525 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('kbest', SelectKBest(k=360))])), ('classifier', RidgeClassifier(alpha=0.02030917620904739, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8742382271468144 +/- 0.8742382271468144 Fold 0: Cross-validation accuracy: 0.7098445595854922 +/- 0.7098445595854922 Fold 1: Training accuracy 0.909778812572759 +/- 0.909778812572759 Fold 1: Cross-validation accuracy: 0.6680761099365751 +/- 0.6680761099365751 Fold 2: Training accuracy 0.8868335146898803 +/- 0.8868335146898803 Fold 2: Cross-validation accuracy: 0.7195467422096318 +/- 0.7195467422096318 Fold 3: Training accuracy 0.9087136929460581 +/- 0.9087136929460581 Fold 3: Cross-validation accuracy: 0.6527777777777778 +/- 0.6527777777777778 Fold 4: Training accuracy 0.8851981351981352 +/- 0.8851981351981352 Fold 4: Cross-validation accuracy: 0.6652631578947369 +/- 0.6652631578947369 ### Fold distributions Training folds are well distributed, test folds not so much. Remark: folds with more easy to recognize signs perform better. ## Iteration 2 - Original keypoints without face features ### Overall score Training accuracy 0.8366827771203214 +/- 0.006371249668605516 Cross-validation accuracy: 0.65505040849567 +/- 0.03401498477793019 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('kbest', SelectKBest(k=360))])), ('classifier', RidgeClassifier(alpha=0.8376776400682924, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8271468144044322 +/- 0.8271468144044322 Fold 0: Cross-validation accuracy: 0.6632124352331606 +/- 0.6632124352331606 Fold 1: Training accuracy 0.8416763678696159 +/- 0.8416763678696159 Fold 1: Cross-validation accuracy: 0.6448202959830867 +/- 0.6448202959830867 Fold 2: Training accuracy 0.8454842219804135 +/- 0.8454842219804135 Fold 2: Cross-validation accuracy: 0.71671388101983 +/- 0.71671388101983 Fold 3: Training accuracy 0.8340248962655602 +/- 0.8340248962655602 Fold 3: Cross-validation accuracy: 0.621031746031746 +/- 0.621031746031746 Fold 4: Training accuracy 0.835081585081585 +/- 0.835081585081585 Fold 4: Cross-validation accuracy: 0.6294736842105263 +/- 0.6294736842105263 ### Fold distributions Worse score than using all features but scores of folds are a bit closer to each-other. One fold outperforms them all (2) but we notice that that one contains a lot of HAAS-oos (a class that is easily classified) ## Iteration 3 - Original keypoints without face features and fix-undetected and sync ### Overall score Training accuracy 0.8430460818940008 +/- 0.008887841207751565 Cross-validation accuracy: 0.6643151309647638 +/- 0.011563874678297214 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('kbest', SelectKBest(k=360))])), ('classifier', RidgeClassifier(alpha=0.8376776400682924, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8343490304709141 +/- 0.8343490304709141 Fold 0: Cross-validation accuracy: 0.6839378238341969 +/- 0.6839378238341969 Fold 1: Training accuracy 0.8591385331781141 +/- 0.8591385331781141 Fold 1: Cross-validation accuracy: 0.6511627906976745 +/- 0.6511627906976745 Fold 2: Training accuracy 0.8378672470076169 +/- 0.8378672470076169 Fold 2: Cross-validation accuracy: 0.6685552407932012 +/- 0.6685552407932012 Fold 3: Training accuracy 0.8458802608180201 +/- 0.8458802608180201 Fold 3: Cross-validation accuracy: 0.6547619047619048 +/- 0.6547619047619048 Fold 4: Training accuracy 0.837995337995338 +/- 0.837995337995338 Fold 4: Cross-validation accuracy: 0.6631578947368421 +/- 0.6631578947368421 ### Fold distributions Folds are closer to each other! We also have a better score although the distribution of the folds remains the same. Why fold 2 worse? Maybe it learned that untracked hands belong to HAAS-oor. When untracking was fixed, haas-oor could be classified less accurate but lies closer to the other folds. ## Iteration 4 - Orig. kp no face, fix-undetect, sync and center ### Overall score Training accuracy 0.8360936126616796 +/- 0.01372835908822496 Cross-validation accuracy: 0.6668011186354829 +/- 0.02922201137874495 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('kbest', SelectKBest(k=300))])), ('classifier', RidgeClassifier(alpha=0.49238826317067413, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.821606648199446 +/- 0.821606648199446 Fold 0: Cross-validation accuracy: 0.6994818652849741 +/- 0.6994818652849741 Fold 1: Training accuracy 0.8608847497089639 +/- 0.8608847497089639 Fold 1: Cross-validation accuracy: 0.6257928118393234 +/- 0.6257928118393234 Fold 2: Training accuracy 0.8253536452665942 +/- 0.8253536452665942 Fold 2: Cross-validation accuracy: 0.6997167138810199 +/- 0.6997167138810199 Fold 3: Training accuracy 0.8352104327208062 +/- 0.8352104327208062 Fold 3: Cross-validation accuracy: 0.6626984126984127 +/- 0.6626984126984127 Fold 4: Training accuracy 0.8374125874125874 +/- 0.8374125874125874 Fold 4: Cross-validation accuracy: 0.6463157894736842 +/- 0.6463157894736842 ### Fold distributions Score is slightly better and we only use 300 features instead of 360 ## Iteration 5 - Orig. kp no face, fix-undetect, sync, center and unify shoulder ### Overall score Training accuracy 0.8531281167534821 +/- 0.014770899640708575 Cross-validation accuracy: 0.663717254019289 +/- 0.021415229930152112 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('kbest', SelectKBest(k=380))])), ('classifier', RidgeClassifier(alpha=0.8376776400682924, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8354570637119113 +/- 0.8354570637119113 Fold 0: Cross-validation accuracy: 0.694300518134715 +/- 0.694300518134715 Fold 1: Training accuracy 0.8783469150174622 +/- 0.8783469150174622 Fold 1: Cross-validation accuracy: 0.6342494714587738 +/- 0.6342494714587738 Fold 2: Training accuracy 0.8422198041349293 +/- 0.8422198041349293 Fold 2: Cross-validation accuracy: 0.6770538243626062 +/- 0.6770538243626062 Fold 3: Training accuracy 0.8518079430942501 +/- 0.8518079430942501 Fold 3: Cross-validation accuracy: 0.6666666666666666 +/- 0.6666666666666666 Fold 4: Training accuracy 0.8578088578088578 +/- 0.8578088578088578 Fold 4: Cross-validation accuracy: 0.6463157894736842 +/- 0.6463157894736842 ### Fold distributions We have a worse (-0.003) score when adding unified shoulder (why?) ## Iteration 6 - Orig. kp no face, fix-undetect, sync, center, unify shoulder and LD mothafucking A ### Overall score Training accuracy 0.8876427949409464 +/- 0.012702597974971514 Cross-validation accuracy: 0.671975836843225 +/- 0.03355775359978403 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select', ColumnTransformer(transformers=[('old_feats_body', LDA(42) ('classifier', RidgeClassifier(alpha=100.0, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8692520775623269 +/- 0.8692520775623269 Fold 0: Cross-validation accuracy: 0.7253886010362695 +/- 0.7253886010362695 Fold 1: Training accuracy 0.9068684516880093 +/- 0.9068684516880093 Fold 1: Cross-validation accuracy: 0.6257928118393234 +/- 0.6257928118393234 Fold 2: Training accuracy 0.8813928182807399 +/- 0.8813928182807399 Fold 2: Cross-validation accuracy: 0.6855524079320113 +/- 0.6855524079320113 Fold 3: Training accuracy 0.8855957320687611 +/- 0.8855957320687611 Fold 3: Cross-validation accuracy: 0.6726190476190477 +/- 0.6726190476190477 Fold 4: Training accuracy 0.8951048951048951 +/- 0.8951048951048951 Fold 4: Cross-validation accuracy: 0.6505263157894737 +/- 0.6505263157894737 ### Fold distributions We have a pretty good score for only 42 features! BUT... test and cross-validation gap still significant(67 vs 63) and training and cross-validation gap still stagnating (might be explained by the few features) ## Iteration 7 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and finger lenghts ### Overall score Training accuracy 0.8766903000353615 +/- 0.012473732068323124 Cross-validation accuracy: 0.6854659564375706 +/- 0.026140365610837244 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select', LDA and new features)])), ('classifier', RidgeClassifier(alpha=2424.462017082326, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8581717451523546 +/- 0.8581717451523546 Fold 0: Cross-validation accuracy: 0.7331606217616581 +/- 0.7331606217616581 Fold 1: Training accuracy 0.8963911525029103 +/- 0.8963911525029103 Fold 1: Cross-validation accuracy: 0.6659619450317125 +/- 0.6659619450317125 Fold 2: Training accuracy 0.8710554951033732 +/- 0.8710554951033732 Fold 2: Cross-validation accuracy: 0.6940509915014165 +/- 0.6940509915014165 Fold 3: Training accuracy 0.8772969768820391 +/- 0.8772969768820391 Fold 3: Cross-validation accuracy: 0.6646825396825397 +/- 0.6646825396825397 Fold 4: Training accuracy 0.8805361305361306 +/- 0.8805361305361306 Fold 4: Cross-validation accuracy: 0.6694736842105263 +/- 0.6694736842105263 ### Fold distributions Adding new features yielded an increase of about 1%. Fold validation scores lie a bit closer to each-other but the most extreme fold (1) has increased even more. ## Iteration 8 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and finger lenghts (longer interpol) ### Overall score Training accuracy 0.8767418496522925 +/- 0.014295838228407583 Cross-validation accuracy: 0.6848646725148474 +/- 0.028474976829306636 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select',LDA + kbest = 30)])) ('classifier', RidgeClassifier(alpha=2424.462017082326, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8548476454293629 +/- 0.8548476454293629 Fold 0: Cross-validation accuracy: 0.7357512953367875 +/- 0.7357512953367875 Fold 1: Training accuracy 0.8981373690337602 +/- 0.8981373690337602 Fold 1: Cross-validation accuracy: 0.6638477801268499 +/- 0.6638477801268499 Fold 2: Training accuracy 0.8699673558215452 +/- 0.8699673558215452 Fold 2: Cross-validation accuracy: 0.6968838526912181 +/- 0.6968838526912181 Fold 3: Training accuracy 0.8778897451096621 +/- 0.8778897451096621 Fold 3: Cross-validation accuracy: 0.6646825396825397 +/- 0.6646825396825397 Fold 4: Training accuracy 0.8828671328671329 +/- 0.8828671328671329 Fold 4: Cross-validation accuracy: 0.6631578947368421 +/- 0.6631578947368421 ### Fold distributions We dont notice any great improvement. This means that adding more interpolation frames doesnt really improve overall score. ## Iteration 9 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and body angles ### Overall score Training accuracy 0.8745785410139281 +/- 0.01572602870822609 Cross-validation accuracy: 0.6852942924720761 +/- 0.03307577664995871 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select',LDA + all body angles)])) ('classifier', RidgeClassifier(alpha=2424.462017082326, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8509695290858725 +/- 0.8509695290858725 Fold 0: Cross-validation accuracy: 0.7435233160621761 +/- 0.7435233160621761 Fold 1: Training accuracy 0.8993015133876601 +/- 0.8993015133876601 Fold 1: Cross-validation accuracy: 0.6448202959830867 +/- 0.6448202959830867 Fold 2: Training accuracy 0.8683351468988031 +/- 0.8683351468988031 Fold 2: Cross-validation accuracy: 0.6940509915014165 +/- 0.6940509915014165 Fold 3: Training accuracy 0.8743331357439241 +/- 0.8743331357439241 Fold 3: Cross-validation accuracy: 0.6746031746031746 +/- 0.6746031746031746 Fold 4: Training accuracy 0.87995337995338 +/- 0.87995337995338 Fold 4: Cross-validation accuracy: 0.6694736842105263 +/- 0.6694736842105263 ### Fold distributions We see an improvement compared to iteration 6 which might mean that the body angles are useful features to add. ## Iteration 10 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and body angles (more interpol 4) ### Overall score Training accuracy 0.8745752189269901 +/- 0.01514973094040969 Cross-validation accuracy: 0.6867412423567442 +/- 0.029966149309334698 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select': LDA + kbest=32)])) ('classifier', RidgeClassifier(alpha=2424.462017082326, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8509695290858725 +/- 0.8509695290858725 Fold 0: Cross-validation accuracy: 0.7409326424870466 +/- 0.7409326424870466 Fold 1: Training accuracy 0.8975552968568102 +/- 0.8975552968568102 Fold 1: Cross-validation accuracy: 0.6511627906976745 +/- 0.6511627906976745 Fold 2: Training accuracy 0.8688792165397171 +/- 0.8688792165397171 Fold 2: Cross-validation accuracy: 0.6912181303116147 +/- 0.6912181303116147 Fold 3: Training accuracy 0.8755186721991701 +/- 0.8755186721991701 Fold 3: Cross-validation accuracy: 0.6746031746031746 +/- 0.6746031746031746 Fold 4: Training accuracy 0.87995337995338 +/- 0.87995337995338 Fold 4: Cross-validation accuracy: 0.6757894736842105 +/- 0.6757894736842105 ### Fold distributions We have a slight improvement by adding more frames. Unlike adding frames with finger lengths, this one improved a bit. This might imply that body angles contain more temporal information than finger lengths. ## Iteration 11 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and body angles (more interpol 8) ### Overall score Training accuracy 0.8802791943125555 +/- 0.014468509930616727 Cross-validation accuracy: 0.6911229426592922 +/- 0.032164458538195606 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select',LDA + kbest 116)])) ('classifier', RidgeClassifier(alpha=1511.77507061566, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8587257617728532 +/- 0.8587257617728532 Fold 0: Cross-validation accuracy: 0.7383419689119171 +/- 0.7383419689119171 Fold 1: Training accuracy 0.9027939464493597 +/- 0.9027939464493597 Fold 1: Cross-validation accuracy: 0.6448202959830867 +/- 0.6448202959830867 Fold 2: Training accuracy 0.8732317736670294 +/- 0.8732317736670294 Fold 2: Cross-validation accuracy: 0.7138810198300283 +/- 0.7138810198300283 Fold 3: Training accuracy 0.8814463544754001 +/- 0.8814463544754001 Fold 3: Cross-validation accuracy: 0.6785714285714286 +/- 0.6785714285714286 Fold 4: Training accuracy 0.8851981351981352 +/- 0.8851981351981352 Fold 4: Cross-validation accuracy: 0.68 +/- 0.68 ### Fold distributions Yay, we have an even better score now! Unfortunately, selectKbest selected 116 angle features instead of the previous 32 angle features. This is only a slight improvement for many more features added. ## Iteration 12 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and body distances ### Overall score Training accuracy 0.8648751415401035 +/- 0.015861879270712077 Cross-validation accuracy: 0.6831993575663304 +/- 0.03170117761353051 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select',LDA + all feat)])) ('classifier', RidgeClassifier(alpha=6235.507341273912, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8404432132963989 +/- 0.8404432132963989 Fold 0: Cross-validation accuracy: 0.7409326424870466 +/- 0.7409326424870466 Fold 1: Training accuracy 0.889988358556461 +/- 0.889988358556461 Fold 1: Cross-validation accuracy: 0.6469344608879493 +/- 0.6469344608879493 Fold 2: Training accuracy 0.8601741022850925 +/- 0.8601741022850925 Fold 2: Cross-validation accuracy: 0.6855524079320113 +/- 0.6855524079320113 Fold 3: Training accuracy 0.8672199170124482 +/- 0.8672199170124482 Fold 3: Cross-validation accuracy: 0.6646825396825397 +/- 0.6646825396825397 Fold 4: Training accuracy 0.8665501165501166 +/- 0.8665501165501166 Fold 4: Cross-validation accuracy: 0.6778947368421052 +/- 0.6778947368421052 ### Fold distributions Better score but not as good as the angles. Folds are bit less equal. ## Iteration 13 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and body distances (more interpol 8) ### Overall score Training accuracy 0.8676540071976543 +/- 0.014847237268801883 Cross-validation accuracy: 0.685021932408308 +/- 0.03154627789688863 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select',LDA + kbest=52)])) ('classifier', RidgeClassifier(alpha=3888.155180308085, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8426592797783934 +/- 0.8426592797783934 Fold 0: Cross-validation accuracy: 0.7357512953367875 +/- 0.7357512953367875 Fold 1: Training accuracy 0.889406286379511 +/- 0.889406286379511 Fold 1: Cross-validation accuracy: 0.638477801268499 +/- 0.638477801268499 Fold 2: Training accuracy 0.8683351468988031 +/- 0.8683351468988031 Fold 2: Cross-validation accuracy: 0.6940509915014165 +/- 0.6940509915014165 Fold 3: Training accuracy 0.8684054534676942 +/- 0.8684054534676942 Fold 3: Cross-validation accuracy: 0.6726190476190477 +/- 0.6726190476190477 Fold 4: Training accuracy 0.8694638694638694 +/- 0.8694638694638694 Fold 4: Cross-validation accuracy: 0.6842105263157895 +/- 0.6842105263157895 ### Fold distributions We have a slightly better score but maybe not worth it for the extra amount of features ## Iteration 14 - Orig. kp no face, fix-undetect, sync, center, unify shoulder, LDA and every above ### Overall score Training accuracy 0.8741393729422737 +/- 0.013570018607200444 Cross-validation accuracy: 0.6951640244979987 +/- 0.02591457020895899 Best estimator: Pipeline(steps=[('preprocessing', Pipeline(steps=[('scaler', StandardScaler())])), ('feature_selection', Pipeline(steps=[('feat_select', LDA + body_dist=1 + body_angles=81 + finger_lengths = ?)])) ('classifier', RidgeClassifier(alpha=2335.7214690901214, class_weight='balanced', max_iter=10000))]) ### Fold scores Fold 0: Training accuracy 0.8548476454293629 +/- 0.8548476454293629 Fold 0: Cross-validation accuracy: 0.7331606217616581 +/- 0.7331606217616581 Fold 1: Training accuracy 0.8946449359720605 +/- 0.8946449359720605 Fold 1: Cross-validation accuracy: 0.653276955602537 +/- 0.653276955602537 Fold 2: Training accuracy 0.8645266594124048 +/- 0.8645266594124048 Fold 2: Cross-validation accuracy: 0.6997167138810199 +/- 0.6997167138810199 Fold 3: Training accuracy 0.8778897451096621 +/- 0.8778897451096621 Fold 3: Cross-validation accuracy: 0.6865079365079365 +/- 0.6865079365079365 Fold 4: Training accuracy 0.8787878787878788 +/- 0.8787878787878788 Fold 4: Cross-validation accuracy: 0.7031578947368421 +/- 0.7031578947368421 ### Fold distributions