--- tags: Draft --- # Results from BlockScience ## Goals: - Show that we have a semi-supervised ML pipeline for detecting sybil attack - Explain what that entails (options) - White box algorithms - Feature Engineering - Sensitivity vs Specificity - Labels and GIGO - Simple models vs ensemble models - Impress with some numbers - Impress with some visualizations - Convince the audience that will only get better through continous evaluation Target Audience: - Boundary conditions: - 5min for presentation, 2min for Q&A - No doxing - No sharing of feature specifics ## Outline - Intro (45s) - "Hi! It is well known that we had lots of attacks. We from BSci together with a tight collaboration with the Gitcoin Dev Team have deved a Semi Supervised Piepline" - Semi Supervised Pipeline (3m) - What is a semi supervised pipeline (45s) - What are the pieces of the pipeline (1min) - White box algo (40s) - Labels and GIGO (20s) - How it gets better with time (1min) - Feature Engineering (15s) - Human evaluation (30s) - Simple & ensemble models (15s) - What we have now (20s) - Mixture of DTs and RFs - Results (1min) - Summary stats from EDA (20s) - Prediction results (20s) - Predictive features (20s) - Conclusions (30s) - - Questions (2min) ## Content resources