# Sybil Report on Gitcoin Grants Rounds 11 ###### tags: `gitcoin` `Reports` :::info Updated by September 2021 ::: *Authors: Danilo Lessa Bernardineli* ## Summary The combined FDD process is **effective at catching about 83% of the Sybil Users (between 100% to 57% under 95% CI)** according to blind human evaluations. The best estimate for **Sybil Incidence on Gitcoin is 6.4%, and is contained between 3.6% to 9.3%** with 95% CI. IP clusters together with GitHub account creation date are the most relevant features for detecting sybil users programatically right now. The **Fraud Tax is computed as 0.61%** of the Funding Amount. ## Links - Sybil Detection Service Repo: https://github.com/gitcoindao/gitcoin_asop/tree/2c39f443068a9378ab9aadb5b7364e53a178cdbb - Summary stats: https://github.com/gitcoindao/gitcoin_asop/blob/2c39f443068a9378ab9aadb5b7364e53a178cdbb/notebooks/gr11_report.ipynb - Incidence stats: https://github.com/gitcoindao/gitcoin_asop/blob/2c39f443068a9378ab9aadb5b7364e53a178cdbb/notebooks/gr11_sybil_incidence.ipynb - List of flagged users: https://github.com/gitcoindao/gitcoin_asop/blob/2c39f443068a9378ab9aadb5b7364e53a178cdbb/data/predict_data/2021-09-27T18:59:10%2B00:00/2021-09-27T19:03:53-gc_df.csv ## Parameters Algorithm Aggressiveness: 30% Matching Pool: 950k USD :::info Algorithm Aggressiveness tunes how much sensitive / specific the flagging model should be. When it is set to 50%, then it will be optimized for accuracy, while closer values to 0% means that it will be optimized for minimizing false detection rate ::: ### Trained Model Feature Importance ![Model Feature Importance](https://i.imgur.com/pMMA9To.png) ## Statistics ### Sybil Incidence & Detection #### Sybil Incidence ![](https://i.imgur.com/9hXOf1D.png) - **Estimated Unbiased Sybil Incidence: 6.4%** (between 3.6% and 9.3%, 95% CI) - Esimated Sybil Users (unbiased): 1127 (between 635 and 1631, 95% CI) - **Flagged Users Fraction: 5.3%** - Flagged Sybil Users: 853 - Can we reject the hypothesis that we're catching everyone? **No** - Best estimate for how much % of the sybil users are being caught: 82.9% - Worst estimate for how much % of the sybil users are being caught: 56.9% :::info The Unbiased Sybil Incidence is the estimated incidence by debiasing the probability parameter as estimated by a Binomial Distribution over the Average `is_sybil` on the Human Evaluation flags. Debiasing takes into account the squelches and the sampling distribution. ::: #### Sybil Flags - ML generated flags: 494 - Heuristic generated flags: 178 - Human provided flags: 1247 - Total flags: 1247 :::info Flags can be duplicate for the same user if they're evaluated more than once ::: ### Summary Statistics on Scenarios ![](https://i.imgur.com/O2U6Tp8.png) ![](https://i.imgur.com/k9CDuzn.png) #### Contributors Total Contributors (original): 15986 Total Contributors (modified): 15134 Total Contributors (removed): 853 Sum of USDT Amount (original): 3499639.63 Sum of USDT Amount (removed): 606494.88 Median of Median USDT Amount per User (original): 1.43 Median of Median USDT Amount per User (modified): 1.44 Median of Median USDT Amount per User (removed): 1.36 Median Contribution Count per User (original): 17.00 Median Contribution Count per User (modified): 17.00 Median Contribution Count per User (removed): 20.00 Mean Count per User (original): 27.89 Mean Count per User (modified): 29.46 Mean Count per User (removed): 34.45 ### Quadratic Funding Statistics Total Grant Funding (original): 950000.00 Total Grant Funding (modified): 950000.00 Mean Grant Funding (USDT, original): 769.85 Mean Grant Funding (USDT, modified): 780.61 Median Grant Funding (USDT, original): 0.02 Median Grant Funding (USDT, modified): 0.02 Grant count (original): 1234 Grant count (modified): 1217 Max payout (USDT) 955787.22 **Fraud Tax (USDT) 5787.22 Fraud Tax (% of total funding) 0.61%** ![](https://i.imgur.com/5tqNybm.png)