# Image preprocessing ###### tags: `image preprocessing` ![](https://i.imgur.com/X8dfRyv.png) ## 1. CNN Data (Classifying Label 1) - /shared/data/image_cnn/ - Data Specs - Downsample Detritus class 20,000 - Zooplankton class ~18,000 - Preprocessing: - Gaussian blur kernel size 5 - Grayscale - Resize to 40x40 ## 2. Augmentation Data (Label 2) - /shared/jenniferding/plankton - Data Specs - 500 samples from copepod and noncopepod data - Preprocessing: - Gaussian blur kernel size 3 - RGB - Resize to 128x128 ## Potential Issues: There are 3 images that are labeled both copepod and noncopepod (label 2): ['Pia1.2017-10-26.1849+N00011891_hc.tif', 'Pia1.2017-10-03.1726+N00294605_hc.tif', 'Pia1.2017-10-03.1726+N00358839_hc.tif'] For now I've removed these from augmentation data