Video anamoly detection
Spatiotemporal auto-encoders
The authors propose a new architecture for anomaly detection in videos. Their architecture includes two main components one for spatial feature representation, and one for learning the temporal evolution of the spatial features.
The method is based on the principle that the frames containing an abnormality will be significantly different from the previous frames.
Reconstruction error of all pixel values in frame t of the video sequence is taken as the Euclidean distance between the input frame and the reconstructed frame:
Abnormality score :
The minimum and maximum are calculated from t = t to t+k.
K was chosen to be 50 in the original paper
Reularity score :
The reconstruction error of each frame determines whether the frame is classified as anomalous. The threshold determines how sensitive we wish the detection system to behave