DETR
- treat the problem as a direct set prediction problem.
- it is pointed out that both self-attention is especially suitable for the constraints of set prediction.
- architecture: encode-decoder transformer (with non-autoregressive parallel decoding), set-based global loss (bipartite matching for computing loss, where loss is permutation-invariant).
Image Not Showing
Possible Reasons
- The image file may be corrupted
- The server hosting the image is unavailable
- The image path is incorrect
- The image format is not supported
Learn More →
- bipartite matching losses for set prediction, encode-decoder architectures based on transformer, parallel decoding, and object detection methods.
Set Prediction
- the most basic form of set prediction would be a multi-class classification task (one vs one, one vs all strategies).
- direct set-prediction problem needs global inference schemes that model interactions between all predicted elements to avoid redundancy.
- auto-regressive models are the commonly used models.