Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification
Abstract: Focusing on the key local regions of an action can improve CNN-based visual action recognition. Self-attention, which focuses on important details and ignores the rest, is therefore useful for action recognition. However, current self-attention methods ignore the correlations among local feature vectors at spatial positions…