In recent years, spatio-temporal (ST) deep models have become increasingly powerful. We build larger models, learn richer representations, and achieve strong performance across many tasks. At the same time, as these models are expected to be more interpretable and more reliable for supporting …