论文阅读笔记：“ActionVLAD：Learning spatio-temporal aggregation for action classification”

sixwalter Lv6

2023-08-05 11:14:26 2023-08-05 11:14

144 Words 1 Mins

ActionVLAD: Learning spatio-temporal aggregation for action classification

代码：http://rohitgirdhar.github.io/ActionVLAD

论文思想

时空信息聚合

将特征空间划分为K个区域，该区域可以表示为“action words”，也可以称其为锚点(achor points ck)

公式：

上面的公式对特征与锚点（typical actions）之前的差异在整个视频维度进行了求和。

why use VLAD to pool?

HOW to combine RGB and FLOW streams?

Post title：论文阅读笔记：“ActionVLAD：Learning spatio-temporal aggregation for action classification”
Post author：sixwalter
Create time：2023-08-05 11:14:26
Post link：https://coelien.github.io/2023/08/05/paper-reading/paper_reading_056/
Copyright Notice：All articles in this blog are licensed under BY-NC-SA unless stating additionally.