My Journey Is the Sea of Stars

Paper reading notes: “Temporal Segment Networks: Towards Good Practices for Deep Action Recognition”

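Temporal Segment Networks: Towards Good Practices for Deep Action Recognition (ECCV 2016). Introduction analysis: we start with the introduction and look at how the paper's argument is put together, which may well help with writing papers, but...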
2023-08-05
- Paper reading
- > Action recognition
- 2D model
- | Long time range modeling
Paper reading notes: “Convolutional Two-Stream Network Fusion for Video Action Recognition”

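Convolutional Two-Stream Network Fusion for Video Action Recognition. The strength of this paper is its clarity of thought: the way the problem is introduced and the overall line of argument are already excellent in the introduction, and are well worth learning from. What the introduction is really saying is,...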
2023-08-05
- Paper reading
- > Action recognition
- 3D conv & pooling
- | fusion method
Paper reading notes: “Learning Spatiotemporal Features with 3D Convolutional Networks”

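Learning Spatiotemporal Features with 3D Convolutional Networks. This paper proposes the C3D model, which became a foundational prototype for later research. Notably, the authors use a range of visualization methods to visualize the features C3D learns...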
2023-08-05
- Paper reading
- > Action recognition
- 3D model
- | C3D
Paper reading notes: “A Hierarchical Deep Temporal Model for Group Activity Recognition”

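A Hierarchical Deep Temporal Model for Group Activity Recognition. Key highlights: hierarchical model. As shown in the figure, the model is logically divided into two stages: the first stage predicts person-level actions, and the second stage collects the action latent variables obtained in the first stage...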
2023-08-05
- Paper reading
- > Group activity recognition
Paper reading notes: “TSM: Temporal Shift Module for Efficient Video Understanding”

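TSM: Temporal Shift Module for Efficient Video Understanding. For real-world deployment, hardware-efficient video understanding methods are very important. When the paper was published there were already some methods that trade off modeling capacity against computation, but the paper points out that known...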
2023-08-05
- Paper reading
- > Action recognition
- 2D model
- | TSM
Paper reading notes: “Is Space-Time Attention All You Need for Video Understanding?”

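Is Space-Time Attention All You Need for Video Understanding? 1. Main idea: the model proposed in this paper performs video classification using self-attention alone. It extends the ViT model for image classification, taking the space from the 2D image space...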
2023-08-05
- Paper reading
- > Action recognition
- TimeSformer
- | Coordinate ascent algorithm
Paper reading notes: “MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications”

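Paper reading notes: a brief look at the MobileNets series of papers: MobileNet v1. References: the original paper, a Bilibili video. Hint: when searching for an optimal network architecture, which ways of thinking are worth borrowing? Key highlights: depth-wise separable convolution ...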
2023-08-05
- Paper reading
- > Image classification
Paper reading notes: “X3D: Expanding Architectures for Efficient Video Recognition”

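X3D: Expanding Architectures for Efficient Video Recognition. 1. Main idea: the model builds on MobileNet, from the "mobile-regime" family of models; with slight modifications to the original model, it cuts the number of multiply-add operations by nearly 10x...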
2023-08-05
- Paper reading
- > Action recognition
- Coordinate ascent algorithm
- | X3D
Paper reading notes: “Aggregated Residual Transformations for Deep Neural Networks”

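Paper reading notes: “Aggregated Residual Transformations for Deep Neural Networks”. I want to read some papers on network architecture design, which may help me build my own network. Although these are only 2D architectures, transferring them to 3...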
2023-08-05
- Paper reading
- > Image classification
- architecture design
- | ResNeXt
- | group convolution
Paper reading notes: “MobileNetV2: Inverted Residuals and Linear Bottlenecks”

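Paper reading notes: a brief look at the MobileNets series of papers: MobileNet v2. References: the original paper, a Bilibili video. Hint: when searching for an optimal network architecture, which ways of thinking are worth borrowing? Even if I cannot come up with a novel, general method for network architecture design, learning from existing architecture design ideas to...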
2023-08-05
- Paper reading
- > Image classification
- MobileNet
- | architecture design
- | Inverted Residuals
- | Linear Bottlenecks
