no code implementations • 1 Dec 2023 • Deepak Sridhar, Yunsheng Li, Nuno Vasconcelos
The resulting $\textit{Scalable CHannEl MixEr}$ (SCHEME) can be plugged into any ViT architecture to obtain a range of models with different trade-offs between complexity and performance by controlling the block-diagonal MLP structure.
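As a rough illustration of why a block-diagonal MLP structure gives a complexity knob, the sketch below counts the weights of a two-layer channel MLP whose weight matrices are split into independent diagonal blocks, and applies one such layer to a vector. It is not the authors' implementation: the function names `block_diagonal_params` and `block_diagonal_linear`, the `groups` parameter, and the omission of biases and activations are all assumptions made for this example.

```python
def block_diagonal_params(dim, hidden, groups):
    """Weight count (biases ignored) of a two-layer MLP, dim -> hidden -> dim,
    whose weight matrices are block diagonal with `groups` blocks.
    Hypothetical helper for illustration only."""
    if dim % groups or hidden % groups:
        raise ValueError("dim and hidden must both be divisible by groups")
    per_block = (dim // groups) * (hidden // groups)
    # One expansion layer and one projection layer, each with `groups` blocks.
    return 2 * groups * per_block


def block_diagonal_linear(x, blocks):
    """Apply a block-diagonal weight matrix to the vector x.
    `blocks` is a list of small matrices (lists of rows); block g only
    sees its own slice of the input channels."""
    out, start = [], 0
    for w in blocks:
        in_g = len(w[0])
        chunk = x[start:start + in_g]
        out.extend(sum(wi * xi for wi, xi in zip(row, chunk)) for row in w)
        start += in_g
    if start != len(x):
        raise ValueError("block input sizes must sum to len(x)")
    return out


dense = block_diagonal_params(8, 32, groups=1)    # ordinary dense MLP
grouped = block_diagonal_params(8, 32, groups=4)  # four diagonal blocks
y = block_diagonal_linear([1, 2, 3, 4], [[[1, 0], [0, 1]], [[2, 0], [0, 2]]])
```

Under these assumptions, using `groups` blocks divides the weight count by `groups` relative to the dense case, which is the kind of complexity versus performance trade-off the excerpt refers to.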
no code implementations • ICCV 2021 • Deepak Sridhar, Niamul Quader, Srikanth Muralidharan, Yaoxin Li, Peng Dai, Juwei Lu
Our attention mechanism outperforms prior self-attention modules, such as squeeze-and-excitation, on the action detection task.
no code implementations • 1 Jan 2021 • Ali Ghobadzadeh, Deepak Sridhar, Juwei Lu, Wei Li
In this paper, we probe this direction by deriving a relationship between the estimation of unknown parameters of the probability density function (pdf) of input data and classification accuracy.