Fusing visual and audio information in a distributed intelligent surveillance system for public transport systems
B Ping L
Human Weapon-Activity Recognition in Surveillance Videos Using Structural-RNN