本篇論文提出利用新穎的3D卷積神經網路(3D-CNN, 3D Convolutional Neural Network)學習包含時間及空間的特徵,並將該類神經網路分類器與技術發展純熟的 Cascade架構結合。可以對於不同複雜度的個別區域(如前景與背景),動態學習出一個至數個分類器,級聯成獨立相應的偵測系統。方法上利用將訓練樣本集合細分為典型和非典型兩個子集合,模擬正負樣本;並以增加不同比例高斯雜訊的方式擴充資料量、增加變化性,解決卷積神經網路較難應用於異常事件偵測領域的問題。使用常見的資料集來測試偵測系統之精確度與召回率,由實驗結果顯示我們所提出的方法有不錯的表現,經過後處理後能與其他先進的演算法匹敵。
In this paper we present a neural network (NN) architecture for abnormal events detection in a surveillance system. When training such system, it is a challenge that only normal samples are available for training. In addition, there are various contents inside a surveillance video frame, such like lawn, pedestrian walking area, trees, etc., and each has its own “typical/atypical” pattern. In solving the first problem, we propose a scheme to separate training data into “typical” and “atypical” that serve the roles of negative and positive training samples. In the second problem, we first divide a frame into several blocks. Then, for each block, an adaptive cascaded 3D-CNN classifier is trained. In this way, most of irrelevant blocks (normal) are discarded and only those abnormal or confusing normal blocks are kept for further computations. We evaluate our approach on popular dataset and show that our approach is competitive to state-of-the-art methods.