##Action Recognition on KTH Dataset.
- I used the STIP Binaries found at [here] (https://www.di.ens.fr/~laptev/download.html#stip) , to extract the STIPs with the HOG-HOF descriptors.
- The descriptors extracted are then clustered -using k-means with N cluster- in order to form a visual codebook with N words.
- A Bag of words is then constructed for each example (video sequence) based on the occurrences of the codewords in the given example.
- The examples (N+1-vector-BoW+Label) are then classified using Multi-Class non-Linear SVM.
<img src="/images/pipeline.png" width="400" height"200">
###Results :
I used RBF kernel for the SVM, SVM params: [gamma = 0.0002, C = 2]
Settings | HoF with 1000 cluster | HoG/HoF with 3000 cluster | HoG/HoF with 4000 cluster |
| ---------------------------|:----------------------------:| :----------------------------:|:-----------------------------:| | Accuracy | 88.98% | 90.07% | 83.89% |
Note: [Hof with 1000 clusters] was by far the fastet, it acheived 500% gain in performance in comparison with [HoG/HoF with 3000 clusters] ####Confusion Matrix of HoG/HoF with 3000 clusters
##References