In this project, we developed a video analysis system that can detect events of human falling.
The system receives video as input, scans each frame of the video, and then creates 17 key-points for each individual, each of which corresponds to a position of that person's body parts in that frame. This is done using the YOLOv7-POSE model.
For example
You can learn more about YOLOv7-POSE by reading this document.
We initially created an LSTM-based neural network that learned from a database of photos of humans that were categorized as "falling" and "not falling." We obtained roughly 500 images from the Internet for this purpose. After we used the data to train the model, we discovered that the network did not succeed in learning, and we got poor results using the test images.
After looking at the pictures, we found that some of them labeld as "falling" but contained multiple people who some of them were not actually falling (but only one of them) . Therefore, in order to improve our data, we created a tool that go through all of the data and extracted every person from each photo into a different photo. After that, we once again cleaned and tagged all of the new data.
Unfortunately, the model did not succeed in learning on the new data either. The amount of the data, which contains approximately 500 photos, is likely the key factor in the network learning failure, but this is the only data that is available on the Internet.
The original image database can be found here, while the new image database we made can be found here.
We considered creating a straightforward if-else-based model that would detect a fall in accordance with logical conditions regarding the relative location of the body parts after the LSTM-based network failed to learn from our image library.
Using our own data, we tested our model, and the results were, in our opinion, pretty good, as follows:
Accuracy | Precision | Recall | F1 score |
---|---|---|---|
81.18% | 83.27% | 83.58% | 83.42% |
For the video analysis we used Nvidia's Tesla K80 GPU, the system analyzes the videos at a reasonable speed of 15Fps.
- Clone this repository into your drive
- Download the YOLOv7-POSE model into
Human-Fall-Detection
directory. - Install all the requirements with
pip install -r requirements.txt
- Run main.py file
Examples of videos collected from the Internet and analyzed by our system are shown below.
These videos demonstrate how the model successfully and accurately recognizes human falls.
Using a camera and RTSP protocol, our system may be upgraded to a real-time system.
In order to alert human falls and save lives, the real-time system may be deployed and implemented in nursing homes, hospitals, and senior living facilities.