- A machine learning model that can recognize and classify different types of activities using smartphone sensor data
- Use different tools/frameworks to analyse data

Setup:

```
pip install --upgrade -r requirements.txt
```

Data preparation (execute in the project root directory):

```
# cleaning.py INPUT_DIR MIN_INTERVAL_SEC
spark-submit --driver-memory 3G src/cleaning.py data/all_data_v2 60
```
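
For orientation, the snippet below is a minimal sketch of the kind of windowed aggregation a cleaning step like this might perform with PySpark. The column names (`timestamp`, `x`, `y`, `z`, `activity`) and the CSV layout are assumptions for illustration only; `src/cleaning.py` is the authoritative implementation.

```python
# Sketch: bucket raw sensor readings into fixed-length time windows and
# aggregate each window, writing the result to "<INPUT_DIR>-oput_<SEC>s".
import sys

from pyspark.sql import SparkSession, functions as F


def main(input_dir: str, interval_sec: int) -> None:
    spark = SparkSession.builder.appName("cleaning-sketch").getOrCreate()

    # Assumed layout: CSV files with timestamp, x, y, z, activity columns.
    raw = spark.read.csv(input_dir, header=True, inferSchema=True)

    grouped = (
        raw.withColumn("timestamp", F.to_timestamp("timestamp"))
           .withColumn("window", F.window("timestamp", f"{interval_sec} seconds"))
           .groupBy("window", "activity")
           .agg(
               F.avg("x").alias("x_mean"),
               F.avg("y").alias("y_mean"),
               F.avg("z").alias("z_mean"),
           )
           .withColumn("window_start", F.col("window.start"))
           .drop("window")  # CSV cannot store the struct-typed window column
    )

    grouped.write.mode("overwrite").csv(f"{input_dir}-oput_{interval_sec}s")
    spark.stop()


if __name__ == "__main__":
    main(sys.argv[1], int(sys.argv[2]))
```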

PROJ_ROOT:

```
.
├── data/
│   └── all_data_v2
├── model/
│   ├── classifier.trs
│   └── vae.trs
├── module/
│   ├── __init__.py
│   ├── nets.py
│   └── util.py
├── src/
│   ├── activity_recognition.py
│   ├── addLabel.py
│   ├── cleaning.py
│   ├── RFC_act_rec.py
│   └── torch/
│       ├── exp_torch.py
│       ├── __init__.py
│       ├── loader.py
│       └── pre_loader.py
├── pyproject.toml
├── README.md
└── requirements.txt
```

The full dataset can be downloaded from the following link: https://drive.google.com/drive/folders/1qsQ0GcVMYLuoDPEXPlTfmGtm_HmPXZpI?usp=share_link

After cloning the repo and cd'ing into the project root directory, you can start running the cleaning code:

```
spark-submit src/cleaning.py example_data 60
```

- Optional: add the argument `--driver-memory [#]g`, replacing the `[#]` with the amount of memory you want to run the program with. With our full dataset we found 3g was the sweet spot.

After running the cleaning script you should see a folder called `example_data-oput_60s`.

- This contains the grouped data that will train the sklearn models.
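
To sanity-check the cleaned output before training, something like the snippet below can be used. It assumes the cleaning step writes CSV part files into the output folder; if your output uses a different format, adjust the glob pattern and reader accordingly.

```python
# Load and inspect the grouped window data produced by the cleaning step.
# The CSV assumption and the folder name below mirror the example above.
import glob

import pandas as pd

parts = glob.glob("example_data-oput_60s/*.csv")
df = pd.concat((pd.read_csv(p) for p in parts), ignore_index=True)

print(df.shape)
print(df.head())
```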

Run the command:

```
python3 src/RFC_act_rec.py example_data-oput_60s example_ML_testing_data-oput_60s
```

- This will create and test RandomForest and MLPClassifier models and output their results to `src/output_pd/` (a simplified sketch of this train-and-test flow follows this list).
- The second argument in the example (`example_ML_testing_data-oput_60s`) has to contain the cleaned data files used for testing; the files currently in `example_ML_testing_data` are testing files, provided for your convenience, that don't overlap with the files in `example_data/`.
- These files can be generated manually by following the first step and replacing `example_data` with `example_ML_testing_data`.
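
For readers unfamiliar with the script, here is a simplified, hypothetical sketch of the train-and-test flow described above. The input file names and feature columns are placeholders for illustration, not the actual ones used by `src/RFC_act_rec.py`.

```python
# Sketch: fit and evaluate a RandomForest and an MLP classifier on windowed
# sensor features. File names and column names below are placeholders.
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.neural_network import MLPClassifier

train_df = pd.read_csv("train_windows.csv")  # hypothetical cleaned training data
test_df = pd.read_csv("test_windows.csv")    # hypothetical cleaned testing data

feature_cols = ["x_mean", "y_mean", "z_mean"]  # placeholder feature names
X_train, y_train = train_df[feature_cols], train_df["activity"]
X_test, y_test = test_df[feature_cols], test_df["activity"]

for name, model in [
    ("RandomForest", RandomForestClassifier(n_estimators=100, random_state=0)),
    ("MLPClassifier", MLPClassifier(max_iter=500, random_state=0)),
]:
    model.fit(X_train, y_train)
    print(name)
    print(classification_report(y_test, model.predict(X_test)))
```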

For the PyTorch model (`src/torch/exp_torch.py`), run the cleaning code again from the project root directory, this time with the extra `True` argument:

```
spark-submit src/cleaning.py example_data 60 True
```

- Optional: add the argument `--driver-memory [#]g`, replacing the `[#]` with the amount of memory you want to run the program with. With our full dataset we found 3g was the sweet spot.

After running the cleaning script you should see a folder called `example_data-oput_60s`.

- This contains the grouped data for use with the PyTorch model.

Run the command (in the project root directory):

```
PYTHONPATH=./ python3 src/torch/exp_torch.py example_data-oput_60s
```

- This will train and test the PyTorch models and output their results to `src/output_pd/` (a minimal sketch of this kind of training loop follows below).
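
For orientation only, here is a minimal, self-contained sketch of the kind of PyTorch training loop a script like `exp_torch.py` runs. The network architecture, data shapes, and checkpoint name are placeholders and assumptions, not the repo's actual definitions; see `module/nets.py` and `src/torch/loader.py` for those.

```python
# Sketch: train a small classifier on dummy windowed sensor features and save
# its weights. All shapes and names here are illustrative placeholders.
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Dummy windowed sensor features (256 windows x 3 features) and activity labels.
features = torch.randn(256, 3)
labels = torch.randint(0, 4, (256,))
loader = DataLoader(TensorDataset(features, labels), batch_size=32, shuffle=True)

classifier = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 4))
optimizer = torch.optim.Adam(classifier.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(5):
    for x, y in loader:
        optimizer.zero_grad()
        loss = loss_fn(classifier(x), y)
        loss.backward()
        optimizer.step()

# The trained weights could then be saved the way the repo's model/classifier.trs
# checkpoint presumably was (an assumption):
torch.save(classifier.state_dict(), "classifier.trs")
```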