Pipeline to get data sources from Instituto Nacional de Salud - INS related to Covid19 cases daily report in Colombia to create datasets.
The number of new cases is increasing day by day around the world. This dataset has information about reported cases from 32 Colombia departments.
Here you can find the result from my auto-learning process about data science, this dataset has a daily report from Instituto Nacional de Salud - INS about Covid19 cases reported in Colombia, also has a history report from Instituto Nacional de Salud - INS about Covid19 Samples processed in Colombia.
This dataset uses the INS Covid19 report data source, I did clean the data source and fill the NaN values to generate this dataset with additional attributes like, day of the week, year, and month of the year.
covid19co.csv -> Daily report, Cases reported in Colombia
covid19co_samples_processed.csv -> Daily report, Samples processed in Colombia
covid19co_time_line.csv -> Timeline about cases reported, recuperated, and deceased in Colombia.
covid19co_samples_time_line.csv -> Timeline about samples processed in Colombia.
Date format DD/MM/YYYY for instance: 11/09/2001
This dataset is updated from an automatic pipeline, you can find the GitHub code repository here: Colombia Covid19 Pipeline
Dataset is obtained from Instituto Nacional de Salud - INS daily report Covid19 in Colombia. You can get the official dataset here: INS - Official Report
What questions do you want to see answered?
You can view and collaborate with the analysis here: colombia_covid_19_analysis Kaggle Notebook Kernel.
- Python 3:
- See requirements.txt file.
See ./src directory.
See ./output directory.
Directory | Readme |
---|---|
./chart | README.md |
./doc | README.md |
./input | README.md |
./output | README.md |
./src | README.md |
Please read and execute each step below:
Create and use Python virtual environment:
$promt> python3 -m venv .venv
$promt> source .venv/bin/activate
Install all Python requirements:
$promt> pip3 install -r requirements.txt
Run Pipeline script:
$promt> ./run.sh
The Pipeline output is generated within ./output directory.
Work in progress ...
$promt> jupyter nbconvert --to script ./src/covid19co_pipe.ipynb --output covid19co_pipe
That's all for now ...
Getting in touch with @sebaxtianbach
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.