tidy-data-script removing species from the dataset #1

gavinfay · 2022-10-26T13:48:35Z

@AngeliaMiller @CataRoman It looks like the tidy-data-script.R is hardwiring a selection of ~10 species, thus the 'complete data' is not complete. Is this intentional?

The text was updated successfully, but these errors were encountered:

gavinfay · 2022-10-26T13:49:51Z

General suggestion, include some notes at the top of each R script summarizing what the code in that file does. ie the objective / part it plays in the analysis workflow, what it takes, what it outputs, etc.
e.g. There seems to be a lot of work in tidy-data-script.R that is redone/reorganized in complete-datasets other than aggregating spiny dogfish and skates.

AngeliaMiller · 2022-10-26T15:36:29Z

You are correct that part of tidy-data-script.R is hardwiring a selection of 10 species, thereby forming an incomplete 'complete dataset'. The script was initially created for the summary statistics for the workshop. I will make a copy of this script, adjust to tidy the full data set (~1960s and ~1000 species), and include some notes at the top for each script. My intention was to have tidy-data-script.R be a script for tidying the full dataset not just the 10 species.

Some of the work in the tidy-data-script and complete-datasets, may be the same because some information was lost when we use complete(). I will take a look at it again and move anything from complete-datasets.R that could/should be done in tidy-data-script.R.

gavinfay · 2022-10-27T11:28:07Z

Thanks. See the file in the data-cleaning branch referenced in issue #3 that moves towards this. I think we want to have a 'base' raw data set that everyone can then use, with some operations being case-specific. (e.g. the spatial configuration of subsets of data)

AngeliaMiller self-assigned this Oct 26, 2022

AngeliaMiller mentioned this issue Nov 11, 2022

in data-cleaning branch, create a data cleaning script that produces a clean full data set #3

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tidy-data-script removing species from the dataset #1

tidy-data-script removing species from the dataset #1

gavinfay commented Oct 26, 2022

gavinfay commented Oct 26, 2022 •

edited

Loading

AngeliaMiller commented Oct 26, 2022

gavinfay commented Oct 27, 2022

tidy-data-script removing species from the dataset #1

tidy-data-script removing species from the dataset #1

Comments

gavinfay commented Oct 26, 2022

gavinfay commented Oct 26, 2022 • edited Loading

AngeliaMiller commented Oct 26, 2022

gavinfay commented Oct 27, 2022

gavinfay commented Oct 26, 2022 •

edited

Loading