-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy path_dataset-template.qmd
63 lines (47 loc) · 2.28 KB
/
_dataset-template.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
---
title: A descriptive dataset title
author: Your Name
date: Today's Date (e.g., April 17, 2023)
description: A one- or two-sentence description of the data. If possible, give a brief summary of the sports problem and statistical situation.
categories:
- list the relevant
- statistical methods
- that can be used
- with this dataset
- one per line
- with two spaces and a hyphen in front
---
## Motivation
The categories above determine how this dataset is listed on the [datasets by
methods](https://data.scorenetwork.org/by-statsds-topic.html) page.
Consult that page for a list of statistical categories already used by other
datasets.
In this first section, describe the source of the dataset and what it's about.
Give any necessary background about it and the sports research question of interest. See other
datasets on the website for examples.
This file is Markdown, so you *can* use formatting; [here is a guide to the
basics](https://quarto.org/docs/authoring/markdown-basics.html).
## Data
Describe the data. What does each row represent? How many rows are there? If
there is missingness, say how it is coded and why it is present.
| Variable | Description |
|----|-------------|
| Column name | A description of this variable, including units when possible |
| Column name 2 | A description of this variable, including units when possible |
: [filename.csv](../data/filename.csv)
For the link above, change filename.csv to the name of the data file. Leave the
`../data/` part in the link; this is where the data will live in the system.
If there are multiple data files, repeat the table, including the line starting
with `:`, for each data file.
## Questions
If possible, list some analyses or research questions that could be answered
with this dataset, or the types of questions you'd ask students. This is mainly
meant to give other instructors ideas for how they could use the data.
## References
Give references to the original source here, such as by pasting the package
citation:
Petti B, Gilani S (2022). _baseballr: Acquiring and Analyzing Baseball
Data_. R package version 1.3.0,
<https://CRAN.R-project.org/package=baseballr>.
If the dataset has a particular license, mention the license here. For example,
"Data available under the Creative Commons Attribution license."