Merge bcaffo-courses into dss-courses #140

Open
wants to merge 222 commits into
base: master
7102d0f
Merge pull request #4 from TroyOfHelen/master
bcaffo May 25, 2014
1fbc1ff
Fixed the lecture image stuff
bcaffo May 25, 2014
1fc7f65
Tried to fix the pdf conversion
bcaffo May 25, 2014
8ae253b
completing the merge
bcaffo May 25, 2014
72371c2
hw4 added
bcaffo May 26, 2014
4e23c50
Fixing the pdf issues
bcaffo May 28, 2014
4b04f97
Converting pdfs
bcaffo May 28, 2014
db81e76
Still figuring out the pdf thing
bcaffo May 28, 2014
295ff1f
Think that I have it figured out
bcaffo May 28, 2014
deb6810
Converting more pdfs
bcaffo May 29, 2014
26553bc
Added names lectures
bcaffo May 29, 2014
8d00886
Added a utility function for copying files
bcaffo May 29, 2014
8f3b0d8
Added some asset files
bcaffo May 29, 2014
0415db6
Converted lecture 02_01 from portrait to landscape
bcaffo May 30, 2014
22d5228
Merge branch 'master' of github.com:bcaffo/courses
bcaffo May 30, 2014
554f989
Provided linked pdfs for data science
bcaffo May 30, 2014
85f61c5
Don't know why rCharts wasn't rendering
bcaffo May 30, 2014
eed6e31
Could not get js rendering to work
bcaffo May 30, 2014
2041a81
Merge branch 'master' of github.com:bcaffo/courses
bcaffo May 30, 2014
ab68675
working on the pdfs
bcaffo May 31, 2014
5e9f9c4
Working lecture pdfs
bcaffo May 31, 2014
c8d3970
Merge branch 'master' of github.com:bcaffo/courses
bcaffo Jun 1, 2014
42e94e7
Redid the slidify lecture pdf
bcaffo Jun 1, 2014
1ac1375
Added example project zip file
bcaffo Jun 2, 2014
b98300a
Merge remote-tracking branch 'upstreamBCaffo/master' into inference
TroyOfHelen Jun 2, 2014
1b5c2f5
Merge pull request #6 from TroyOfHelen/inference
bcaffo Jun 2, 2014
069b46f
fixed as.complex(x) to show NAs
gdhorne Jun 3, 2014
a445d57
fixed a typo in hw1
bcaffo Jun 3, 2014
ce27fca
Another little typo in the hw
bcaffo Jun 3, 2014
3a63e6a
Arrggg another typo
bcaffo Jun 3, 2014
adb6f11
Merge pull request #8 from gdhorne/gdhorne
bcaffo Jun 3, 2014
08125c0
fix spelling on probability in a couple of places
bschousek Jun 3, 2014
4dfc5be
Merge pull request #9 from bschousek/master
bcaffo Jun 3, 2014
847aa28
Updated tutorial link under "Getting Started"
tksf Jun 4, 2014
9d212fc
fix a couple of typos
bschousek Jun 5, 2014
a14c1ac
Fixed a small error
bcaffo Jun 6, 2014
c51e60d
Merge branch 'master' of github.com:bcaffo/courses
bcaffo Jun 6, 2014
e6085f3
Fixed some homework errors
bcaffo Jun 8, 2014
042b0c9
Merge pull request #13 from bschousek/patch-1
bcaffo Jun 8, 2014
56a8f2b
Merge pull request #11 from tksf/master
bcaffo Jun 8, 2014
89ce1f1
Added gridExtra package to required library list
tksf Jun 9, 2014
9959d52
Added some requested zips
bcaffo Jun 10, 2014
4e743de
Merge branch 'master' of github.com:bcaffo/courses
bcaffo Jun 10, 2014
69ca254
fix typo in R^2 math
gostevehoward Jun 12, 2014
1306901
Fix a typo "corrrelation"
gostevehoward Jun 12, 2014
8723198
Modified hw2-4 and hw4-3
mmulchandani Jun 13, 2014
dabecc9
Merge pull request #17 from mmulchandani/master
bcaffo Jun 16, 2014
a4ba3aa
Merge pull request #16 from gostevehoward/master
bcaffo Jun 16, 2014
09bbbd6
Merge pull request #14 from tksf/patch-2
bcaffo Jun 16, 2014
0ab6042
Tried fixing the display problems
bcaffo Jun 16, 2014
9aafe08
Recompiled HW4. Not sure what the issue was
bcaffo Jun 17, 2014
c9196fa
Updated hw3.Rmd to fix Q2(hint and answer) and Q3(typo)
mmulchandani Jun 21, 2014
9bb603d
Merge pull request #18 from mmulchandani/master
bcaffo Jun 29, 2014
8a40634
Merge branch 'master' of github.com:bcaffo/courses
bcaffo Jun 30, 2014
6871078
Commiting a bunch of minor stuff
bcaffo Jun 30, 2014
583bda6
Fixed 01_02 to be a lower level lecture
bcaffo Jul 1, 2014
1a02c0c
Worked through the lecture
bcaffo Jul 1, 2014
a7418d3
expectation lecture
bcaffo Jul 4, 2014
98ffd71
Converted most of the figures to ggplot2
bcaffo Jul 6, 2014
0376888
Finished converting the expected value lecture
bcaffo Jul 6, 2014
59271d4
FInished the expecation lecture
bcaffo Jul 6, 2014
38062b5
Moving around some things
bcaffo Jul 7, 2014
7b4ee14
Finished the variance lecture
bcaffo Jul 7, 2014
5404659
Massive amount of reorganization
bcaffo Jul 7, 2014
4406ef7
Worked on the common distros lecture
bcaffo Jul 7, 2014
beb3582
Finished the common distros lecture
bcaffo Jul 7, 2014
1d8d96b
Made some small changes for 03
bcaffo Jul 7, 2014
4f67ec0
In Problem 2 as originally worded, "data" don't have standard errors;…
rthisted Jul 7, 2014
e88f319
Problem 3 improve wording for non-English speakers
rthisted Jul 7, 2014
85c927e
The BMI comparison problem doesn't provide a place to submit the answ…
rthisted Jul 7, 2014
ccb9aaf
The BMI comparison problem doesn't provide a place to submit the answ…
rthisted Jul 7, 2014
0580042
Unnecessary intermediate html version deleted
rthisted Jul 7, 2014
48b5b45
Fix placement of instructions on question
rthisted Jul 7, 2014
d86c933
correct typos
rthisted Jul 7, 2014
63fb4a7
Added pdfs for the first three lectures
bcaffo Jul 8, 2014
0d2c396
Update index.Rmd
tcarnus Jul 8, 2014
2181d74
Update index.Rmd
tcarnus Jul 8, 2014
00de5ee
Merge pull request #20 from rthisted/master
bcaffo Jul 8, 2014
2575097
Added use arrow keys to the title slides
bcaffo Jul 8, 2014
bd3ba1e
Added hw3.html
bcaffo Jul 8, 2014
abf411d
Reworded hw4
bcaffo Jul 8, 2014
1bafb65
Fixed some typos
bcaffo Jul 8, 2014
c197d4e
Fix typo
bschousek Jul 9, 2014
2c2d6eb
slide 17 typo: red line
roncoursera Jul 9, 2014
586ecb6
slide 17 typo: red line
roncoursera Jul 9, 2014
12a3c3f
typos: calculus, form -> from
roncoursera Jul 9, 2014
79f6c85
typos: calculus, form -> from
roncoursera Jul 9, 2014
a0216b5
typos: calculus, form -> from
roncoursera Jul 9, 2014
267c9ab
slide 17 typo: red line
roncoursera Jul 9, 2014
2b25f43
Fixed LATEX/ R typo from Regression Models 01_03
clifflu Jul 9, 2014
11ad4aa
Added a binder pdf
bcaffo Jul 10, 2014
3398a7e
Added plotly data
bcaffo Jul 10, 2014
329d430
Update hw2.rmd, change the serial number
jimmysun Jul 11, 2014
d4dece3
Fixed up the regression lecture
bcaffo Jul 11, 2014
a34b79e
Merge pull request #1 from bcaffo/master
gugclo Jul 11, 2014
9c0027a
Fixed expression to Variance.
Entodi Jul 12, 2014
d2e1489
Edit Fixed expression to Variance
Entodi Jul 12, 2014
4b365bc
Update hw1.md
Entodi Jul 12, 2014
bb482b3
Finished asymptotics lecture
bcaffo Jul 14, 2014
be552d1
Worked on the common distros lecture
bcaffo Jul 14, 2014
c076601
Merge pull request #31 from Entodi/master
bcaffo Jul 14, 2014
63e661a
Worked on regression lecture
bcaffo Jul 14, 2014
b6e8e1d
Merge pull request #30 from gugclo/master
bcaffo Jul 14, 2014
929d455
Merge pull request #29 from jimmysun/patch-1
bcaffo Jul 14, 2014
d1b24e0
Merge pull request #28 from clifflu/typo_RM_01_03_patch01
bcaffo Jul 14, 2014
a7fcdfd
Merge pull request #27 from roncoursera/master
bcaffo Jul 14, 2014
8e6f880
Merge pull request #26 from bschousek/gh-pages
bcaffo Jul 14, 2014
5e7822f
Merge pull request #23 from tcarnus/patch-3
bcaffo Jul 14, 2014
5b6c688
Update index.Rmd
bcaffo Jul 14, 2014
b0c4f3c
Update index.Rmd
bcaffo Jul 14, 2014
1b6d06f
Reverted the commits where I accidentally committed devel stuff into …
bcaffo Jul 14, 2014
0615d1d
Added some figure chunks from slidify
bcaffo Jul 14, 2014
311c6e7
Merge branch 'gh-pages' of github.com:bcaffo/courses into gh-pages
bcaffo Jul 14, 2014
f04ba6a
Compiled the Rmd files
bcaffo Jul 14, 2014
afea117
Fixed a typo
bcaffo Jul 14, 2014
19929c8
Fixed typo in hw4
bcaffo Jul 14, 2014
6bec6d2
Added some cached figures
bcaffo Jul 15, 2014
1e82ffe
Worked or lecture 8
bcaffo Jul 15, 2014
1778e65
line 217 was missing /10 in homework #2 item 11
whitefishdontjump Jul 15, 2014
22dda8d
Finished recording lecture 06
bcaffo Jul 18, 2014
4d35a8a
Fix typo in link's target.
crorvick Jul 21, 2014
611a8c7
Finished 08
bcaffo Jul 23, 2014
bcd6c97
Merge pull request #34 from crorvick/fix-typo
bcaffo Jul 23, 2014
16e9f05
Merged in the new lectures
bcaffo Jul 27, 2014
065bdbb
Organized the master directory
bcaffo Jul 27, 2014
0c8121b
Moved arround some lectures
bcaffo Jul 27, 2014
9e377e9
Removed the two group interval lecture since it's been merged into th…
bcaffo Jul 27, 2014
97352ea
Pulled back the old lectures so that I don't break links for the curr…
bcaffo Jul 27, 2014
f7c9687
Fixed some git merge errors in lecture 1
bcaffo Jul 28, 2014
033b621
Fixed the merge errors in 01
bcaffo Jul 28, 2014
e7e6c92
Added lecture 2 pdf
bcaffo Jul 28, 2014
5bb0756
Added lecture 3 pdf
bcaffo Jul 28, 2014
49d5b04
Added lecture 4 pdf
bcaffo Jul 28, 2014
8cd3077
Added lecture 5 pdf
bcaffo Jul 28, 2014
901ffe9
Added lecture 6 pdf
bcaffo Jul 29, 2014
a34b6c3
Added the lecture 7 pdf
bcaffo Jul 29, 2014
4d4ce01
Worked on the HT lecture
bcaffo Jul 29, 2014
f3d55a9
Merge branch 'devel'
bcaffo Jul 29, 2014
2f9270e
Added lecture 8 pdf
bcaffo Jul 30, 2014
ecc2a3d
Added lecture 9
bcaffo Aug 1, 2014
d0dde3e
Added 08 pdf
bcaffo Aug 1, 2014
11a4411
Merge branch 'master' into gh-pages
bcaffo Aug 1, 2014
88cb5b0
Fixing typos: it's -> its
vatirr Aug 5, 2014
57a3626
Typo fixed
Alexander-Minyushkin Aug 5, 2014
f66a592
Merge pull request #36 from Alexander-Minyushkin/patch-1
bcaffo Aug 19, 2014
49bda01
Merge pull request #35 from vatirr/master
bcaffo Aug 19, 2014
ff901eb
Merge pull request #33 from whitefishdontjump/patch-1
bcaffo Aug 19, 2014
a11db3e
Merge pull request #24 from tcarnus/patch-4
bcaffo Aug 19, 2014
8539ba7
Added some small changes to the P-values lecture
bcaffo Aug 20, 2014
d99145a
Editted the power lecture
bcaffo Aug 20, 2014
e9b95d4
small changes in lecture 11
bcaffo Aug 21, 2014
3aa5a88
Correction to Slide 5 (formula for the log likelihood ratio)
rthisted Aug 21, 2014
834034e
Changed the PDFs for a few lectures
bcaffo Aug 22, 2014
a774404
Merge branch 'master' into gh-pages
bcaffo Aug 22, 2014
917b727
Merge branch 'master' of https://github.com/bcaffo/courses
Aug 23, 2014
5597354
minor fixes to the resampling lecture as part of the course update
bcaffo Aug 24, 2014
ed8eccf
Trying to get 13 to exist as a stand alone
bcaffo Aug 24, 2014
3cd703c
Finished lecture 13
bcaffo Aug 24, 2014
acb3541
Made minor modifications to lecture 2
bcaffo Sep 3, 2014
d137e22
Merge branch 'master' of https://github.com/bcaffo/courses
rthisted Sep 3, 2014
2840099
Merge branch 'master' into gh-pages
bcaffo Sep 4, 2014
40c9230
Merge branch 'gh-pages' of github.com:bcaffo/courses into gh-pages
bcaffo Sep 4, 2014
41f0313
Converted RTTM lecture to ggplot2
bcaffo Sep 4, 2014
ee42232
slight work on the linear regression lecture
bcaffo Sep 4, 2014
3e5b14b
Added some ggplot to the lectures
bcaffo Sep 4, 2014
03124ee
Nightly commit, small lecture changes
bcaffo Sep 5, 2014
a1fd860
slightly revised lecture 01_05 for new recording
bcaffo Sep 9, 2014
40abb91
Revised residual variation
bcaffo Sep 10, 2014
cfb3523
Worked on the residuals lecture
bcaffo Sep 15, 2014
ab49825
Worked on the inference regression slides
bcaffo Sep 16, 2014
2191d4e
Fixed 02_02 pdf
bcaffo Sep 19, 2014
d7590b6
Lecture uses "pooled standard deviation", not "common".
Sep 24, 2014
e7e937d
Minor changes to 02_01
bcaffo Sep 25, 2014
9031ee3
Added a small change to 02_01
bcaffo Sep 25, 2014
5b686e3
Added lectures as zip file
bcaffo Oct 5, 2014
9efa928
Added rmd zip directory
bcaffo Oct 5, 2014
9fd4674
Update index.Rmd
kiistala Oct 6, 2014
cf24262
Fixed a homework typo
bcaffo Oct 10, 2014
b96e396
hw1: correct typo and error in hint
Oct 14, 2014
780d139
Update index.md
snickels Oct 19, 2014
5333789
Improve formula display in least squares=sample mean derivation
Nov 4, 2014
864e01e
Fix typo
panta- Nov 10, 2014
9a36e07
minus sign should be in math mode
Radcliffe Dec 2, 2014
4649221
input boxes won't display without question numbers?
Radcliffe Dec 2, 2014
b532f63
Merge branch 'master' of github.com:bcaffo/courses
bcaffo Dec 4, 2014
f98a31a
typos
Gubaer Dec 9, 2014
324619f
typos
Gubaer Dec 9, 2014
3065dfa
added loading of ggplot2 and manipulate libraries for first example
irJERAD Dec 19, 2014
8d875ee
Commiting changes that I've forgotten about
bcaffo Jan 9, 2015
e10f279
I believe this is a typo...
nilbot Jan 10, 2015
2ec8439
typo
mkoconnor Jan 10, 2015
eaf84ee
html lectures reconverted to pdf with active urls
whitefishdontjump Jan 12, 2015
8fdc485
Merge pull request #61 from whitefishdontjump/master
bcaffo Jan 29, 2015
7fa7e82
Merge pull request #60 from mkoconnor/master
bcaffo Jan 29, 2015
707ca5b
Merge pull request #59 from nilbot/patch-1
bcaffo Jan 29, 2015
3ea69f3
Merge pull request #58 from irJERAD/master
bcaffo Jan 29, 2015
7c4ec75
Merge pull request #57 from Gubaer/07_Asymptotia.typos
bcaffo Jan 29, 2015
5ea3264
Merge pull request #53 from panta-/gh-pull-requests
bcaffo Jan 29, 2015
ba6ee1a
Merge pull request #54 from Radcliffe/master
bcaffo Jan 29, 2015
faebf04
Merge pull request #51 from aconley/Regression_sumdisambiguate
bcaffo Jan 29, 2015
a7bc1b2
Merge pull request #50 from snickels/master
bcaffo Jan 29, 2015
8396add
Merge pull request #49 from rawdamedia/master
bcaffo Jan 29, 2015
e8d3448
Merge pull request #38 from rthisted/master
bcaffo Jan 29, 2015
73f61a8
Merge pull request #42 from piggyatbaqaqi/master
bcaffo Jan 29, 2015
751eea4
Merge pull request #44 from kiistala/master
bcaffo Jan 29, 2015
9d59633
Moved the old lectures to the a directory calle old
bcaffo Feb 7, 2015
209617e
Fixing merge issues
bcaffo Feb 7, 2015
2b2a2c7
Moved one more directory
bcaffo Feb 7, 2015
9d3a9c8
renamed old directory to old markdown
bcaffo Feb 7, 2015
24c8375
Trying to fix repo errors and inconsistencies
bcaffo Feb 16, 2015
b63a5b2
css and javascript library changes
bcaffo Feb 16, 2015
1dd0aae
test
bcaffo Feb 16, 2015
ef6f78f
trying to fix sync issues
bcaffo Feb 16, 2015
0558768
added links to hw1
bcaffo Mar 2, 2015
d4c8b6b
Merge branch 'master' of github.com:bcaffo/courses
bcaffo Mar 2, 2015
33a6770
added links to hw1
bcaffo Mar 2, 2015
59623e0
bunch of slidify files
bcaffo Jun 29, 2015
1c1372e
fixed small errors
bcaffo Jul 20, 2015
76e8a43
small changes
bcaffo Jul 20, 2015
e924dcb
Several small changes
bcaffo Jul 22, 2015
f5389a2
Update index.Rmd
bcaffo Aug 17, 2015
9d994b1
added hw2.rmd
bcaffo Sep 16, 2015
1 change: 1 addition & 0 deletions .gitignore
@@ -4,3 +4,4 @@
.Rproj.user
.RData

+*.db
6 changes: 4 additions & 2 deletions 02_RProgramming/DataTypes/index.Rmd
@@ -200,7 +200,9 @@ NAs introduced by coercion
> as.logical(x)
[1] NA NA NA
> as.complex(x)
-[1] 0+0i 1+0i 2+0i 3+0i 4+0i 5+0i 6+0i
+[1] NA NA NA NA
+Warning message:
+NAs introduced by coercion
```

---
@@ -472,4 +474,4 @@ Data Types

- data frames

- names
- names
6 changes: 4 additions & 2 deletions 02_RProgramming/DataTypes/index.html
@@ -263,7 +263,9 @@ <h2>Explicit Coercion</h2>
&gt; as.logical(x)
[1] NA NA NA
&gt; as.complex(x)
-[1] 0+0i 1+0i 2+0i 3+0i 4+0i 5+0i 6+0i
+[1] NA NA NA NA
+Warning message:
+NAs introduced by coercion
</code></pre>

</article>
@@ -636,4 +638,4 @@ <h2>Summary</h2>
<script src="../../libraries/highlighters/highlight.js/highlight.pack.js"></script>
<script>hljs.initHighlightingOnLoad();</script>
<!-- DONE LOADING HIGHLIGHTER JS FILES -->
</html>
</html>
4 changes: 3 additions & 1 deletion 02_RProgramming/DataTypes/index.md
@@ -200,7 +200,9 @@ NAs introduced by coercion
> as.logical(x)
[1] NA NA NA
> as.complex(x)
-[1] 0+0i 1+0i 2+0i 3+0i 4+0i 5+0i 6+0i
+[1] NA NA NA NA
+Warning message:
+NAs introduced by coercion
```

---
Binary file added 02_RProgramming/assets/img/Thumbs.db
Binary file not shown.
Binary file added 04_ExploratoryAnalysis/assets/img/Thumbs.db
Binary file not shown.
Binary file removed 06_StatisticalInference/01_02_Probability/index.pdf
Binary file not shown.
161 changes: 161 additions & 0 deletions 06_StatisticalInference/01_Introduction/index.Rmd
@@ -0,0 +1,161 @@
---
title : Introduction to statistical inference
subtitle : Statistical inference
author : Brian Caffo, Jeff Leek, Roger Peng
job : Johns Hopkins Bloomberg School of Public Health
logo : bloomberg_shield.png
framework : io2012 # {io2012, html5slides, shower, dzslides, ...}
highlighter : highlight.js # {highlight.js, prettify, highlight}
hitheme : tomorrow #
url:
  lib: ../../librariesNew
  assets: ../../assets
widgets : [mathjax] # {mathjax, quiz, bootstrap}
mode : selfcontained # {standalone, draft}
---

## Statistical inference defined

Statistical inference is the process of drawing formal conclusions from
data.

In our class, we will define formal statistical inference as settings where one wants to infer facts about a population using noisy
statistical data where uncertainty must be accounted for.

---

## Motivating example: who's going to win the election?

In every major election, pollsters would like to know, ahead of the
actual election, who's going to win. Here, the target of
estimation (the estimand) is clear: the percentage of people in
a particular group (city, state, county, country or other electoral
grouping) who will vote for each candidate.

We cannot poll everyone. Even if we could, some of those polled
may change their vote by the time the election occurs.
How do we collect a reasonable subset of data and quantify the
uncertainty in the process to produce a good guess at who will win?
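
As a rough illustration of the polling problem, the sketch below simulates a poll and reports an approximate 95% interval for the proportion supporting a candidate. The true support (`p_true`), the sample size (`n`) and the seed are made-up values chosen only for illustration, and simple random sampling is assumed.

```r
# Hypothetical poll: all numbers here are illustrative assumptions.
set.seed(1)
p_true <- 0.52   # assumed true proportion supporting the candidate
n <- 1000        # assumed number of people polled

# Simulate votes under simple random sampling (1 = supports the candidate)
votes <- rbinom(n, size = 1, prob = p_true)

# Point estimate and its estimated standard error
p_hat <- mean(votes)
se <- sqrt(p_hat * (1 - p_hat) / n)

# Approximate 95% (Wald) confidence interval
ci <- p_hat + c(-1, 1) * qnorm(0.975) * se
round(c(estimate = p_hat, lower = ci[1], upper = ci[2]), 3)
```

With a sample of this size the interval is roughly plus or minus three percentage points, so a close race can easily remain too close to call from the poll alone.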

---

## Motivating example: is hormone replacement therapy effective?

A large clinical trial (the Women’s Health Initiative) published results in 2002 that contradicted prior evidence on the efficacy of hormone replacement therapy (HRT) for postmenopausal women and suggested a negative impact of HRT on several key health outcomes. **Based on a statistically based protocol, the study was stopped early due to an excess number of negative events.**

Here there are two inferential problems.

1. Is HRT effective?
2. How long should we continue the trial in the presence of contrary
evidence?

See the WHI writing group paper (JAMA 2002, Vol 288:321-333) for the study and Steinkellner et al. (Menopause 2012, Vol 19:616-621) for a discussion of the long-term impacts.

---

## Motivating example
### Brain activation

![fMRI salmon study](fig/fmri-salmon.jpg 'fMRI salmon study')

http://www.wired.com/2009/09/fmrisalmon/


---

## Summary

- These examples illustrate many of the difficulties of trying
to use data to create general conclusions about a population.
- Paramount among our concerns are:
  - Is the sample representative of the population that we'd like to draw inferences about?
  - Are there known and observed, known and unobserved, or unknown and unobserved variables that contaminate our conclusions?
  - Is there systematic bias created by missing data or the design or conduct of the study?
  - What randomness exists in the data and how do we use or adjust for it? Here randomness can either be explicit via randomization
    or random sampling, or implicit as the aggregation of many complex unknown processes.
  - Are we trying to estimate an underlying mechanistic model of phenomena under study?
- Statistical inference requires navigating the set of assumptions and
tools and subsequently thinking about how to draw conclusions from data.

---
## Example goals of inference

1. Estimate and quantify the uncertainty of an estimate of
a population quantity (the proportion of people who will
vote for a candidate).
2. Determine whether a population quantity
is a benchmark value ("is the treatment effective?").
3. Infer a mechanistic relationship when quantities are measured with
noise ("What is the slope for Hooke's law?")
4. Determine the impact of a policy ("If we reduce pollution levels,
will asthma rates decline?").
5. Talk about the probability that something occurs.
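
Goal 3 can be sketched with simulated data: Hooke's law says force is proportional to extension, so the slope of a least squares line through the origin estimates the spring constant. The spring constant `k_true`, the noise level and the extensions below are hypothetical values, not measurements.

```r
# Simulated Hooke's law data: every number here is an illustrative assumption.
set.seed(4)
k_true <- 2.5                        # assumed spring constant (arbitrary units)
x <- seq(0.1, 1, length.out = 20)    # hypothetical extensions
force <- k_true * x + rnorm(length(x), sd = 0.1)  # noisy force measurements

# Least squares fit through the origin, since F = k x has no intercept
fit <- lm(force ~ x - 1)

coef(fit)      # estimated spring constant
confint(fit)   # 95% confidence interval for the slope
```

The interval from `confint` quantifies how much the noise limits what can be said about the slope.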

---
## Example tools of the trade

1. Randomization: concerned with balancing unobserved variables that may confound inferences of interest
2. Random sampling: concerned with obtaining data that is representative
of the population of interest
3. Sampling models: concerned with creating a model for the sampling
process; the most common is the so-called "iid" model.
4. Hypothesis testing: concerned with decision making in the presence of uncertainty
5. Confidence intervals: concerned with quantifying uncertainty in
estimation
6. Probability models: a formal connection between the data and a population of interest. Often probability models are assumed or are
approximated.
7. Study design: the process of designing an experiment to minimize biases and variability.
8. Nonparametric bootstrapping: the process of using the data to,
with minimal probability model assumptions, create inferences.
9. Permutation, randomization and exchangeability testing: the process
of using data permutations to perform inferences.
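
The two resampling tools in items 8 and 9 can be sketched in a few lines. The example uses simulated data for two hypothetical groups, `g1` and `g2`; the group sizes, means and number of resamples are arbitrary assumptions, and the percentile interval shown is only one of several possible bootstrap intervals.

```r
# Simulated two-group data: all values are illustrative assumptions.
set.seed(2)
g1 <- rnorm(30, mean = 0)      # outcomes for hypothetical group 1
g2 <- rnorm(30, mean = 0.8)    # outcomes for hypothetical group 2
obs_diff <- mean(g2) - mean(g1)
B <- 2000                      # number of resamples (arbitrary)

# Nonparametric bootstrap: resample each group with replacement
boot_diffs <- replicate(B, {
  mean(sample(g2, replace = TRUE)) - mean(sample(g1, replace = TRUE))
})
boot_ci <- quantile(boot_diffs, c(0.025, 0.975))   # percentile interval

# Permutation test: shuffle group labels under the null of no difference
pooled <- c(g1, g2)
perm_diffs <- replicate(B, {
  idx <- sample(length(pooled), length(g1))
  mean(pooled[-idx]) - mean(pooled[idx])
})
p_value <- mean(abs(perm_diffs) >= abs(obs_diff))   # two-sided p-value

boot_ci
p_value
```

Both procedures reuse the observed data in place of a fully specified probability model: the bootstrap to gauge the variability of the estimate, the permutation test to see how unusual the observed difference is when labels are exchangeable.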

---
## Different thinking about probability leads to different styles of inference

We won't spend too much time talking about this, but there are several different
styles of inference. Two broad categories that get discussed a lot are:

1. Frequency probability: is the long run proportion of
times an event occurs in independent, identically distributed
repetitions.
2. Frequency inference: uses frequency interpretations of probabilities
to control error rates. Answers questions like "What should I decide,
given my data, while controlling the long-run proportion of mistakes I make at
a tolerable level?"
3. Bayesian probability: is the probability calculus of beliefs, given that beliefs follow certain rules.
4. Bayesian inference: the use of Bayesian probability representation
of beliefs to perform inference. Answers questions like "Given my subjective beliefs and the objective information from the data, what
should I believe now?"

Data scientists tend to fall within shades of gray of these and various other schools of inference.
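
The frequency interpretation of probability can be illustrated with a small simulation: the running proportion of heads in repeated independent coin flips settles near the underlying probability. The fair coin, the seed and the number of flips are assumptions made for this sketch.

```r
# Simulated coin flips: the fair coin and number of flips are assumptions.
set.seed(3)
n_flips <- 10000
flips <- rbinom(n_flips, size = 1, prob = 0.5)   # 1 = heads

# Proportion of heads after each flip
running_prop <- cumsum(flips) / seq_len(n_flips)

# Proportion after 10, 100, 1000 and 10000 flips
running_prop[c(10, 100, 1000, 10000)]
```

Early proportions can be far from 0.5, while later ones vary much less, which is the long-run behavior the frequency definition refers to.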

---
## In this class

* In this class, we will primarily focus on basic sampling models,
basic probability models and frequency style analyses
to create standard inferences.
* Being data scientists, we will also consider some inferential strategies that rely heavily on the observed data, such as permutation testing
and bootstrapping.
* As probability modeling will be our starting point, we first build
up basic probability.

---
## Where to learn more on the topics not covered

1. Explicit use of random sampling in inferences: look in references
on "finite population statistics". Used heavily in polling and
sample surveys.
2. Explicit use of randomization in inferences: look in references
on "causal inference" especially in clinical trials.
3. Bayesian probability and Bayesian statistics: look for basic introductory books (there are many).
4. Missing data: well covered in biostatistics and econometric
references; look for references to "multiple imputation", a popular tool for
addressing missing data.
5. Study design: consider looking in the subject matter area that
you are interested in; some examples with rich histories in design:
    1. The epidemiological literature is very focused on using study design to investigate public health.
    2. The classical development of study design in agriculture broadly covers design and design principles.
    3. The industrial quality control literature covers design thoroughly.
