Skip to content

carloocchiena/the_statistics_handbook

Repository files navigation

The Statistics Handbook

The statistics handbook open source repository

Just give me the PDF and leave me alone

Ok, fine: click here (it's just the link from the above repo).

Version

Version 0.2 is out:

  • Revisited images.
  • Update of some explanations and formulas.
  • Typo fixing and general editing of doc layout.
  • Updated list of contributors.

Thanks to all the proof-readers and contributors! Can't believe this repo has more than 200 stars!

Contents of this repo

  • LICENSE: the license definition for this package.
  • README.md: the readme file.
  • The Statistics Handbook: the compiled version of the handbook.
  • Statistical Workbook.xlsx: the .xlsx file with exercises and demonstrations.
  • datachart_template.ai: the .ai file with the charts and image source file.
  • main.tex: is the latex source code of the handbook.
  • z-table.xlsx: the z-table generated with an Excel formula.
  • z_score_table_t_score_analysis.ipynb: interactive Jupyter Notebook to explore t-score and z-score testing.

Scope of this handbook

"Statistical analysis is the best way to predict events we do not know using information we do know."

We are used to talking generally about mathematical skills, thinking perhaps of derivatives, integrals, theorems, and graphs of functions.

Often we do that in an abstract way, as if they were certainly logical elements, but with just specific applications. Instead, we forget that not only are mathematical elements present in every single action, but that quantitative sciences are components of everyday life.

Specifically, I believe that statistics is among all the mathematical sciences the most fascinating because of the vastness and incredible opportunities for its application.

Every decision we make can be traced back to statistical phenomena, either innate (such as fear of the dark, because in the dark increases the likelihood of dangerous animals) or conscious (today I think it's likely to rain, so I'll take my umbrella).

On the other hand, approaching even basic statistical calculations (e.g., the infamous probability of winning the lottery) requires nontrivial skills in order to apply concepts and formulas that are not always complex but certainly have dissimilar results if used thoughtlessly. I claim for certain that worse than the lack of mathematical thinking is the misuse of mathematical thinking. This paper of mine is also in fact intended to combat my limitations through study and applications.

In this handbook, I wanted to create a path from the basics, including terminology (often one of the main obstacles for the laymen approaching the subject), to formulations of hypotheses, validations, and verification of formulas.

The path was constructed by consulting a large number of sources, cited in the appendix, and during long months of study and in-depth proof of the results and evidence, precisely because first and foremost I wanted to verify my own expertise, even before, of course, I could write about it.

Before releasing this publication, which is distributed under a Creative Common and Free Culture license, I asked for a check from eminent acquaintances with important academic and working backgrounds. I would like to endlessly thank all of them (their names can be found in the appropriate section). Nevertheless, I am staying receptive to additions, insights and corrections, taking full responsibility for any shortcomings and errors, certainly reported in good faith.

Happy reading!

Carlo, 26th of May 2024.

Featured

  • Positively voted on HackerNews (111 upvotes, 21 comments). Also made the "Buzzing on HN daily rank" for that day.
  • Cited in the popular Newsletter "La Cultura del Dato" by Stefano Gatti (episode #54).
  • LibHunt top 10 TeX project tagged "mathematics" (url).

About

the statistics handbook open source repository

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published