Commit 7d2aea9

added summary of 0.1 class
1 parent a1daa9a commit 7d2aea9

3 files changed

+76
-2
lines changed

class/data-science-process.html

+74
@@ -0,0 +1,74 @@
1+
<!DOCTYPE html>
2+
<html>
3+
<head>
4+
5+
<meta name="viewport" content="width=device-width, initial-scale=1">
6+
7+
<!-- Latest compiled and minified CSS -->
8+
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/css/bootstrap.min.css" integrity="sha384-BVYiiSIFeK1dGmJRAkycuHAHRg32OmUcww7on3RYdg4Va+PmSTsz/K68vbdEjh4u" crossorigin="anonymous">
9+
10+
<!-- Optional theme -->
11+
<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/css/bootstrap-theme.min.css" integrity="sha384-rHyoN1iRsVXV4nD0JutlnGaslCJuC7uwjduW9SVrLvRYooPp2bWYgmgJQIXwl/Sp" crossorigin="anonymous">
12+
13+
<!-- Latest compiled and minified JavaScript -->
14+
<script src="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/js/bootstrap.min.js" integrity="sha384-Tc5IQib027qvyjSMfHjOMaLkfuWVxZxUPnCJA7l2mCWNIpG9mGCD8wGNIcPD7Txa" crossorigin="anonymous"></script>
15+
16+
<link rel="stylesheet" href="../style.css" />
17+
18+
<title>Data Science is a Process</title>
19+
20+
</head>
21+
<body>
22+
<p><a href="../index.html">Back to Homepage</a></p>
23+
24+
<!-- UPDATE -->
25+
<img src="../images/lines.png" class="img-responsive" />
26+
27+
<h1>Data Science is a Process</h1>
28+
<div class="lead">Benjamin Xie & Greg L. Nelson</div>
29+
30+
<p>
31+
We define data science as an <b>iterative process of augmenting human thinking with computational tools to use data to make decisions in/about the world.</b>
32+
</p>
33+
34+
<p>
35+
Let's decompose that definition:
36+
<ul>
37+
<li>
38+
<i>"iterative process"</i>: While we see data science as a process (explained below), it is very much an iterative one.
39+
We will often find ourselves jumping back to a previous step in the process or jumping "out of order" as the situation demands.
40+
</li>
41+
<li>
42+
<i>"augmenting human thinking with computational tools"</i>: Human thinking and reasoning are at the core of data science.
43+
We want to teach you first and foremost how to think like a data scientist. Computational tools supplement human thinking;
44+
they do not replace it.
45+
</li>
46+
<li>
47+
<i>"make decisions in/about the world"</i>: The purpose of data science is to inform decisions.
48+
Because these decisions depend heavily on the contexts in which they are made, the contexts in which data scientists work are critical.
49+
</li>
50+
</ul>
51+
</p>
52+
53+
<p>
54+
We define the <b>data science process</b> as 5 steps:
55+
<ol>
56+
<li>Identify decision context and data science question(s)</li>
57+
<li>Collect and clean data</li>
58+
<li>Model data</li>
59+
<ol>
60+
<li>Generate explanations and models</li>
61+
<li>Evaluate and interpret explanations and models</li>
62+
</ol>
63+
<li>Make/Inform decisions</li>
64+
<li>Archive work</li>
65+
</ol>
66+
67+
We reiterate that this process is iterative: we may jump backward or move between steps out of order.
68+
We also reiterate that this entire process exists within specific contexts, so data scientists must be critical of their work at each step.
69+
This means (among other things) considering bias in the data, models, and interpretations, as well as ethical and privacy concerns.
70+
</p>
71+
72+
</body>
73+
74+
</html>

images/lines.png

461 KB

index.html

+2-2
@@ -129,8 +129,8 @@ <h2 id="schedule">Schedule</h2>
129 129
Data science is a process
130 130
<ul>
131 131
<li><a href="https://docs.google.com/presentation/d/1u20j2Vc70oeeUd7Cq_6Xm21PrI5H_b79frDZozoaaL0/edit?usp=sharing" target="_blank">Welcome</a> (10 min)</li> <!-- I'm experienced -->
132-
<li>Activity (40 min): <a href="#">Investigating a disease outbreak</a></li>
133-
<li>Discussion (30 min): <a href="#">Data science is a process</a></li>
132+
<li>Activity (40 min): Investigating a disease outbreak</li>
133+
<li>Discussion (30 min): <a href="class/data-science-process.html">Data science is a process</a></li>
134 134
<li>Review this syllabus (20 min)</li>
135 135
</ul>
136 136
Assigned: <a href="homeworks/review-prerequisite.html">Homework 1</a>. Due Tues 10/3.
