Commit 4361d24

added lab 3.2
1 parent 0a65480 commit 4361d24

3 files changed, +65 -2 lines changed

images/bricks.jpg

176 KB

index.html

+4 -2
@@ -209,6 +209,7 @@ <h2 id="schedule">Schedule</h2>
 Improving modeling decisions using Bayes' rule
 <ul>
 <li>Reading: <a href="https://docs.google.com/document/d/1JVacsJzMCG4tp0TTnEGV8Mb8vcc2vgS9z_EQzj9B0sU/edit?usp=sharing" target="_blank">Inferring probability distributions from data</a></li>
+<li>Slides: <a href="https://docs.google.com/a/uw.edu/presentation/d/1_5fLuWt29mOR9ZQ9O_aDoOIgEzcaWONjwn4eVauo1_s/edit?usp=sharing" target="_blank">Bayesian Inference in R</a></li>
 </ul>
 Assigned: <a href="homeworks/bayesian.html">Homework 3</a>. Due Tues 10/17.
 </td>
@@ -223,7 +224,8 @@ <h2 id="schedule">Schedule</h2>
 Bayesian Inference in Action, Data cleaning process
 <ul>
 <li>Reading: (None)</li>
-<!-- <li>Activity (80 min): <a href="activities/rank-qualities.html">Rank qualities</a></li> -->
+<li>Slides: <a href="https://docs.google.com/a/uw.edu/presentation/d/15QseWnGUQg6WA3TdyrcH3DGuqY-x_SVcHxKwWYew844/edit?usp=sharing" target="_blank">More on Bayesian Inference</a></li>
+<li>Slides: <a href="https://docs.google.com/a/uw.edu/presentation/d/1Zpzizr2Qc45umPAw81XQdcwO675YiCSa8ccnB4SuwU4/edit?usp=sharing" target="_blank">Data Cleaning</a></li>
 </ul>
 </td>
 </tr>
@@ -233,7 +235,7 @@ <h2 id="schedule">Schedule</h2>
 <td>
 Applying data cleaning process using Wrangler
 <ul>
-<!-- <li>Activity (50 min): <a href="activities/complexity.html">Great engineers are facile with complexity</a></li> -->
+<li>Activity: <a href="labs/data-cleaning.html">Data cleaning with Wrangler</a></li>
 </ul>
 </td>
 </tr>

labs/data-cleaning.html

+61
@@ -0,0 +1,61 @@
+<!DOCTYPE html>
+<html>
+<head>
+
+<meta name="viewport" content="width=device-width, initial-scale=1">
+
+<!-- Latest compiled and minified CSS -->
+<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/css/bootstrap.min.css" integrity="sha384-BVYiiSIFeK1dGmJRAkycuHAHRg32OmUcww7on3RYdg4Va+PmSTsz/K68vbdEjh4u" crossorigin="anonymous">
+
+<!-- Optional theme -->
+<link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/css/bootstrap-theme.min.css" integrity="sha384-rHyoN1iRsVXV4nD0JutlnGaslCJuC7uwjduW9SVrLvRYooPp2bWYgmgJQIXwl/Sp" crossorigin="anonymous">
+
+<!-- Latest compiled and minified JavaScript -->
+<script src="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.7/js/bootstrap.min.js" integrity="sha384-Tc5IQib027qvyjSMfHjOMaLkfuWVxZxUPnCJA7l2mCWNIpG9mGCD8wGNIcPD7Txa" crossorigin="anonymous"></script>
+
+<link rel="stylesheet" href="../style.css" />
+
+<title>Data Cleaning</title>
+
+</head>
+<body>
+<p><a href="../index.html">Back to Homepage</a></p>
+
+<!-- UPDATE -->
+<img src="../images/bricks.jpg" class="img-responsive" />
+<small>Credit: Takeshi Hirano</small>
+
+<!-- UPDATE -->
+<h1>Data Cleaning with Wrangler</h1>
+<div class="lead">Benji Xie &amp; Greg Nelson</div>
+
+<p>Today you're going to practice using Trifacta Wrangler to clean some data.</p>
+
+<h2>Download Wrangler, Data</h2>
+
+<p>Wrangler is free to download: <a href="https://www.trifacta.com/products/wrangler/" target="_blank">Wrangler download page</a></p>
+
+<h2>Transform obituary data from disease simulation</h2>
+
+<p>Do the following:</p>
+
+<ol>
+<li>Export the <a href="https://goo.gl/GKxUB7" target="_blank">obituary data</a> from Google Sheets (as CSV).</li>
+<li>Import the data into Wrangler.</li>
+<li>Start transforming the data!</li>
+</ol>
+
+<p>Tips for data cleaning with Wrangler:</p>
+
+<ul>
+<li>Try highlighting some part of the data. Wrangler is good at recommending "recipes" based on what you highlighted, and you can modify the recommendations if necessary.</li>
+<li>Split up the problem into sub-steps. Do one step in one column and another step in a second column, then combine the two new columns.</li>
+<li>Consult documentation. Trifacta has great <a href="https://docs.trifacta.com/display/PE/Workflow+Basics" target="_blank">documentation</a>
+and <a href="https://www.trifacta.com/support/articles/topics/125211-online-training/" target="_blank">online training</a> to teach you how to use Wrangler.</li>
+</ul>
+
+<p>Be sure to clean your data with a purpose and goal in mind!</p>
+
+</body>
+
+</html>
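
The lab's tip about splitting a cleaning problem into sub-steps (one step per column, then combine the new columns) can also be tried outside Wrangler. The following is a minimal Python sketch of that idea, not Wrangler's recipe language. The commit does not show the obituary data, so the raw row format, the sample values, and the names `raw_rows`, `extract_name`, `extract_age`, and `clean` are all hypothetical.

```python
import re

# Hypothetical raw values for illustration only; the real obituary
# sheet's layout is not shown in this commit.
raw_rows = [
    "Doe, Jane (age 34)",
    "Smith, John (age 7)",
]


def extract_name(text):
    """Step 1 (its own column): turn 'Last, First (...)' into 'First Last'."""
    last, first = text.split("(")[0].strip().split(", ")
    return f"{first} {last}"


def extract_age(text):
    """Step 2 (its own column): pull the integer age, or None if absent."""
    match = re.search(r"age (\d+)", text)
    return int(match.group(1)) if match else None


def clean(rows):
    """Run each sub-step separately, then combine the intermediate columns."""
    cleaned = []
    for text in rows:
        name = extract_name(text)
        age = extract_age(text)
        # Step 3: combine the two new columns into the final record.
        cleaned.append({"name": name, "age": age, "label": f"{name} ({age})"})
    return cleaned


for record in clean(raw_rows):
    print(record)
```

Keeping each sub-step in its own function mirrors keeping it in its own column: each intermediate result can be inspected before the final combine.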
