-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy path02-cases.Rmd
executable file
·340 lines (211 loc) · 14.1 KB
/
02-cases.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
# The Cases
## John Snow and the Cholera Epidemic
How demarcation helped John Snow figure out that water caused cholera to spread in the 19th century
<div align="center">
![Source: [Wikipedia](https://en.wikipedia.org/wiki/John_Snow#/media/File:John_Snow.jpg)](images/snow1.jpg)
</div>
<font size="5"> The Puzzle </font>
In the mid-19th century, cholera was claiming the lives of thousands in London. But how did the disease spread? In other words, what was the main mode of transmission of cholera?
For an overview of this case, see our [introductory story map](https://uploads.knightlab.com/storymapjs/a0d512bc2bc17977f1029fedead0329a/trying-out/draft.html) and [video](https://youtu.be/lGN8SK1Y1h4).
<div align="center">
![](images/snow2.png)
</div>
<font size="5"> The Research Design </font>
To answer that question, a doctor named John Snow developed a waterborne theory of cholera and then studied the locations where the disease was prevalent as well as those where it was not, along with specific locations where water suppliers varied.
<div align="center">
![](images/snow3.jpg)
</div>
By demarcating cases in this way and testing his theory at both a micro (Soho) and a macro level (South London), Snow was able to gather evidence that was compatible with the waterborne theory of cholera but was harder to account for by the airborne theory.
For more detail on the research designs devised by Snow and his contemporaries, see our [specialized story map](https://docs.google.com/presentation/d/e/2PACX-1vTpkyf_CSeAgD8datvssFfNHHwypUEYIg-tAC-cNaj6Nu_wrqqIOdsT_Y4phZYl8EBJA_OyWvrFhJfV/pub?start=false&loop=false&delayms=3000&slide=id.g9d798a6fd3_0_81) and [video](https://youtu.be/wClwk94IkLY).
<div align="center">
![](images/snow4.png)
</div>
<font size="5"> The Tools </font>
Using cluster analysis and other statistical techniques like conditional plots and averages charts, it is today possible to replicate and illustrate Snow’s analyses with our [GeoDa demo scripts](https://geodacenter.github.io/data-and-lab/data/geoda_scripts_snow.pdf).
<div align="center">
![](images/snow5.png)
</div>
<div align="center">
![](images/snow6.jpg)
</div>
<font size="5"> The Insights </font>
Snow used a natural experiment to find out that cholera cases concentrated in groups of people who relied on specific water supply mechanisms, whereas groups which relied on a different water supply were not affected even if they were located right next to clusters of infections.
<div align="center">
![](images/snow7.jpg)
</div>
<font size="5"> More Information </font>
Access our [data](https://geodacenter.github.io/data-and-lab/snow/) and [documentation](https://geodacenter.github.io/data-and-lab/data/snow_documentation.pdf) to replicate these findings in [GeoDa](https://geodacenter.github.io).
<div align="center">
![](images/snow8.jpg)
</div>
## Sherlock Holmes and the Napoleon Busts
Finding a key common feature of the smashed Napoleon busts allowed the famous detective to solve the mystery of why they were smashed.
<div align="center">
![Source: [The Arthur Conan Doyle Encyclopedia](https://www.arthur-conan-doyle.com/index.php/The_Adventure_of_the_Six_Napoleons)](images/sherlock1.jpg)
</div>
<font size="5"> The Puzzle </font>
In [The Adventures of the Six Napoleons](https://sherlock-holm.es/stories/pdf/a4/1-sided/sixn.pdf), the famous detective Sherlock Holmes faced a mystery: Napoleon busts were being destroyed in private property across the city of London and nobody knew why.
For an overview of this case, see our [story map](https://docs.google.com/presentation/d/e/2PACX-1vS1ADwo9Uj2JQ-jSqV3Yd1u00FUA33m20NcWvh0qW78axsJ-a-hCMmThFmBjNAyMNnhrcBQVeZxkIB9/pub?start=false&loop=false&delayms=3000) and [video](https://www.youtube.com/watch?v=Fu-4NmiuxJI).
<div align="center">
![](images/sherlock2.png)
</div>
<font size="5"> The Research Design </font>
Policemen initially thought that hatred was causing someone to break the busts -- or that they were dealing with a potential vendetta.
<div align="center">
![](images/sherlock3.png)
</div>
According to Sherlock, however, neither of these theories explained why only a specific subset of busts was being smashed. By observing the circumstances of the attacks and discovering that the busts came from the same manufacturer and the same batch, he came up with the theory that there was a hidden object inside one of these busts. He then predicted which bust would be the next to be destroyed, alerted the police and conducted a quasi-natural experiment at the location where he thought that the attack would take place.
<div align="center">
![](images/sherlock4.jpg)
</div>
<font size="5"> The Insights </font>
As it turned out, Sherlock was right and the burglar was caught on site. Basing his approach on demarcation and a quasi-natural experimental setting was successful at solving the mystery.
<div align="center">
![Source: [Pixabay](https://pixabay.com/photos/sherlock-holmes-london-rowdy-5499030/)](images/sherlock5.png)
</div>
## The Immigrant Paradox
We demonstrate that health outcomes in poor immigrant neighborhoods that are better than those in nearby non-immigrant poor neighborhoods do not actually reflect the “Immigrant Paradox” theory but are likely an artifact of a younger immigrant population.
<div align="center">
![](images/immigrant1.png)
</div>
<font size="5"> The Puzzle </font>
Economic hardship has long been associated with worse health outcomes, particularly with more life years lost prematurely. But, in the US, Hispanic immigrants often do better in terms of health even when they face similar socioeconomic issues. Why?
For an overview of this case, see our video [here](https://www.youtube.com/watch?v=sDjuZ1Ak9kQ&t=44s).
<div align="center">
![](images/immigrant2.png)
</div>
<font size="5"> The Research Design </font>
We compare premature mortality outcomes for different demographic groups and use a process of abductive reasoning to explore the plausibility of hypotheses that could explain these outcomes.
<div align="center">
![](images/immigrant3.png)
</div>
<font size="5"> The Tools </font>
We test these hypotheses with scatter plots, parallel coordinate plots, conditional box plots and various types of maps -- and with regression analysis.
<div align="center">
![](images/immigrant4.jpg)
</div>
<font size="5"> The Insights </font>
In the end, while insurance rates for different groups do not seem to explain the difference in health outcomes, younger ages in Hispanic neighborhoods do appear to be the driving factor behind lower premature mortality compared to predominantly White or African-American neighborhoods in Chicago.
<div align="center">
![](images/immigrant5.jpg)
</div>
## Health and Race: A Preliminary Approach
Author: **Atman Mehta** (3rd year student in the College)
A protocol that differentiates groups based on racial majorities provides an initial assessment of potential determinants of health indicators
<div align="center">
![](images/health1.png)
</div>
<font size="5"> The Puzzle </font>
How can we create groups to explain differences in outcomes in the social sciences? The question of the determinants of health in Chicago can be used as an example to illustrate this process.
For an overview of this case, see more from our summer project [here](https://uchicago.box.com/s/ganfzzmzonoaqc1of2kzzkw67pgg7csw).
<div align="center">
![](images/health2.png)
</div>
<font size="5"> The Research Design </font>
Atman first looked at potential plausible explanations and mechanisms, with the main idea being that greater economic hardship means greater unemployment, which means lower insurance and results in premature mortality. He then differentiated demographic groups according to race to test if the variables behaved in the expected ways across tracts with different racial majorities.
<div align="center">
![](images/health3.png)
</div>
<font size="5"> The Tools </font>
Several hypotheses were tested by using tools available in GeoDa such as parallel coordinate plots and scatter plots as well as co-location, cluster and LISA maps.
<div align="center">
![](images/health4.png)
</div>
<font size="5"> The Insights </font>
The resulting protocol helps in structuring the problem at hand in a way that is easy to test. Substantially, dissimilarities in indicators such as premature mortality rates for different racial groups do not seem to be explained by unemployment or violent crime. This puzzle is explored in more detail in the case of The Immigrant Paradox, also available on this website.
<div align="center">
![](images/health5.png)
</div>
## Turnout and Elections: A Spatial Perspective
Author: **R.E. Stern** (1st year student in the College)
Changes in turnout in presidential elections from 2012 to 2016, which could have had an impact on their outcome, appear to be related to demographics.
<div align="center">
![](images/elections1.png)
</div>
<font size="5"> The Puzzle </font>
What are the factors that explain changes in turnout in the 2016 presidential election compared to previous elections?
For an overview of this case, see more from our summer project [here](https://uchicago.box.com/s/5wu1l2zz8tbwlxm9frvruuicx6ogtw2b).
<div align="center">
![](images/elections2.png)
</div>
<font size="5"> The Research Design </font>
R.E.’s protocol highlights the importance of an iterative discovery process, i.e. the continuous testing and re-testing of hypotheses to develop explanatory insights. The hypothesis that non-college white voters turned out to vote in higher numbers in 2016 leads to expectations with an estimated probability that are then tested.
<div align="center">
![](images/elections3.jpg)
</div>
<font size="5"> The Tools </font>
First, exploratory spatial data analysis tools such as box maps give an overview of the data. Then, GeoDa’s cluster maps such as K-Means or Local Moran’s I allow users to re-assess the probabilities they assign to each hypothesis.
<div align="center">
![](images/elections4.png)
</div>
<font size="5"> The Insights </font>
Counties with more white residents without college degrees did see slightly higher turnout in 2016 compared to 2012 -- more importantly, though, counties with more non-white residents and no college degrees saw less turnout.
<div align="center">
![](images/elections5.jpg)
</div>
<font size="5"> More Information </font>
Access our [data](https://uchicago.box.com/s/m7lf4ldukuh3b6zle3cqjqd437vb5yqm) to replicate these findings in [GeoDa](https://geodacenter.github.io).
<div align="center">
![](images/elections6.jpg)
</div>
## Asthma and Pollution
Authors: **Mark Baker** and **Jizhou Wang** (3rd year students in the College)
Proximity to potential bus pollution is higher in areas with more residents who are economically vulnerable and African-American, groups which are also at higher asthma risk.
<div align="center">
![](images/asthma1.jpg)
</div>
<font size="5"> The Puzzle </font>
Are groups at higher risk of asthma also more likely to be exposed to bus pollution?
For an overview of this case, see [insights](https://uchicago.box.com/s/zmoyz07zwu2opnuovfrogge6ckkwjtpq) and [methods](https://uchicago.box.com/s/1zo055jgec3sxrygvr0unrtsr8ptvb8v) from our summer project.
<div align="center">
![](images/asthma2.jpg)
</div>
<font size="5"> The Research Design </font>
Path diagrams outline what factors could be driving spatial concentrations of asthma, including pollution from traffic, lower housing values and older homes. Mechanisms for hypotheses are developed and tested to assess their strength based on a classification by Mark Baker.
<div align="center">
![](images/asthma3.jpg)
</div>
<font size="5"> The Tools </font>
Mark Baker and Jizhou Wang assess the strength of different hypotheses with boxplots, scatterplots, parallel coordinate plots, cluster analyses, averages charts and regressions in GeoDa.
<div align="center">
![](images/asthma4.png)
</div>
<font size="5"> The Insights </font>
Economically vulnerable and African-American neighborhoods that have a higher asthma risk are also more likely to be exposed to traffic pollution.
<div align="center">
![](images/asthma5.jpg)
</div>
## Racial Diversity: What Built Environment Features Distinguish Racially Diverse from Non-Diverse Areas?
Author: **Felix Farb** (Highschool Junior student)
Planning-related factors like highway dividers and land use diversity, as well as out-migration of African-American residents, distinguish racially diverse from non-diverse areas in Chicago.
<div align="center">
![](images/racialdiversity1.png)
</div>
<font size="5"> The Puzzle </font>
Do racially diverse areas in the city of Chicago differ in terms of planning-related factors from non-diverse areas?
For an overview of this case, see [insights](https://uchicago.box.com/s/pdku4e9rszvhtfnl1mv9o6sdv0ul1j3b) and [methods](https://uchicago.box.com/s/rllzauyb3ew80htdq92i38o0re3q34v0) from our summer project.
<div align="center">
![](images/racialdiversity2.jpg)
</div>
<font size="5"> The Research Design </font>
An exploratory spatial data analysis identifies areas in Chicago that are racially diverse and compares potential planning-related drivers such as highway dividers and diverse land uses to gentrification. Felix’s protocol demonstrates how hypotheses can be made more and more specific to make them testable.
<div align="center">
![](images/racialdiversity3.png)
</div>
<font size="5"> The Tools </font>
Different maps, distance buffers, averages charts and K-Means clusters are used to assess the relevance of the results to each hypothesis.
<div align="center">
![](images/racialdiversity4.png)
</div>
<font size="5"> The Insights </font>
This initial exploratory analysis suggests that racially diverse neighborhoods in Chicago are more likely to be physically divided from other neighborhoods through highways and have more diverse land uses. Some diverse areas are also characterized by disproportionate out-migration of African-American residents.
<div align="center">
![](images/racialdiversity5.png)
</div>
<font size="5"> More Information </font>
Access our [data](https://uchicago.box.com/s/5rmjez5pxh7odornc9xzmg3x0wmqkmjg) to replicate these findings in [GeoDa](https://geodacenter.github.io).
<div align="center">
![](images/racialdiversity6.png)
</div>
# References {-}
'`