-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathmanuscript.Rmd
1657 lines (1396 loc) · 89.7 KB
/
manuscript.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
---
title : "Online Social Cohesion Reflects Real-World Group Action in Syria during the Arab Spring"
shorttitle : "Arab Spring"
author:
- name : "Megan Chiovaro"
affiliation : "1, 2"
corresponding : yes # Define only one corresponding author
address : "406 Babbidge Road, Unit 1020, Storrs, CT 06269"
email : "[email protected]"
- name : "Leah C. Windsor"
affiliation : "3,*"
- name : "Alistair Windsor"
affiliation : "3,4"
- name : "Alexandra Paxton"
affiliation : "1,2,*"
affiliation:
- id : "1"
institution : "Center for the Ecological Study of Perception and Action, University of Connecticut"
- id : "2"
institution : "Department of Psychological Sciences, University of Connecticut"
- id : "3"
institution : "Institute for Intelligent Systems, University of Memphis"
- id : "4"
institution : "Department of Mathematical Sciences, University of Memphis"
- id : "*"
institution : "Equal author contributions"
abstract: |
In recent years, political activists have taken to social media platforms to rapidly reach broad audiences. Despite the prevalence of micro-blogging in these sociopolitical movements, the degree to which virtual mobilization reflects or drives real-world movements is unclear. Here, we explore the dynamics of real-world events and Twitter social cohesion in Syria during the Arab Spring. Using the nonlinear methods cross-recurrence quantification analysis and windowed cross-recurrence quantification analysis, we investigate if frequency of events of different intensities are coupled with social cohesion found in Syrian tweets. Results indicate that online social cohesion is coupled with the counts of all and negative events each day but shows a more nuanced connection to positive events, suggesting that online mobilization does reflect real-world movements. We conclude with a discussion of implications and applications of nonlinear methods in political science research.
<!-- https://tinyurl.com/ybremelq -->
keywords : "social cohesion, recurrence quantification analysis, complex systems, Twitter, Arab Spring"
wordcount : "10981"
bibliography : ["references.bib"]
floatsintext : no
figurelist : no
tablelist : no
footnotelist : no
linenumbers : no
mask : no
draft : no
appendix :
documentclass : "apa6"
classoption : "man"
output : papaja::apa6_word
#output : papaja::apa6_pdf
---
```{r setup, include = FALSE}
# clear the workspace
rm(list=ls())
# install and load required packages
source('./scripts/required_packages.r')
source('./scripts/libraries_and_functions.r')
# turn on caching
knitr::opts_chunk$set(cache=TRUE, autodep=TRUE, cache.lazy=FALSE)
# create subdirectories
dir.create("./data/raw") # load three data files here
dir.create("./data/formatted/primary", recursive = TRUE)
dir.create("./data/formatted/secondary/icews", recursive = TRUE)
dir.create("./data/formatted/secondary/phoenix", recursive = TRUE)
dir.create("./results/primary/crqa", recursive = TRUE)
dir.create("./results/primary/windowed-crqa", recursive = TRUE)
dir.create("./results/secondary/icews/crqa", recursive = TRUE)
dir.create("./results/secondary/icews/windowed-crqa", recursive = TRUE)
dir.create("./results/secondary/phoenix/crqa", recursive = TRUE)
dir.create("./results/secondary/phoenix/windowed-crqa", recursive = TRUE)
```
```{r data-prep, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE, results='hide'}
# format the data
source('./scripts/primary-analyses/01_formatting_data.R')
num_target <- sum(ICEWS_formatted_target$all_events_target)
num_source_target <- sum(ICEWS_formatted_source_target$all_events_source_target)
```
Before social media, people coordinated protests using phone tree calling sheets
and mimeographed flyers, through in-person meetings of social organizations and
churches, and by word of mouth. They gathered together and hand-made protest
signs that reflected their agendas. As McAdam found, people were more likely to
join movements when they had a personal connection [@mcadam1986recruitment;
@mcadam1993specifying]: Historically, strong ties have been strong predictors of
involvement in high-risk protest activity.
The cohesiveness and strength of social movements before the Internet were
operationalized by the number of people present and through the political events
that precipitated and resulted from social gatherings *en masse*. Cohesive
social movements were reflected in their cohesive placards and posters. For
example, the pervasive “I am a man” posters during the 1968 Memphis sanitation
workers' strike reflected their internal cohesion and unified, goal-oriented
movement [@Windsor2020].
However, the rise of the Internet and its decentralized online networks have
upended theories about recruitment to---and participation in---social movements.
Strong interpersonal ties have yielded to weak impersonal ties that strongly
predict involvement in social movements [@steinert2017spontaneous]. Measurements
of social movements have similarly evolved from ethnographic interviews to “big
data” analyses of social media.
Amid the cacophony of messages during political upheaval---such as the Arab
Spring of the early 2010s---consistent themes emerge through hashtags, retweets,
and the repeated mentions of concepts such as regime, demonstration, and human
rights. The cohesiveness of these themes (i.e., the degree to which participants
in the movement are metaphorically singing in one voice) reflects the real-world
events happening contemporaneously with collective action in social movements or
civil conflicts.
In the present research, we investigate the relationship between linguistic
cohesion and social cohesion in times of social conflict and unrest. We tackle a
question related to the one posed by Zeitzoff [-@zeitzoff2017social]: How does
social media reflect on-the-ground realities during times of social conflict? To
do so, we examine patterns of linguistic cohesion on Twitter and their
connection to real-world events during the Arab Spring uprising in Syria between
March and June 2012.
Disentangling the complicated relationship between mass online
mobilization---and social mobilization in person---continues to be relevant,
even as our modes of communication change. Accordingly, the political events
around the Arab Spring are still being understood. As Zeitzoff
[-@zeitzoff2017social] recognizes, social media is changing conflict, but the
mechanisms and relationships by which it does so are still largely unknown.
## Syria in Context
The decades preceding the Arab Spring in Syria were marked by repression and
fear. In 1971, Hafez al-Assad overthrew the previous leader and remained in
power for thirty years, presiding during regional contentiousness and
cooperation until his death in 2000. His son, Bashar al-Assad, continued the
regime of authoritarian leadership after assuming his office. The Assad lineage
hails from a minority Islamic sect (the Alawites) in a multi-religious and
multi-ethnic country comprising Christians, Sunni and Shia Muslims, Kurds, and
Palestinian refugees.
In December 2010, a Tunisian man named Mohamed Bouazizi died after setting
himself on fire to protest the authoritarian government’s unjustifiable
confiscation of his vendor cart by municipal officials. His actions served as a
catalyst for the Tunisian Revolution, which rapidly spread to surrounding
countries also subject to unjust authoritarian rule, including Egypt, Libya, and
Syria.
Following a wave of democratization efforts across the Middle East and North
Africa, modest Syrian protests began in March 2011. The regime responded harshly
with imprisonments, detentions, censorship, and military operations as the
resistance to Assad's regime spread. The fighting escalated dramatically in
March 2012, after increased unrest in the western city of Homs. International
actors---including the Red Cross/Red Crescent Society, the United Nations
General Assembly, the U.N. Security Council, and individual world
leaders---worked toward cease-fires, negotiations, and concessions without much
success. While many high-level military and diplomatic officials defected,
numerous loyal forces continued the regime's repression, and the Syrian people
turned to social media to coordinate their activism and communicate to the
world. Between March and June 2012, millions of tweets were sent within Syria,
revealing the details of the conflict suppressed by the government. A full
account of the history of the Arab Spring is outside of the scope of the current
article; more information can be found in [@brownlee2013syria] and
[@lynch2016new].
Syria is a multi-ethnic, multi-lingual, and multi-religious state. Kurdish
people make up between 10-15% of the population, Alawites between 8-15%, and
Christians about 10%. There are also much smaller sects (e.g., Ismailis, Druze),
but Sunni Muslims comprise the vast majority of society. Most people speak
Arabic as their primary language. Most policy reports addressing the current
conflict report on the cohesion of the ruling Alawite military elite, rather
than on the cohesiveness of the opposition movement. However, the Sunni
majority's disaffection with the Syrian government and their lack of political
representation provide sufficient grievance and collective group identification
to make them a cohesive force.
## Social Media and Social Movements
Massive online social cohesion can manifest in real-world social behavior.
Social media platforms enable online social cohesion by offering distributed
information-gathering and real-time information dissemination. Given its
accessibility and proficiency as an information exchange tool, Twitter has
provided social and political activists an opportunity to complement
[@greijdanus2020psychology] and not hinder participation in
[@harlow2012collective; @milovsevic2017civic] real-world social mobilization.
During the 2012 Italian protests for global economic change, tweeting was
determined to be more effective locally in discussing real-world, real-time
events than traditional mainstream media platforms [@vicari2013public]. As
people increasingly rely on social media for local news and information, we must
understand exactly how these micro-blogs connect to and reflect real-world
action.
Along with Facebook, Twitter has been credited as essential to the Arab
Spring---both for protesters and for the government's response to them
[@comninos2011twitter]. The Arab Spring has been deeply examined within the
political science literature [e.g., @lutterbeck2013arab;
@koehler2017political], but to our knowledge, the direct *coupling* between
Syrian Twitter cohesion and international action has not been investigated
(though hashtag usage has been linked to next-day protests;
@steinert2015online). In fact, some have been skeptical about the effect of
social media on collective action at all [@seay2014slacktivism].
This skepticism is summarized in a quote by Lisa @anderson2011demystifying:
\setlength{\leftskip}{1cm}
\noindent The important story about the 2011 Arab revolts in Tunisia, Egypt, and
Libya is not how the globalization of the norms of civic engagement shaped the
protesters' aspirations. Nor is it about how activists used technology to share
ideas and tactics. Instead, the critical issue is how and why these ambitions
and techniques resonated in their various local contexts. (p. 2)
\setlength{\leftskip}{0pt}
\noindent Others have similarly expressed hesitation about the role of online
organizing. Some have argued that real-life strong ties outweigh online weak
ties and that the power of the strong ties through in-person contact is critical
to recruiting people for higher-risk activities [@gladwell2010revolution;
@mcadam1986recruitment].
Interestingly, these views may help make the case for bridging social media and
social mobilization. Such views suggest the strength of online social ties is
inextricably linked to the *context* and *ambitions* of the actors. Moreover,
research suggests that weak ties still retain considerable political power:
While strong ties tend to bind smaller groups, weak ties tend to bind groups
across distances [@granovetter1977strength].
Social media generates critical information about events, including whether and
how a demonstration will take place, how many people participate, and how those
in power (e.g., state, military) respond. Similarly, these media platforms alert
us to more global shifts within a conflict, such as when fighting moves to a new
city or region, the number of casualties and wounded, and the types of weaponry
used by parties [@zeitzoff2017social]. In monitoring social media channels,
government forces may respond to the perception that citizens are gaining
strength and sympathy. Governments have been shown to be quite tolerant of
dissent so long as it does not metamorphose into collective action
[@king2013censorship].
Protestors and social activists communicate with each other with increased
transparency due to the ubiquitous availability of social media like Twitter.
Activists are able to access these tools via mobile phones and computers with
Internet connectivity, providing a stream of real-time shared information.
Citizens can share the location of events like protests and military incursions,
the degree of state repression, and the number of casualties. In turn, the
government---interested in remaining in power---monitors social media platforms
to ascertain emerging threats [@comninos2011twitter], such as where new groups
are forming, where existing groups are strengthening, where gatherings are held,
and who the leaders are.
With few exceptions, leader Bashar al-Assad chose to keep social media platforms
up and running. Gohdes [-@gohdes2015pulling] notes that regimes can attempt to
thwart the growth of social protest online and in the streets by shutting down
the Internet. During these times, military repression increases, but government
information about the citizens decreases. As a result, regimes may elect to
maintain Internet connectivity to monitor and surveil their citizens.
At the same time, the insulation of online social platforms allows researchers
to detail real-world social dynamics at unprecedented resolution and scale
[@paxton2017finding]. The digital traces left by individuals and collectively as
groups are regarded as a valuable source of information that can offer powerful
insights into states of social disequilibrium, uprisings, and escalations into
civil wars. By examining these conversations, researchers can study the degree
of cohesion between speakers and the alignment between that cohesion and their
collective political behavior, just as the degree of cohesion among real-world
protestors in past movements has been captured in their shared sign messaging
[@Windsor2020].
## The Role of Cohesion in Social Movements and Civil Wars
People join social and political resistance movements because they feel a sense
of connection to the cause. Given that people are generally risk-averse,
overcoming inertia requires a strong connection to the group identity or a
strong aversion to an individual, event, or political injustice
[@goldstone1980theories; @goldstone2011understanding]. Political resistance
offers participants the opportunity to “lose themselves” and relinquish their
individual identity in favor of identification with a larger group, contributing
to a cohesive group ready and mobilized for action [@Hoffer2011true].
The type of collective contentious behavior that manifests in society is a
function of the available political opportunity structures. The political
opportunity structure is “comprised of *(sic)* specific configurations of
resources, institutional arrangements and historical precedents for social
mobilization, which facilitate the development of protest movements in some
instances and constrain them in others” [@kitschelt1986political, p. 58].
Emergent strategies and tactics in contentious politics are a function of what
the system makes possible. This includes the number of people and their
disposition (i.e. nonviolent or violent), as well as their internal structure.
The more cohesive a group is, the greater threat they pose to the establishment.
Cohesion is a familiar concept in the study of civil conflicts, as it pertains
to group affiliation and the ability to overcome collective action problems.
Individuals must act in concert---that is, cohesively---to accomplish the task
of challenging state authority [@olson1965logic;
@eisinger1973conditions; @tilly2004contentious; @francisco1995relationship].
Viewing rebel recruitment during civil conflicts as a function of internal
cohesion, rebel movement success is influenced by the degree of group cohesion,
which itself is a function of geographic distance, ideology, and ethnicity
[@gates2002recruitment]. Within Syria, multiple rebel groups exist, sourcing
fighters locally (from the disaffected Syrian population) and internationally
(from transnational terrorist organizations). Groups compete intensely with one
another for recruitment in the rebel movement and often punish defectors for
nonparticipation [@gates2002recruitment].
Another factor influencing social cohesion during social movements and civil
wars is ethnic identity and the degree of ethnic conflict associated with the
movement. Greater degrees of ethnic conflict increase the salience of ethnic
identity, leading people to identify more strongly with their own ethnic
identity [@sambanis2009s]. As ethnic conflict intensifies, groups seek to become
more quintessentially like the typical member of their ethnic group. This can
manifest in a number of ways, including in behavior on social media. However,
this may also include a general identification across ethnic groups in a
membership of an opposition to an oppressive leader.
The civil war literature has also explored the role of social cohesion. Social
homogeneity increases the likelihood of social conflict, especially when the
society is polarized [@collier2004greed]; this is partly due to nationalist
groups' strong identification with a collective identity [@cederman2007beyond]
and their social solidarity [@sambanis2009s]. These same processes can be
similarly important for rebels [@anyanwu2002economic]. Referencing Gurr's
[-@gurr2000peoples] work on minorities at risk, Regan and Norton
[-@regan2005greed] note that
\setlength{\leftskip}{1cm}
\noindent a religious or linguistic minority might suffer disproportionately in
a given society, and this form of aggrievement can lead to unrest across the
social lines that distinguish the minority group. Therefore, while collective
cohesion is a function of targeted maldistributions, it is individual penury
that allows for this collective mobilization. (p. 323)
\setlength{\leftskip}{0pt}
The body of previous work concurs that group cohesion matters to group
mobilization. We build on this by providing a quantitative approach to
understanding cohesion from multiple levels and by incorporating analytic tools
that allow us to investigate the interconnectedness of those levels.
## Cohesion: A Multi-Leveled, Multi-Layered Concept
We contextualize linguistic cohesion with social mobilization and response to
the relative intensity of the conflict. Cohesion is a multi-layered concept
referring to the connectedness of speakers, actions, and societies. It reflects
the continuity of language and expression and shows the connectivity of
participants in a social movement. To address the relationship between language
and discourse and social movements, we adopt an interdisciplinary approach that
incorporates linguistics, psychology, and political science using nonlinear
dynamical systems methodology to disentangle the relationship between language
and action.
### Lexical and Interlocutor Cohesion
Just as real-world social cohesion can be quantified, so can online cohesion.
One kind of cohesion is the similarity in language use on social media. This
implicit or explicit similarity can indicate shared beliefs, thoughts, and
feelings, as has been studied in research on language style in psycholinguistics
and social psychology [@gonzales2010language;
@tausczik2010psychological;
@brennan1996conceptual]. Although previous work on language style has often
focused on individuals and small groups, this can be scaled up to examine
massive social-level changes [@michel2011quantitative; @fusaroli2015timescales].
The first level of cohesion that we will consider are the words within social
media posts. In Fig \@ref(fig:cohesion-in-tweets), we see four sequential
tweets of four words each. Each sequential tweet contains some content included
in the previous tweets. The linguistic construct of content word overlap
measures the relationship across units of meaning---here, tweets---where
identical and semantically similar words are repeated [@graesser2004coh]. These
repeated lexical items constitute a theme; on Twitter, this might be a
particular hashtag, a name, a location, or another nominal category. This
localized cohesion is referential; words are repeated and co-referred across
tweets, creating a sense of continuity and cohesiveness.
[Fig \@ref(fig:cohesion-in-tweets) about here]
Typically, these analyses focus on *content words* (e.g., nouns, adjectives,
verbs), which convey more meaningful semantic information than *function words*
(e.g., articles, conjunctions). While function words can provide important
information about complexity and language structure
[@tausczik2010psychological], content words have more specialized meaning and
are therefore more useful when researchers want to understand *what* people are
discussing, rather than *how* they are discussing it [@graesser2004coh;
@mcnamara2014automated].
In estimating cohesion in Twitter posts, tweets that share content words yield
higher cohesion values, even among speakers with opposing views on the subject
matter. Repetition of terms and concepts is captured by reiteration, which is a
type of lexical cohesion in which a subsequent item repeats a previous item, or
refers to it through synonymous language [@halliday1976cohesion]. We include
retweets (i.e., reposts or forwards of previous messages) because they are a
form of reiteration that increases message cohesion. They indicate that
individuals are “singing to the same tune” by amplifying the original message.
Since retweets replicate verbatim another tweet (sometimes with the addition of
commentary or supplementary messages), these tweets would also have higher
cohesion.
```{r cohesion-in-tweets, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE, fig.cap = "A visual representation of cohesion in tweets. The shared content words--either identical or semantically similar--constitute a theme around which cohesion is occurring.", out.width='100%'}
# call the image
knitr::include_graphics("./figs/cohesion-in-tweets.png")
```
We also include hashtags in our analyses. Hashtags are a way of identifying
similar topics (e.g., people, events, places, themes) across users and across
tweets by typing a string of characters preceded by a “#” symbol. Users may
choose to include as many hashtags (up to the total character limit of the
tweet) as they would like, and they may choose to create their own or adopt
hashtags from other users or groups. The use of the same hashtags would increase
cohesion; however, because hashtags are often included or embedded within longer
messages, simply using the same hashtags would not be a guarantee of higher
cohesion if the rest of the words in the tweets do not overlap.
The use of similar language can provide insight into *interlocutor cohesion*, or
the degree to which people are talking about similar things. This can include
people who are talking about similar things from radically perspectives, as well
as those who are talking about something from the same perspective. Previous
research from political science has investigated these ideas [@Windsor2020]: The
“Occupy Wall Street” movement has been held up as an example of a movement with
low interlocutor and linguistic cohesion because of its wide range of signs and
slogans, while the “I am a man” protest (during the 1968 sanitation workers'
strike in Memphis, TN) or the “Black Lives Matter” placards (following the
uprising in Ferguson, MO) have high lexical and interlocutor cohesion
[@edgar2018struggle].
Fed by lexical cohesion, interlocutor cohesion ebbs and flows, contextualized by
changing real world events. Interlocutor cohesion may increase preceding an
in-person social mobilization event, alerting a repressive government that
collective mobilization is happening. It may also respond to exogenous events
such as spontaneous, unprovoked militarized conflict.
Given the importance of lexical and interlocutor cohesion for messaging in
political movements, lexical cohesion may be important trace data for
researchers interested in leveraging social media data to analyze social
movements. Similar to large-scale studies of sociocultural movements [sometimes
referred to as *culturomics*; e.g., @michel2011quantitative], political
scientists can track the ways in which conversations on social media coalesce or
diffuse around similar content words. In this way, it matters relatively little
whether the interlocutors have precisely the same views: The point is to
understand whether the society-level conversation is converging on similar
themes.
### Social and Political Cohesion
Real-world collective action is known to bring about a sense of togetherness,
also known as “collective effervescence” [@durkheim1995elementary]. This feeling
of being a part of something larger than oneself leads individuals to act in
ways that would not have on their own. Being overcome with this feeling can lead
to violence [@collins2009violence; @spaaij2014sports]. Conversely, increased
exposure to violence can result in positive social cohesion. @gilligan2014civil
found that exposure to fatal wartime violence resulted in increased prosocial
motivations, including cooperative behaviors, feelings of trust, and overall
community-level social cohesion.
At the same time, behavioral similarity in certain contexts has been connected
to important psychological and social effects. For example, shared language is
associated with rapport [both in face-to-face and computer-mediated
conversations; @niederhoffer2002linguistic; @riordan2011evidence] and can even
be connected to better performance on shared tasks [as long as the language is
relevant to the task at hand; @fusaroli2012coming]. Shared movement is
associated with greater group bonding and greater willingness to contribute to
the group [@wiltermuth2009synchrony], and shared identity leads to a greater
willingness to adopt the group's goals and motivations [@walton2012mere].
We expect to see increased linguistic cohesion among people participating in
online discussions about an idea, event, individual, or movement. While the
overall frequency of social media messages related to the event may increase as
a result of the event, their linguistic cohesion helps us understand if denizens
are using the platforms in a coordinated way that amplifies their messages or
goals and creates a virtual protest space. Given that Syrian protesters
coordinated their activity through Twitter by discussing their own events (e.g.,
planned protests, active demonstrations) and their reactions to events happening
around them (e.g., government reactions, international action or inaction), the
cohesion in social media should correlate with the social cohesion of the group
[@zeitzoff2011using].
Political and social cohesion in the world can be modeled using event data which
encodes who does what to whom on a given day. The CAMEO (Conflict and Mediation
Event Observations) coding scheme encodes a set of activities with numerical
values, rescaled to reflect a conflict-cooperation scale [@schrodt2006twenty;
@goldstein1992conflict]. These event codes include a range of behaviors of
varying intensities and valences, from making public statements and appeals to
demanding, threatening, protesting, and using unconventional mass violence.
Social cohesion can have real and wide-ranging impacts on a political movement
and the people in it. A socially cohesive group is inherently more threatening
to the existing regime because it reflects momentum in collective opposition and
the professionalization of alternate leadership. Global insurgency movements
exhibit similar patterns in their social structure [@bohorquez2009common].
Rather than patchwork messages via Twitter, emergent or intentional cohesive
online messaging---using similar language, paralleling in-person messaging
[@Windsor2020]---demonstrates increased organizational structure, inciting a
government response.
Thus, lexical and interpersonal cohesion are hypothesized to be tied to social
and political cohesion. Shared online language may lead to the development or
expression of shared identity [@wiltermuth2009synchrony]. This, in turn, may
drive individuals' propensity to engage in coordinated behavior for the group
[@wiltermuth2009synchrony; @walton2012mere]. Taken together, these processes may
increase movement participants' weak and strong ties [@granovetter1977strength;
@gladwell2010revolution;
@mcadam1986recruitment], building the movement by expanding and deepening
heterogenous ties. Therefore, during social conflict, the linguistic cohesion of
the online conversation should correspond to the degree of social cohesion on
the ground.
Steinert-Threlkeld [-@steinert2017spontaneous] modeled a similar relationship
between social media and social mobilization alongside event data. That
groundbreaking work demonstrated that participants on the fringe of a social
movement contribute substantially to the overall level of protest. Here, in
addition to providing new methods to this area, we build on this work by
demonstrating how sociolinguistic cohesion evolves in tandem with negative and
positive exogenous events.
### Sociolinguistic Cohesion: Connectedness of Speakers, Ideas, and Events
We tie the constructs of lexical, interlocutor, and social cohesion together in
a theoretical framework called *sociolinguistic cohesion*, as shown in Fig
\@ref(fig:theoretical-framework). Previous values of linguistic cohesion should
influence future values of linguistic cohesion and future actions in the real
world space. Previous events influence future events and future linguistic
cohesion in social media.
[Fig \@ref(fig:theoretical-framework) about here]
Sociolinguistic cohesion is not a static concept. Just as social movements
experience increased participation and setbacks---indicating dynamic
relationships among participants and between participants and the sociopolitical
environment---sociolinguistic cohesion is also dynamic, reflecting the changing
states of the political sphere. Other scholars have noted the diffusion of
messages that reach critical mass, allowing for “behaviors, beliefs, and
preferences” to spread within a population [@barash2012critical]. Cohesion
changes over time in response to external events and the internal dynamics of
the social movement, even if the social interaction occurs entirely online
[@centola2007complex].
Lexical cohesion, then, is a useful indicator of social cohesion in political
crisis. Lexical cohesion happens at the word level. Interlocutor cohesion
happens between speakers. Political cohesion happens at the social level during
moments of political crisis, like civil wars and social movements.
Sociolinguistic cohesion comprises lexical, interlocutor, and sociopolitical
events, in which the words and their context reveal the disposition of socially
connected linguistic communities. It accounts for the cohesion between multiple
speakers in an environment as their language becomes more similar.
Sociolinguistic cohesion also addresses how groups involved in political
rebellion converge to mount sustained resistance to state authority.
Related previous work has also explored diffusion of hashtags in viral contagion
in Nigeria, a phenomenon similar to what we identify as sociolinguistic cohesion
[@fink2016complex]. The prevalence and rate of particular hashtags contributed
to the overall sociolinguistic cohesion. The more that people reuse hashtags or
common phrases, the greater the sociolinguistic cohesion when this phenomenon
occurs during socially significant times like protests [@state2015diffusion].
Combining these separate threads, it is reasonable to expect that real-world
actions and online social interactions would be enmeshed, driving and being
driven by one another. From a dynamical systems perspective, online social
cohesion and in-person action can be considered two complementary components of
a single system--constantly constrained by, interacting with, and influencing
one another [@mahner1997foundations]. Investigation of the coupling of
real-world mobilization and social cohesion could provide new insight into the
power of social media during mobilization and the ways in which individuals
utilize it to bring about, or respond to, social change. Increasingly, social
media platforms are being utilized during social mobilization
[@mcadam2003dynamics;
@koopmans2004movements], but the relationship between virtual and real-world
mobilization is still largely unknown. The present work aims to understand how
physical and virtual movements coexist and interact from a dynamical systems
perspective.
```{r theoretical-framework, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE, fig.cap = "The theoretical framework for sociolinguistic cohesion. Real-world events and linguistic cohesion continually interact and influence one another throughout time.", out.width='100%'}
# call the image
knitr::include_graphics("./figs/theoretical-framework.png")
```
## Using Nonlinear Methods to Capture Complex Relationships
Given the hypothesized interrelationship between the real-world and online
actions---and the notorious messiness of naturally occurring data---it is
critical to find appropriate analysis tools that can identify the complex
interconnections. Traditional linear analyses have led to many notable
discoveries in the domains of psychology and political science, but time series
data pose problems that are not easily overcome with these methods. Events have
historically been conceptualized as linear chains of smaller, sequential events
with one occurrence leading to the next. Certain kinds of data are relatively
well-suited to these analyses, but messy, complex, variable data from real-world
events can pose challenges, from overpowered samples to violation of underlying
statistical assumptions [@paxton2017finding].
At the same time, the foundational theoretical assumptions of a linear world
have been increasingly replaced by ideas from dynamical systems theory. Rather
than a chain of events, time series data can be thought of as nested
occurrences, where mutually constrained parts (e.g., citizens, governments,
real-world events, social media platforms) are constantly interacting, leading
to more global behaviors that are larger than the sum of the parts
[@turvey2018lectures]. When analyzed as dynamical systems, many phenomena
exhibit new nonlinear patterns that were formerly overlooked. Nonlinear methods
provide means for capturing the rich temporal dynamics and variability that
unfold in time series data.
Although a number of other nonlinear methods exist, the primary nonlinear
methods used in the current work are cross-recurrence quantification analysis
[CRQA;
@zbilut1998detecting] and its offshoot, windowed cross-recurrence quantification
analysis [WCRQA; @webber2005recurrence]. CRQA has successfully been used to gain
a better understanding of many complex human phenomena, including a variety of
human social interactions [for a review, see @fusaroli2014analyzing]. Much as it
has for psychology, we argue that the use of nonlinear methods in the
investigation of political science phenomena will lead to new insights and the
development of more robust theories.
## The Present Work
Social media has become a primary platform for social movements and activism
across the world, but the extent to which virtual activism reflects and drives
real-world mobilization is still largely unknown. In the current work, we
conceptualize the physical and virtual movements as a dynamical system, both
constantly interacting with and influencing the other. We aim to study this by
analyzing the real-world events and online social cohesion during nearly three
months of the Arab Spring. Both real- and virtual-world collective coordination
were powerful tools in the events of the Arab Spring and are undeniably linked.
This study aims to explore the dynamics of Twitter users in Syria, who used the
social media platform to raise awareness about the Arab Spring and the
relationship of those messages to mass mobilization in the real world.
Using cross-recurrence quantification analysis [CRQA; @zbilut1998detecting] and
windowed cross-recurrence quantification analysis [WCRQA;
@webber2005recurrence], we analyzed the co-evolution of social cohesion online
and real-world events. Specifically, using heterogeneous data from March 30,
2012, to June 15, 2012, we compared the social cohesion of Syrian tweets with
the counts of all, positive, and negative real-world events derived from
international event data [i.e., the Integrated Crisis Early Warning System;
@DVN/28075_2015].
We hypothesized that the count of all events and the count of negative events
would show high levels of determinism, as this time period was marked by daily
violence which often leads to increased salience. We also hypothesized that the
count of positive events would not show patterns of determinism, as these events
were likely overshadowed by the salient negative daily occurrences. After
testing these hypotheses, we then conducted exploratory analyses to further
examine the patterns identified with CRQA. Using WCRQA, we sought to identify
shifts in the dynamics of the systems that could then be linked to real-world
shifts in conflict frequency. Through these analyses, our two primary goals in
the current work were to uncover (1) how different intensities during times of
turmoil may enhance or deter social cohesion in social media communication and
(2) how social mobilization may be evident in the dynamic coupling of cohesion
and real-world events.
An additional focus of the current work is to introduce cross-recurrence
quantification analysis---a longstanding method from physics that has since
become influential in a variety of other fields---to political science. We do so
by providing detailed descriptions of the methods and providing clear directions
about interpretation and significance testing even with relatively small-*n* or
case studies. After demonstrating these methods' utility with this dataset, we
close the paper with specific suggestions to political scientists interested in
these methods.
This work is a reproducible manuscript. Code and analyses can be found at:
www.github.com/mchiovaro/arab-spring.
# Method
## Materials
### Tweet Corpus
```{r corpus_stats, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE}
tweet_stats <- read.csv("./data/formatted/primary/stats.csv")
```
The corpus of tweets used included `r tweet_stats$total_tweets` English-language
Syrian tweets occurring between March 31--June 15, 2012. Using the Twitter
streaming API, servers at the University of Texas at Austin collected the
tweets. The servers were run on a stable (LTS) version of Ubuntu Linux (v.
10.04) and were connected directly to the high-bandwidth UT network maintained
by IT services.
Tweets collected contained one or more of the following keywords: “syria,”
“syrian,” “damascus,” “homs,” “al-assad,” and “sunni.” These keywords were
chosen based on the most distinctive terms used in news articles covering the
Arab Spring in Syria at the time. A node.js script was used to reinitialize the
connection to Twitter's servers whenever the script timed out due to a period of
silence. The script requested data from the API endpoint using basic HTTP
authentication (OAuth was optional, at the time) at
https://stream.twitter.com/1.1/statuses/filter.json and stored the data directly
to the local hard disk. The tweets were then post-processed with a Python script
to convert the JSON-formatted tweets to tab-separated files, which were then
compressed to maximize storage efficiency.
The corpus consisted of `r tweet_stats$percent_original_tweets`% original
tweets, `r tweet_stats$percent_retweets`% retweets, and `r
tweet_stats$percent_replies`% replies. There were `r tweet_stats$num_users`
users in the corpus and an average of `r tweet_stats$avg_daily_tweets` tweets
per day.
Scripts used for scraping tweets and compressing data can be found at
https://github.com/chbrown/twilight.
### Event Data
Event data were obtained from the 2012 Integrated Crisis Early Warning System
[ICEWS; @DVN/28075_2015]. The event data were first filtered to remove
incomplete and incorrectly formatted data. They were then filtered to include
only events that were directed at (i.e., `target`) Syria (n = `r num_target`).
To explore the impact of outwardly directed Syrian events, the data were
filtered again to include events that were both directed at (i.e., `target`) or
directed by (i.e., `source`) Syria (n = `r num_source_target`).
## Data Preparation
### Social Cohesion Metric
The social cohesion or linguistic similarity of daily tweets was calculated by
quantifying the frequency of shared content words with respect to those words’
frequencies in the entire tweet corpus. Retweets were included in the corpus and
used in the social cohesion calculation, as they indicate strong support for an
individual’s message. Stop words, punctuations, and duplicate words were not
removed from the tweets. The lengths of tweets were disregarded, as the aspect
of interest is in shared linguistic content and not breadth of description. The
presence of bots was not investigated. According to a 10-Q report for Twitter,
Inc., for a quarterly period in 2014, the estimated number of third party
applications automatically contacting the servers (i.e., bots), was less than
8.5% [@twitter2014]. As the data in this study were collected two years prior,
this number was likely even less. Thus we consider the number of bots negligible
for the present investigation.
The tweets were first sorted into ascending order by timestamp. They were then
grouped into successive sets of five tweets. Groups of five were chosen to allow
for manageability and efficiency of the analyses. Linguistic cohesion was
quantified for all possible pairs of tweets in the group of five, resulting in
10 pairs per group of tweets; all 10 cohesion values for the group were averaged
for a single group cohesion value. The cohesion value for each day was computed
as the average cohesion values for all groups whose first tweet was on that day.
Each pair’s cohesion value was calculated in three parts:
\begin{align}
\text{Inverse frequency weighting of overlap words = }\sum_{i=1}^k
\frac{log(f(t_i, d_1) + f(t_i, d_2) + 1)}{log(f(t_i) + 1)}
\end{align}
Equation 1 identifies the inverse frequency weighting of each overlapping word
$t_i$ between two tweets $d_1$ and $d_2$, relative to the word’s frequency in
the corpus D, for all overlapping words $t_{1...k}$ between the two tweets.
\begin{align} \text{Inverse frequency weighting of all words = }\sum_{i=1}^n
\frac{log(f(t_i, d_1) + f(t_i, d_2) + 1)}{log(f(t_i) + 1)} \end{align}
Equation 2 then calculates the inverse frequency of each word $t_i$ in tweets
$d_1$ and $d_2$, relative to the word’s frequency in the corpus D, for all words
$t{1...n}$ that appear in either tweet.
\begin{align} \text{Two tweets' similarity =}\frac{\text{inverse frequency
weighting of overlap words}}{\text{inverse frequency weighting of all words}}
\end{align}
Finally, Equation 3 identifies their linguistic cohesion by taking the ratio of
the results of Equations 1 and 2. Through this process, we are able to naturally
account for the frequencies of natural language generally and of online language
specifically: Words that occur many times in the corpus (which we can
operationalize as function words; e.g., prepositions, pronouns) influence the
similarity scores between two tweets much less than words that occur rarely in
the corpus (which we can operationalize as content words; e.g., names,
hashtags).
This linguistic cohesion metric is similar to *term frequency-inverse document
frequency* [tf-idf; see also @jones1972statistical;
@robertson2004understanding]. However, the method focuses more squarely on terms
(i.e., agnostic to the number of documents in the corpus), aggregates at a
higher semantic level (i.e., summing across words), and accounts for linguistic
novelty relative to the pair and the corpus (i.e., ratio of Eq. 1 to Eq. 2). The
social cohesion calculation was performed on the tweet corpus containing English
tweets. All hashtags were included in the analysis, while URLs were removed. The
raw social cohesion time series can be seen in Fig
\@ref(fig:raw-cohesion-ts).
[Fig \@ref(fig:raw-cohesion-ts) about here]
```{r raw-cohesion-ts, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE, fig.cap="Raw social cohesion metric time series."}
# read in social cohesion data
raw_cohesion_data <- read.csv("./data/raw/syria_coherence_sort_5.csv")
# converge date to single variable
raw_cohesion_data$Start.Year <- "2012"
raw_cohesion_data$date <- as.Date(with(raw_cohesion_data,
paste(Start.Year, Start.Month, Start.Day,
sep="-")), "%Y-%m-%d")
# detach plyr so summarise works
#detach("package:plyr", unload=TRUE)
# calculate mean cohesion
raw_cohesion_ts <- raw_cohesion_data %>%
group_by(date) %>%
summarise(
MeanCohesion = mean(Cohesion..avg.),
sd = sd(Cohesion..avg.),
var = sd^2,
n = n(),
se = sd / n) %>%
ggplot(aes(x = date, y = MeanCohesion)) +
geom_errorbar(aes(ymin = MeanCohesion-var, ymax = MeanCohesion+var), color = "grey57") +
geom_point() +
geom_line() +
xlab("Date") +
ylab("Mean Social Cohesion")
# save the plot
ggsave("./results/primary/raw-cohesion-ts.png")
# call the image
knitr::include_graphics("./results/primary/raw-cohesion-ts.png")
```
### ICEWS Event Time Series
Using the Integrated Crisis Early Warning System (ICEWS) developed at the
Defense Advanced Research Project Agency, the daily events were scored with an
intensity rating ranging from -10 to 10. Negative events---such as riots and
attacks---are coded with low negative numbers, while positive or peaceful events
are coded with high positive numbers. This well-validated coding scheme allows
for numerical quantification of different event types.
For both the target-only and the target and source data described above, time
series of the count of all events were generated by summing the total number of
events on each day, irrespective of ICEWS score. Similarly, time series of count
of positive events were generated by summing the number of ICEWS events for each
day that had an intensity rating greater than zero. Lastly, time series of count
of negative events were generated by summing the number of ICEWS events for each
day that had an intensity rating less than zero.
The raw event time series for target filtered data can be seen in Fig
\@ref(fig:raw-event-ts-target). The raw event time series for source and target
filtered data can be seen in Fig \@ref(fig:raw-event-ts-source-target).
[Figs \@ref(fig:raw-event-ts-target) and
\@ref(fig:raw-event-ts-source-target) about here]
```{r raw-event-ts-target, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE, fig.cap="Raw event count time series for target filtered data."}
# specify global plotting variables
all_event_color = "#CC79A7"
positive_event_color = "#0072B2"
negative_event_color = "#D55E00"
line_thickness = 2
# read in social cohesion data
event_df <- read.csv("./data/formatted/primary/formatted_data.csv")
# plot the time series
raw_events_ts_target <- ggplot() +
geom_line(event_df,
mapping = aes(x= as.Date(Date),
y=all_events_target,
colour = 'all events')) +
geom_line(event_df,
mapping = aes(x= as.Date(Date),
y=pos_events_target,
colour = 'positive events')) +
geom_line(event_df,
mapping = aes(x= as.Date(Date),
y=neg_events_target,
colour = 'negative events')) +
scale_color_manual(values = c(all_event_color, # all
positive_event_color, # positive
negative_event_color)) + # negative
scale_size_manual(values=line_thickness) +
xlab("Date") +
ylab("Number of events") +
ylim(0, 360)
ggsave("./results/primary/raw-event-ts-target.png")
# call the image
knitr::include_graphics("./results/primary/raw-event-ts-target.png")
```
```{r raw-event-ts-source-target, echo=FALSE, warning=FALSE, message=FALSE, error=FALSE, fig.cap="Raw event count time series for source and target filtered data."}
# plot the time series
raw_events_ts_source_target <- ggplot() +
geom_line(event_df,
mapping = aes(x = as.Date(Date),
y=all_events_source_target,
colour = 'all events')) +
geom_line(event_df,
mapping = aes(x = as.Date(Date),
y=pos_events_source_target,
colour = 'positive events')) +
geom_line(event_df,
mapping = aes(x = as.Date(Date),
y=neg_events_source_target,
colour = 'negative events')) +
scale_color_manual(values = c(all_event_color, # all
positive_event_color, # positive
negative_event_color)) + # negative
scale_size_manual(values=line_thickness) +
xlab("Date") +
ylab("Number of events") +
ylim(0, 360)
ggsave("./results/primary/raw-event-ts-source-target.png")
# call the image
knitr::include_graphics("./results/primary/raw-event-ts-source-target.png")
```
## Analyses
### Cross-Recurrence Quantification Analysis
We first used categorical cross-recurrence quantification analysis [CRQA;
@zbilut1998detecting] to investigate the temporal patterns of social cohesion
and real-world events in Syria during Arab Spring. CRQA is an extension of RQA,
which captures the patterns of revisited states of a single time series
[@zbilut1992embeddings; @webber1994dynamical]. CRQA allows for two time series
to be projected onto one another to investigate shared states and trajectories.
In CRQA, we track these shared states by identifying *recurrent points*, or
intersections of time during which two signals shared or revisited the same
state. These shared states can be at the same time or at later points in the
time series. We can further chart shared trajectories by examining the
occurrences and patterns of multiple sequential recurrent points.
To perform categorical CRQA, the time series must be converted to the same
scale, while still preserving a sampling rate that respects the native timescale
of the system [cf. Nyquist frequency in signal processing;
@grenander1959probability]. In other words, a system that changes every day must
be sampled more often than a system that changes every year and less often than
a system that changes every hour. However, to compare two time series with CRQA,
the two time series must be sampled at the same frequency as one another.
Because the social cohesion and event data have different scales and values, we
converted each time series of daily values (e.g., Fig \@ref(fig:raw-ts)) into
deciles (e.g., Fig \@ref(fig:deciled-ts)) to analyze the relative dynamics of
different levels of social cohesion and counts of all, positive, and negative
events on each day. We implemented CRQA using the `crqa` package [@R-crqa] for R
[@R-base].
[Figs \@ref(fig:raw-ts) and \@ref(fig:deciled-ts) about here]
#### Visualization
A cross-recurrence plot (CRP) is the visual representation of CRQA. To construct
a cross-recurrence plot, two time series are plotted against each other on the
$x$ and $y$ axes. At every point where they share the same state, a point is
placed on the plot to indicate recurrence. (For categorical CRQA, this process
is similar to the Cartesian product.)
We can visually analyze these plots to reveal patterns of structure and
recurrence. The main diagonal line on the plot is known as the *line of
synchrony*. This line falls where the time series match temporally. A completely
filled main diagonal line would indicate perfect in-phase synchrony between the
two time series. See Figs
\@ref(fig:plot-rp-targ)-\@ref(fig:plot-rp-source-targ) for examples of RPs.
#### Metrics
While visual inspections can be informative, we can derive a variety of
quantitative metrics from the cross-recurrence plot. Each of these metrics
provides information into the patterns and dynamics of the new shared system.
These metrics are also critical for inferential statistics. We will focus most
directly on two of these metrics: *recurrence rate* and *determinism*.
The percentage of recurrent points to total possible points is a metric called
the *recurrence rate* (*RR*). This metric gives a general description of how
frequently the time series are sharing the same states. In the current study, a
recurrent point would indicate that the level of social cohesion and frequency
of events were at the same intensity for across two particular points in time
(in this case, one-day windows). Importantly, the nature of CRQA allows us to
quantify how a system shares similar states across time: In other words, it
identifies recurrence whether a window of shared intensity occurs on the same
day for the two time series or after a one-week gap between when the two time
series exhibit the same intensity.
Recurrent points that occur in successive moments form line structures. These
are conceptualized as shared trajectories, where the two time series are moving
together across time in a shared state. The percentage of recurrent points on
the plot that fall along diagonal line structures (i.e., two or more consecutive
points) is a metric called *percent determinism* (or simply *determinism*;
*DET*). For the current project, a high DET would indicate that not only are the
levels of social cohesion and frequency of events occurring at similar
intensities, but they are doing so across consecutive days.
A number of other metrics provide insight not only into the amount of structure
between the systems but about the nature of this shared structure. The average
length of the line structures is called *mean line* (L), which can be
conceptualized as the average amount of time that the two time series move
together before one or both shift states. The longest line length is called *max
line* (*maxL*); it indicates the longest instance of being “stuck” in a
particular state together and is a measure of the stability of the system. The
total *number of lines* on the plot (*NRLINE*) scales with structure: The more
lines that exist, the more coupled the two systems are. The complexity or
variability of line lengths is captured by a metric called *entropy*
(*ENTR*)---which may also be normalized by the total number of lines (i.e.,
*normalized entropy*; *rENTR*)---and indicates the stability and predictability
of the system. When one of the signals gets “stuck” in a state and the other
moves on to other states, this results in vertical line structures, which are
quantified in *laminarity* (*LAM*; i.e., the proportion of points forming
vertical line structures) and *trapping time* (*TT*; i.e., the average length of
the vertical trajectories). For more detail on CRQA metrics, see @R-crqa.
#### Parameters
To conduct the analyses on the decile data, the parameters were set to those
typically used in categorical CRQA: Delay was set to $0$, embedding dimension
was set to $1$, and radius was set to $.001$ to allow for only exact recurrent