-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathSTAT660-01_f18-team-1_project1_data_analysis_by_LL.sas
168 lines (130 loc) · 5.39 KB
/
STAT660-01_f18-team-1_project1_data_analysis_by_LL.sas
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
*******************************************************************************;
**************** 80-character banner for column width reference ***************;
* (set window width to banner width to calibrate line length to 80 characters *;
*******************************************************************************;
*
This file uses the following analytic dataset to address several research
questions regarding FIFA 18 user information
Dataset Name: FIFA_Player_Data created in external file
STAT660-01_f18-team-1_project1_data_preparation.sas, which is assumed to be
in the same directory as this file
See included file for dataset properties
;
* environmental setup;
* set relative file import path to current directory (using standard SAS trick;
X "cd ""%substr(%sysget(SAS_EXECFILEPATH),1,%eval(%length(%sysget(SAS_EXECFILEPATH))-%length(%sysget(SAS_EXECFILENAME))))"""
;
* load external file that generates analytic dataset fifa18_analytic_file;
%include '.\STAT660-01_f18-team-1_project1_data_preparation.sas'
;
title1
'Research Question: What is the age distribution amongst each club in FIFA 18?'
;
title2
'Rationale: This could help identify which countries tend to focus on creating their players from youth as opposed to those who purchase their players'
;
footnote1
'seems that most leagues with a higher age distribution are amongst asian, usa, and english leagues while a much younger age distribution is amongst latin american teams and african teams'
;
footnote2
'This is important to understand why some leagues are of higher caliber as well as why some countries perform better in international tournaments'
;
footnote3
'I believe this is the case since some countries are behind infrastructure wise and prefer to buy the star players then to focus on youth academies'
;
*
Methodology: I think we could take the proc means of the age variable and base
it off the clubs and even possibly get a by league by club break down to
understand the age distribution amongst top clubs and top leagues
Limitations: I think our analysis won't account for which players are
designated starters on the team and which players are reserves
Possible Follow Up: Maybe a variable for starting position on the team whether
it is starter, bench, or reserves
;
proc means
data=fifa18_analytic_file
;
class club
;
var age
;
run;
title;
footnote;
title1
'Research Question: What is the Euro Value distribution amongst each league and nationality in FIFA 18?'
;
title2
'Rationale: This could correlate based off the first question determining which teams "buy their trophies" as opposed to work for them'
;
footnote1
'based on the above tables we can see that the german, african, and english leagues favor a bigger body type then latin teams who prefer a smaller body type'
;
footnote2
'This is important to determine the playstyles and differences in team compositions'
;
footnote3
'This might be due to the fact that some teams are much more dominant with defense and aggressive plays as opposed to dribbling and finesse'
;
*
Methodology: Could proc sort body type by leagues and then find the mean salary
of players based off body types to find any correlation amongst leagues that
might prefer more finesse players versus physical players
Limitations: I can see how the data might not tell us much as I am sure player
position might play a role on what body type a club would want that player to
have.
Possible Follow Up: Perhaps we could do the focus more on weight and height in
order to determine whether a league prefers a faster/lighter finesse player or
a physically bigger and muscular "power" player
;
ods graphics on / width = 8in height = 8in;
proc sgplot
data=fifa18_analytic_file
;
vbox
eur_wage / category = nationality
;
where overall > 80
;
run;
title;
footnote;
title1
'Research Question: How does the body type distribution differ in each league and is there a correlation with salary based on the preferences of each league?'
;
title2
'Rationale: This would help determine if leagues prefer different types of player body types based on league playstyles'
;
footnote1
'body type does not necessarily correlate with value, might want to check and see if we can tie this more over to skill level or game statistics as body type does not associate over with skill level'
;
footnote2
'This is important because even leagues that preferred bigger body types did not correlate with the pay of the body type showing that skill is valued higher'
;
footnote3
'not sure why this is the case, if your team is centered around a specific playstyle, why throw in a player that is the complete opposite?'
;
*
Methodology: Could we take the median and range of salaries based off of clubs,
leagues, and nationality, to see if there is a possible trend in the types of
players that clubs spend money on (i.e. Brazilian) as well as the top clubs
versus the smaller clubs
Limitations: I think the question or data might be too broad, would need to
identify the best way to determine which leagues and/or clubs buy their
trophies and players as opposed to raising their youth to play in their clubs
Possible Follow Up: Perhaps we could determine the distribution of foreign
players in each league and then take the mean salaries and values of each
leauge based on that percentage of non-home grown players
;
proc corr
data=fifa18_analytic_file
;
var
height_cm weight_kg
;
with
eur_value
;
run;
title;
footnote;