Skip to content
This repository has been archived by the owner on Mar 2, 2023. It is now read-only.
Syed Naqvi edited this page May 5, 2022 · 22 revisions

IMAT2214 Business Intelligence (BI)

Summative Assessment Task 2

Practical Development & Individual Report (50% of module coursework mark)

Imagine…

Your consulting firm has recently been hired by Citi Bike to support decision making around its bike sharing service in New York City. Citi Bike currently uses a system, for collecting data on the use of their bikes. They are quite enthusiastic about engaging with interested individuals, to provide insight on their data, using business intelligence and analytics to support decision-making. Business intelligence presents many opportunities for Citi Bike and its stakeholders through value added from data – turning insights into actions, and adopting this for change in the Citi Bike Service. As a Business Data Analyst, you have been asked to demonstrate how SAS Studio can be used to gain insight from the data they have collected on Citi Bike use. Visit the following page to learn more about Citi Bike:

Use the following guidelines to apply SAS studio:

1------------------------------------------------------------------------------------------------------------------------------------------

Access CitiBike data CSV file available for download from HERE

2-----------------------------------------------------------------------------------------------------------------------------------------

Explore the data:

       a. Have a look at the CitiBike data in your main table ‘citibike-tripdata’ to understand the kind of data you will be dealing with.

       b. The data is fairly clean, as the data scientist at your consultancy has already made an effort in Excel to prepare the data for you to analyse. However, there are some additional changes and cleaning you need to do:

              i. Some bikes were not returned by customers. This is represented by empty or incorrectly inputted data under the end_station_name and end_station_id columns. You will need to remove/exclude these from your dataset, using SAS Studio code (Hint: Use a SQL procedure (proc sql) with [delete from] and [where] statements

Syed's Code
Umar's Code

              ii. You will need to split the following columns into date and time, and reformat the date (Hint: consider [datepart] and [timepart] functions.

                  • started_at
                  • ended_at

Syed's Code
Umar's Code
       c. The following table provides a description of each attribute in your dataset table

(citibike-tripdata):

Attribute Description
ride_id The recorded instance of the bike use, by a customer or member.
started_at The start date and time the bike is undocked for use by the customer/member
ended_at The date and time, that the bike is returned to a docking station by a customer/member.
start_station_name The street or location name that the biked is collected/undocked
start_station_id The ID or reference number of the start docking station, where the bike is first collected/undocked.
end_station_name The street or location name that the biked is returned/docked
end_station_id The ID or reference number of the docking station where the bike is returned/docked.
start_lat The location latitude, where the bike is collected/undocked
start_lng The location longitude, where the bike is collected/undocked.
end_lat The location latitude, where the bike is returned/docked.
end_lng The location longitude, where the bike is returned/docked.
customer The person using the Citi Bike service can either be a casual customer, or a subscribed member customer. In this case:
• Casual = 24-hour pass or 3-day pass user;
• Member = Subscribed annual member

3-----------------------------------------------------------------------------------------------------------------------------------------

Analyse the data and develop reports and graphical charts that would be useful to Citi Bike, considering the following:
a.

What are the total number of bikes collected/undocked per Station?

  1. Start_station_name or Start_station_id
Syed's Code
Umar's Code
b.

What are the total number of bikes returned/docked per Station?

  • End_station_name or End_station_id
Syed's Code
Umar's Code
c.

What type of customer often uses Citi Bike?

  1. Customer
Syed's Code
Umar's Code
d.

What is the most common duration of bike use?

  1. Started_At and Ended_At (NB: Remember these fields will be split – see part 2 b))
Syed's Code
Umar's Code
e.

Develop a map or two maps, showing the frequency of bike collection and return in different locations:

  1. Start_lat and Start_lng
  2. End_lat and End_lng
Syed's Code
Umar's Code
f.

Which stations do Member Customers commonly collect bikes?

  1. Customer, Start_station_name
Syed's Code
Umar's Code
g.

Which stations do Casual Customers commonly collect bikes

  1. Customer, Start_station_name
Syed's Code
Umar's Code
h.

At what time of the day (group in Morning, Afternoon, and Evening) do Member Customers commonly return or dock bikes they have used?

  1. Customer, Ended_at
Syed's Code
Umar's Code
i.

Add ONE (1) additional analysis that may provide insight for decision-making.

Syed's Code
Umar's Code

4-----------------------------------------------------------------------------------------------------------------------------------------

Design a Dashboard using MS Visio [NB: this is only a sketch):
  1. Using MS Visio or MS Powerpoint as a sketching tool, design a dashboard with images of charts and report summaries you developed in Step 3 above.
  2. Apply the principles of good dashboard design for business intelligence applications. Access the videos on Linkedin Learning HERE , for guidance.

5-----------------------------------------------------------------------------------------------------------------------------------------

Write an individual report (500 words): Submit a reflective report justifying your design decisions for the ETL and dashboard that you developed. This should be no more than 500 words. This is an academic piece and references are expected.

Deliverable Checklist:

  1. In a document: o A print out of your SAS Studio code o Print out of all graphs/charts that are proposed to provide insight to Citi Bike. o A sketched design of your dashboard o An Individual report (500 words) submitted to TurnItIn by the date specified.
  2. A short 3-5 minute video, explaining your code, and running key elements of your SAS program using the Citi Bike data.

Undergraduate Mark descriptors to guide tutor evaluations in written reports:

Attribute Description
90-100% Responds to all of the assessment criteria for the task
• Displays exceptional degree of originality
• Exceptional analytical, problem-solving and/or creative skills
• No fault can be found with the work other than very minor errors, for example minor typographical issues
80-89% • Responds to all of the assessment criteria for the task
• Work of outstanding quality, evidenced by an ability to engage critically and analytically with source material
• Likely to exhibit independent lines of argument• Highly original and/or creative responses
• Extremely wide range of relevant sources used where appropriate
70-79% • Responds to all of the assessment criteria for the task
• An extremely well developed response showing clear knowledge and the ability to interpret and/or apply that knowledge
• An authoritative grasp of the subject, significant originality and insight,
• Significant evidence of ability to sustain an argument, to think analytically, critically and/or creatively and to synthesise material• Evidence of extensive study, appropriate to task
60-69% • Responds to most of the assessment criteria for the task
• A detailed response demonstrating a thorough grasp of theory, understanding of concepts, principles, methodology and content
• Clear evidence of insight and critical judgement in selecting, ordering and analysing content
• Demonstrates ability to synthesise material, to construct responses and demonstrate creative skills which reveal insight and may offer some originality
• Draws on an appropriate range of properly referenced sources
50-59% • Responds to most of the assessment criteria for the task
• An effective response demonstrating evidence of a clear grasp of relevant material, principles and key concepts
• An ability to construct and organise arguments• Some degree of critical analysis, insight and creativity
• Demonstrating some conceptual ability, critical analysis and a degree of insight
• Accurate, clearly written/presented
40-49% • Responds to some of the assessment criteria for the task
• A response demonstrating an understanding of basic points and principles sufficient to show that some of learning outcomes/assessment criteria have been achieved at a basic level
• Suitably organised work demonstrating a reasonable level of understanding
• Covers the basic subject matter and is appropriately presented but is rather too derivative and insufficiently analytical
• Demonstrates limited conceptual ability, levels of evaluation and demonstration of creativeskills
• Demonstrates adherence to the referencing conventions appropriate to the subject and/ortask
30-39% • Overall insufficient response to the assessment criteria
• A weak response, which, while addressing some elements of the task, contains significant gaps and inaccuracies
• Indicates an answer that shows only weakly developed elements of understanding and/or other skills appropriate to the task
• May contain weaknesses in presentation that constitute a significant obstacle in communicating meaning to the assessor
20-29% • Overall insufficient response to the assessment criteria
• A poor response, which falls substantially short of achieving the learning outcomes• Demonstrates little knowledge and/or other skills appropriate to the task
• Little evidence of argument and/or coherent use of material
10-19% • Overall insufficient response to the assessment criteria
• A very poor response demonstrating few relevant facts
• Displays only isolated or no knowledge and/or other skills appropriate to the task•
Little adherence to the task
0-9% • Overall insufficient response to the assessment criteria
• Displays virtually no knowledge and/or other skills appropriate to the task
• Work is inappropriate to assessment task given

Marking Grid

Criterion Weight 0-19                                     20-29                                      30-39                                                       40-49                         50-59                                      60-69                                                  70-79                                           80-89                                90-100                                                                     
Data Access and Exploration in SAS 20% Very weak or no attempt of any merit An attempt at ETL, but weak and lacking in effort. Some effort to use ETL evident, but weak overall. ETL used to a basic level, but with gaps/omissions/weaknesses. Generally sound ETL methods/codebut some weaknesses. Good use of ETL, significant effort, no major weaknesses.Some evidence of data validation/formatting. Excellent use of ETL, significant effort, very few weaknesses overall, basic validation/formatting present. Outstanding use of ETL, with significant effort, very high standards and no weaknesses evident, basic validation/formattingpresent. Exceptional use of ETL, with highest levels of effort, standards and with no weaknesses evident. Data sources accessedwith complete validation/formatting present.
Data Analysis in SAS 30% Very weak or no attempt of any merit An attempt, but weak and lacking in effort or analysis Some effort evident, but weak analysis overall A basic but useable analysis, but with gaps/omissions/weaknesses Generally sound analysis evident, but some weaknesses Good analysis and graphs, significant effort, no major weaknesses Excellent analysis and graphs, significant effort, very few weaknesses overall Outstanding analysis and graphs, with significant effort, very high standards and no weaknesses evident Exceptional work, with highest levels of effort, standards and with no weaknesses evident
DashboardDesign 30% Very weak or no attempt of any merit An attempt, but weak and lacking in effort or design Some effort evident, but weak design overall A basic but useable dashboard, but with gaps/omissions/weaknesses Generally sound dashboard, but some weaknesses Good dashboarddesign, significant effort, no major weaknesses Excellent dashboarddesign, significant effort, very few weaknesses overal Outstanding dashboard design, with significant effort, very high standards and no weaknesses evident Exceptional work, with highest levels of effort, standards and with no weaknesses evident
Individual Reflective Report 20% Very weak or no attempt of any merit An attempt, but weak and lacking in effort. Barely any work. Some effort evident, but weak overall. Sufficient and basic work, but with gaps/omissions/weaknesses. Generally sound, but some weaknesses Good work, significant effort, no major weaknesses Excellent work, significant effort, very few weaknesses overall Outstanding work, with significant effort, very high standards and no weaknesses evident Exceptional work, with highest levels of effort, standards and with no weaknesses evident
OVERALL 100% Very weak or no attempt of any merit An attempt, but weak and lacking in effort Some effort evident, but weak overall Sufficient and basic work, but with gaps/omissions/weaknesses Generally sound, but some weaknesses Good work, significant effort, no major weaknesses Excellent work, significant effort, very few weaknesses overal Outstanding work, with significant effort, very high standards and no weaknesses evident Exceptional work, with highest levels of effort, standards and with no weaknesses evident
Clone this wiki locally