For this exercise, you have been provided a dataset of traffic accidents that occurred in Davidson County which was retrieved from https://data.nashville.gov/Police/Traffic-Accidents/6v6w-hpcw.
This spreadsheet has been divided into the following sheets:
- Accidents: Contains the raw data for each accident. In order to familiarize yourself with what is contained in each column, take some time to review the data dictionary here: https://data.nashville.gov/api/views/6v6w-hpcw/files/3af9281b-fa4f-4044-83f8-1b9a46cc6e95?download=true&filename=Traffic-Accidents-Metadata-v2.pdf.
- Analysis: This sheet will hold most of your answers.
- Zero Car Crashes: This sheet will be used in question 3.
- Collision Types: Contains the text description of each collision type. You will fill in your answers for question 5 here.
- Weather Types, Illumination Types, and Harmful Types: Contain descriptions for the other three codes. Not used in this exercise.
Use formulas to answer each question. Unless otherwise stated, fill in your answers in the "Analysis" tab.
-
How many total accidents are contained in this dataset?
-
What are the earliest and latest records that appear in this dataset?
-
a. Create a new column to the right of the "Number of Motor Vehicles" column called "Single or Multiple". This column should contain "Single" if the number of vehicles is 1 and "Multiple" if it involved more than one vehicle.
b. Are there any rows that involved zero vehicles? How many? Make sure that your formula accounts for these cases.
c. Investigate the rows that have zero vehicles using the FILTER function in the "Zero Car Crashes" sheet. What do you find?
d. What percentage of crashes are single-car? -
How many accidents occurred which are hit and run and had at least one injury?
-
a. What is the overall average number of injuries?
b. Go to the "Collision Types" sheet and fill in the table to find the total number of crashes, average number of injuries, and total number of injuries per collision type. For each calculation, write a single formula and copy it down the table. What do you find? (Hint: If you're not sure how to answer this question, revisit the "Conditional functions and lookups" chapter of Data Analysis in Spreadsheets.) -
Add four new columns, Month, Year, Hour, and Weekday to the right of the Date and Time column. Use the TEXT function to extract out the Month, Year, Hour, and Weekday from the "Date and Time" column.
-
Fill in the Hour table in the Analysis spreadsheet to find the number of accidents that occurred for each hour of the day. Again, write a single formula and copy it down the table. Do you see anything unusual? What might be the explanation for this?
-
Do the same for the year and day of the week. What stands out?
-
Add a column to the right of the Collision Type Code called "Collision Type". Use the table contained in the Collision Types sheet to fill in this column.
-
Which interstate has the most accidents between I-24, I-40, I-65, and I-440? Answer this by counting the number of accidents that contain the strings "I 24", "I 40", "I 65", or "I 440" in their Street Address. Hint: You may need to make use of wildcards in combination with the CONCAT function to answer this. Then do the same but search for "I24", "I40", "I65", and "I440" and then "I-24", "I-40", "I-65", and "I-440". Sum the results to get a total count.