Accidents caused by large trucks remain one of the leading causes of injuries and deaths in the United States.
Hence, in order to capture the driving behavior of its truck drivers, ANT corporation has equipped its trucks with a device to log location and event data to increase safety.
-
Identify dangerous commercial truck drivers;
-
Create an analytical dashboard for illustrations.
-
We first utilize big data Hadoop ecosystem to process geospatial data, including database and tables creation, data ingestion, transform.
-
Then, we perform data exploration and visualization in Tableau by connecting to Hadoop ecosystem server.
-
Besides, we modeled the regression relationship between factors and truck driver accident risks by connecting to R and external analysis in STATA.
-
Finally, we draw our conclusion and propose suggestions on how to lower the probability of large trucks accidents.
Answer: Most risky driver: A73 (10 risk factor); Least risky driver: A2 (0.966 risk factor)
Answer: City with highest average risk factor: Hollister (with risk factor 5.498)
Answer: Unsafe following distance causes highest number of unnormal event.
Answer: MPG or Speed does not have obvious relationship with risk factor.
Answer: Different tuck model has different frequency of unnormal events, thus different level of risk factor.
Business Question 6: Using linear regression and correlation to explore the factors affecting risk factor ~
Inference: risk factor is almost perfectly correlated with only the number of events
-
Risk factors are not dependent on MPG/Velocity, it is perfectly correlated with count of unnormal events.
-
Most unnormal event occurring city is Willits, it may be worth to analyze further on that, if local factors, such as traffic, road conditions, road maps are related to the risk factors.
-
Top 5 Risky drivers can be counseled about unsafe driving and consequences and given more training if needed.
-
Since Unsafe following distance is the most occurring event and it affects the risk factor most, an alert system can be installed on all the trucks which will notify the drivers if they have crossed the threshold of safe distance.