An Investigation: Shootings in the United States

Presented by

Ahle, Matthew    Ahrens, Kyra    Henley, Emmanuel

Aim

What are some of the factors that contribute gun violence, and what is the soical media response made by politicians in light of these crisis?

There have a total of 116 Mass Shootings since 1982, where the number of fatalities per shooting has been three or more. Where five of the most deadliest shooting have occured since 2007. There have been two independent studies conducted by Harvard University and the Federal Bureau of Investigation (FBI), where they have found not only has there been an increas of incidents but also they have proven that the deadliness of the shootings have also been increasing in recent times. Although these incidents seem isolated (only affecting the those directly involved), it infact affects the USA as a whole. According to a report published by the vice chair of the Joint Economic Committee on September of 2019, they have found that in this year alone, America has spent a total of $229 Billion Dollars.

For this project, we decided to focus on three destinct areas:

To complete this project we utilized several different sources of data. This data included strucutured and unstructured data. Our goal was to find data pretaining to statitics on demographics for each individual state of America as well as data that contained the state's gun laws. To gague the reaction on social media, chose a data set that focused on Tweets made by policitians in the 115th US Congress. The following are some of the main data sets we used, as well as the links to them.

Since we used various types of data (statistical, textural, etc.), we were able to apply a variety of data science techniques for this project. We used time series analaysis when analysing data in which we needed to adjust window size, such as with finding tweets that fall within a certain date range of an incident. When it came to analaysis correlations we focused on a state level and was able to employ the use of both Spearman and Pearson l to find correlations between and incident and a state statistic. For analayzing textual data, we untilized td-idf in addition to suggested stopwords to find meaning text within a tweet. Additional, we were able to use bag of words to find the most frequent terms used by politicians before and after an incident. Lastly, due to its complexity, we used multilevel modeling to gain useful insights on our gun laws data.

We were able to find several different insights on the topic of this project. As part of our research, we have determined the correlation between five different demographic determinants and the absolute number of incidents related to gun violence in schools. Our outcomes comprise heat maps representing the characteristics of all evaluated attributes per state as well as results of the correlation analysis according to Pearson and Spearman. The following visualizations depicts one of those correlations.

Absolute Number of School Shootings 1970-2019
Average Youth Poverty (%) 1990-2017

When it comes to guns laws, we found that states with higher numbers of gun related incidents tend to have higher number of adoptions of new gun laws throughout the years we studied. This can be viewed in the chart below.

For the reaction on social media by politicians, we focused on tweets made one month prior, one week prior, one day prior, the day of, one day after, one week after, and one month after an incident. We found most frequent words the day after, one week after, and one month after an incident where words relating to gun violence. Constradictory of the post made prior to an event. Given below are the most common words found in the tweets of politicians one day prior, and one day after an event has taken place.