This study explores dog registration record data from NYC, Seattle, Edmonton, and Adelaide and combines this data with data about particular attributes of dogs (bold, trainable, calm, sociable, and obedient). I look at the attributes according to the AKC grouping the dog is in, its location (by city and for New York, by borough), and the ancestral origin for the dog. I use the Tukey statistical test to compare the means of the dog attributes by New York borough and by city. I also perform hypothesis tests on particular aspects of the data.
This code should be ready to use; the Jupyter Notebook file analysis.ipynb
is in the src
directory. Data cleaning was performed on the original datasets in order to ensure consistency. All data is in the data
directory.
There's much more to explore with this data. I plan to undertake some of the following analyses at a later date:
-
Voter registration information is public, so it might be interesting to correlate what kinds of dogs Republicans, Democrats, and Independent voters prefer.
-
Academics have already looked at genetic markers, but it might be interesting to look at genetic markers by location.
-
I did not dig into census data, including the census tract-level statistics for NYC.
-
I did not look at neighborhood-level (zip code) results.
City of Adelaide Dog Registrations
Data on canine intelligence by Stanley Coren and others
Trainability and boldness traits differ between dog breed clusters based on conventional breed categories and genetic relatedness by Borbála Turcsán, Enikő Kubinyi, Ádám Miklósi