Introduction Exploratory data analysis has been done on dataset of a image rendering of Newcastle city. The datasets were well considered and then informed choice was made for exploratory data analysis.
Files to be found: (folder submitted on NESS)
- Structured Abstract
- Report on full eda(.ipynb) notebook
- Snapshot of git log from github
Steps to replicate:
- Download the ipynb file and directly run in google colaboratory. it can also be accessed through: https://github.com/psagar2/recipes/blob/ac55464d0fa7c26022b76b51d526d37298e7a944/terapixel.ipynb .
- You first need the access of the dataset as it is not publicly available.
- After the dataset is accessed upload it to the colab notebook or place it in the folder of jupyter.
- just press run all button and the full analysis would be performed in front of you.
- As per google colab to save this file you can download it as ipynb and can also view it in vs code.
Platform used: This project has been made and tested on google colab, with 25GB ram and 225gb hardisk space. It would easily be replicated in a similar or better environment. It may need some additional encoding for different OS.
Miscellaneous: If your environment doesn't have any packages installed for these libraries:
- Pandas
- MatplotLib
- Pandasql
- Seaborn
just run pip install (name of the library)