Python utilities for scraping ultimate tournament data from USAU
This repository includes some simple python modules for scraping the USAU website for tournament results, and some utility functions to manipulate and clean this data. It also includes some examples of such data downloaded as csv files, and some jupyter notebooks with some visualizations from this data.
The top_n_players.py
CLI script provides a quick way to glean the top player contributions from tournaments like nationals, where player statistics are tracked with some reliability.
- 2016 D-I College Nationals Fantasy Wrap-up: Winners of the /r/ultimate fantasy contest and some overall player statistics
To install with setuptools:
# Check if https_proxy environment variable needs to be set
git clone https://github.com/azjps/usau-py.git
cd usau-py
./setup.py install
or alternatively with pip:
pip install git+https://github.com/azjps/usau-py.git
This should be both python2.7 and python3 compatible.
See the requirements.txt; pandas 0.13+ is the only main module required. pandas 0.17.1+, jupyter, and seaborn are recommended to run and view the notebooks. beautifulsoup4 and lxml are recommended for scraping web data on-the-fly from play.usaultimate.org.