README.md

This repository contains canonical copies of useful reference datasets for the Wikimedia movement. It is maintained jointly by the Wikimedia Foundation's Product Analytics and Data Engineering teams.

Data format

The data here is stored in tab-separated values (TSV) format. For simplicity, values should not contain any tabs or newlines.

This approach avoids the escaping and quoting issues often caused by the CSV format (for an example, see T327983).
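As an illustration of the convention above, here is a minimal Python sketch of writing and reading such a file with no quoting or escaping. The function names (`check_value`, `write_tsv`, `read_tsv`) and the sample columns are hypothetical and not part of this repository. Rejecting carriage returns as well as newlines is an assumption, and so is the choice to fail loudly rather than escape.

```python
import io


def check_value(value: str) -> str:
    """Return the value unchanged, or raise if it would break the TSV layout."""
    # Assumption: carriage returns are treated as newlines and rejected too.
    if any(ch in value for ch in ("\t", "\n", "\r")):
        raise ValueError(f"value contains a tab or newline: {value!r}")
    return value


def write_tsv(rows, out) -> None:
    """Write rows as tab-separated lines, with no quoting or escaping."""
    for row in rows:
        out.write("\t".join(check_value(str(v)) for v in row) + "\n")


def read_tsv(text: str) -> list:
    """Split text into rows and fields; every tab and newline is a delimiter."""
    lines = text.split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # drop the empty piece after the final newline
    return [line.split("\t") for line in lines]


# Hypothetical sample data: commas and quotes need no special handling.
rows = [
    ["name", "note"],
    ["example", 'contains "quotes", and a comma'],
]
buffer = io.StringIO()
write_tsv(rows, buffer)
round_tripped = read_tsv(buffer.getvalue())
```

Because no value can contain a delimiter, reading is a plain split on newlines and tabs, and characters that CSV would need to quote pass through unchanged.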