Reddit comments and submissions from 2005-06 to 2024-12 collected by pushshift and u/RaiderBDev. These are zstandard compressed ndjson files. Example python scripts for parsing the data can be found here The more recent dumps are collected by u/RaiderBDev