A dataset of 1.2 million molecules with DFT-level quantum chemical annotations for molecular representation learning | Nature