A 512-bit register is f64x8, not f64x4. Likely a copy-paste error from the _m256d documentation, which seems correct.