This patch changes 32 bytes stores to two 16 bytes stores to fix a recent degradation due to 32 bytes stores.
This patch makes use of new POWER10 vector pair instructions for loads and stores.