[PATCH 6.12.y 2/5] crypto: x86/aegis128 - optimize length block preparation using SSE4.1