[PATCH 6.12.y 2/4] crypto: x86/aegis128 - optimize length block preparation using SSE4.1