[PATCH 6.6.y 2/5] crypto: x86/aegis128 - optimize length block preparation using SSE4.1