Re: [Tinycc-devel] Optimizing for avx512


From: Elijah Stone
Subject: Re: [Tinycc-devel] Optimizing for avx512
Date: Sun, 6 Feb 2022 02:16:23 -0800 (PST)

On Sun, 6 Feb 2022, Samir Ribić via Tinycc-devel wrote:

> A bit faster than VMOVUPS is VMOVAPS, but the numbers must be at addresses divisible by 64.

They are the same speed. VMOVUPS will go slowly if you give it an unaligned address, while VMOVAPS will fault; but they do exactly the same thing given an aligned address. That said, if you do intend to use aligned addresses for performance, you should probably use the 'aligned' instruction variants where you can: a fault on a misaligned address is a cheap way to check your work.
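For what it's worth, here is a minimal sketch of getting 64-byte-aligned storage in portable C11 and checking it before handing it to aligned loads and stores. The helper names (is_aligned64, alloc_floats64, check_aligned64) are made up for illustration, and I'm assuming a libc that provides aligned_alloc; nothing here is specific to tcc.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper: nonzero if p is divisible by 64,
   i.e. suitable for a 512-bit aligned load or store. */
static int is_aligned64(const void *p)
{
    return ((uintptr_t)p % 64) == 0;
}

/* Hypothetical helper: allocate room for n floats on a 64-byte boundary.
   C11 aligned_alloc expects the size to be a multiple of the alignment,
   so the byte count is rounded up to the next multiple of 64. */
static float *alloc_floats64(size_t n)
{
    size_t bytes = (n * sizeof(float) + 63) / 64 * 64;
    if (bytes == 0)
        bytes = 64;
    return aligned_alloc(64, bytes);
}

/* Hypothetical helper: software stand-in for the fault an aligned
   instruction would raise; aborts in debug builds on a misaligned pointer. */
static void check_aligned64(const void *p)
{
    assert(is_aligned64(p));
}
```

Calling check_aligned64(v) on a buffer from alloc_floats64 passes, while calling it on (char *)v + 4 aborts unless NDEBUG is defined. Release the buffer with free() as usual.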

 -E
