RISC-V ASM unaligned read/writes: alternative assembly by SparkiDev · Pull Request #10530 · wolfSSL/wolfssl

SparkiDev · 2026-05-26T03:02:28Z

Description

Not all RISC-V chips allow unaligned reads and writes with basic assembly instructions like lw/sw.
Add alternative assembly that is turned on with:
WOLFSSL_RISCV_ASM_NO_UNALIGNED.

Fixes #10525

Testing

./configure --disable-shared LDFLAGS=--static --host=riscv64 CC=riscv64-linux-gnu-gcc --enable-riscv-asm CFLAGS=-DWOLFSSL_RISCV_ASM_NO_UNALIGNED

SparkiDev · 2026-05-26T03:55:44Z

Jenkins: retest this please

EAlexJ · 2026-05-26T08:04:16Z

I would like to push back on this implementation a bit.

Currently, all instructions that could be unaligned get emulated.

I think it makes sense to check if the pointers are actually unaligned and only then use the more costly emulation.
I suggest something like this (with the check guarded behind WOLFSSL_RISCV_ASM_NO_UNALIGNED):

Sha512Transform(wc_Sha512* sha512, const byte* data,
    word32 blocks)
{
if((uintptr_t) data % 8) // modulo 8 as the routine uses UNALIGNED_[LS]D
    <emulate using the UNALIGNED_* macro>
else
    <normal execution>
}

There should only be a negligible impact on performance in case the data is aligned. This also only introduces a small size overhead, the check should be a few instructions at most.

EAlexJ · 2026-05-26T08:06:26Z

On a separate issue: In my opinion readability would be increased if you just used the UNALIGNED_* Macro unconditionally and only check once in the actual definition of the macro if WOLFSSL_RISCV_ASM_NO_UNALIGNED is defined and then emit the emulation.
This is then similar to how REV8 and others are handled. It would then also make sense to name the macro differently.

EDIT: This is probably obsolete in case the suggestion above gets adopted

Not all RISC-V chips allow unaligned reads and writes with basic assembly instructions like lw/sw. Add alternative assembly that is turned on with: WOLFSSL_RISCV_ASM_NO_UNALIGNED.

EAlexJ · 2026-05-27T10:04:49Z

I think you missed some in

int wc_AesSetKey(Aes* aes, const byte* key, word32 keyLen, const byte* iv, int dir) {

The pointer to key is not necessarily aligned (Found it out the hard way)

Are you saying then that none of the fields of Aes will be aligned?
If so, then I will need to change the access to the fields: key, reg and tmp.

That I do not know, the best solution for me is to align the buffers at its source, so I went into the ssl struct and corrected the placement of byte* key.
This was sufficient, so in this very case only key seems to not be aligned by default

EAlexJ · 2026-05-27T10:06:18Z

Also, why did you choose to use ALIGN16, is ALIGN8 not already sufficient?

Because they are 16 byte buffers in AES, I made it align on 16 bytes.
Looking into it, the vector instructions are 64 bit loads and stores so changing to ALIGN8 will be fine.

SparkiDev · 2026-05-28T07:31:38Z

I've modified the macros to check the alignment and choose the sequence to use.
Let me know if that is better.

Thanks,
Sean

EAlexJ · 2026-05-28T08:58:46Z

I see the following pattern several times:

addi t, p, o andi t, t, <alignment mask> bnez t, <label> ---- t: scratch register p: register holding the base address o: offset

This checks the alignment of p+o, but since o is always a valid offset (is it not?) it is sufficient to check p only.
One instruction can then be saved by doing:

andi t, p, <alignment mask> bnez t, <label>

EAlexJ · 2026-05-28T09:15:38Z

One issue I see is that there is a lot of double checking for alignments:
The bulk variants call the underlying N times, but since the check for the alignment is done in the underlying only, it also gets checked N times.

In case the data is actually aligned there should only be one check.

I'd argue that most of the time the data is (or at the very least should be) aligned, so I'd try to keep the overhead for this case as small as possible.

If the data is unaligned, performance will take a hit even on hardware that supports misaligned loads and stores, e.g. when accesses cross cache-line boundaries.

SparkiDev self-assigned this May 26, 2026

This was referenced May 26, 2026

RISC-V extension selection is not completely accurate #10526

Closed

[Bug]: Cannot build RISC-V Assembly for SHA3 in Debug #10515

Closed

[Bug]: Several Issues with respect to alignment for RISC-V assembly routines #10525

Open

SparkiDev force-pushed the riscv_unaligned_fix branch from 3b8e981 to b4611fa Compare May 26, 2026 04:19

SparkiDev force-pushed the riscv_unaligned_fix branch from b4611fa to bd46955 Compare May 26, 2026 23:05

RISC-V ASM unaligned read/writes: alternative assembly

26c45cf

Not all RISC-V chips allow unaligned reads and writes with basic assembly instructions like lw/sw. Add alternative assembly that is turned on with: WOLFSSL_RISCV_ASM_NO_UNALIGNED.

SparkiDev force-pushed the riscv_unaligned_fix branch from bd46955 to 26c45cf Compare May 27, 2026 00:21

EAlexJ reviewed May 27, 2026

View reviewed changes

fixup

11ccbc3

EAlexJ reviewed May 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RISC-V ASM unaligned read/writes: alternative assembly#10530

RISC-V ASM unaligned read/writes: alternative assembly#10530
SparkiDev wants to merge 2 commits into
wolfSSL:masterfrom
SparkiDev:riscv_unaligned_fix

SparkiDev commented May 26, 2026

Uh oh!

SparkiDev commented May 26, 2026

Uh oh!

EAlexJ commented May 26, 2026 •

edited

Loading

Uh oh!

EAlexJ commented May 26, 2026 •

edited

Loading

Uh oh!

EAlexJ May 27, 2026 •

edited

Loading

Uh oh!

SparkiDev May 28, 2026

Uh oh!

EAlexJ May 28, 2026

Uh oh!

EAlexJ May 27, 2026

Uh oh!

SparkiDev May 28, 2026

Uh oh!

SparkiDev commented May 28, 2026

Uh oh!

EAlexJ May 28, 2026 •

edited

Loading

Uh oh!

EAlexJ May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SparkiDev commented May 26, 2026

Description

Testing

Uh oh!

SparkiDev commented May 26, 2026

Uh oh!

EAlexJ commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

EAlexJ commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

EAlexJ May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SparkiDev May 28, 2026

Choose a reason for hiding this comment

Uh oh!

EAlexJ May 28, 2026

Choose a reason for hiding this comment

Uh oh!

EAlexJ May 27, 2026

Choose a reason for hiding this comment

Uh oh!

SparkiDev May 28, 2026

Choose a reason for hiding this comment

Uh oh!

SparkiDev commented May 28, 2026

Uh oh!

EAlexJ May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

EAlexJ May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

EAlexJ commented May 26, 2026 •

edited

Loading

EAlexJ commented May 26, 2026 •

edited

Loading

EAlexJ May 27, 2026 •

edited

Loading

EAlexJ May 28, 2026 •

edited

Loading