SDK • Anyone got assembly code for 32x32 => 64 that works?

Sorry to wimp out, but I'm struggling with unfamiliarity with the instruction set and assembler syntax. I'm trying to get the fastest 32x32 multiply into 64-bit result that I can, the version here is around 25% faster than the C version but I can't get it to work. If anyone can help I'd be dead chuffed. I'm sure it's just something dumb I'm doing, but I have all day failed to find it and life is too short

The C is just this :

Code:

inline int64_t mul32x32 ( const int32_t a, const in32_t b ){    return a * (int64_t) b;}

but the above compiles to 6 multiplies and the use case I want - 32 and 32 in, 64 out - only needs 4.

Code:

mul32x32_64 :.global mul32x32_64// We want// signed   ahi*bhi// signed   ahi * blo// signed   alo * bhi// unsigned alo * blo => but it just gives us a 32-bit pattern, so unsigned doesn't matter                         // here r0 = a r1 = buxth    r2,r1            // alo => r2 = a & 0xfffflsrs    r1,r1,#16        // ahi => r0 = a >> 16lsrs    r3,r0,#16        // bhi => r3 = b >> 16uxth    r0,r0            // blo => r1 = b & 0xffffpush    {r4}movs    r4,r0            // r4 = bhi whymuls    r0,r2            // lolo => r0 = blo * alo - that's why, we corrupt r0muls    r4,r1            // x1   => r4 = ahi * blomuls    r1,r3            // hihi => r1 = ahi * bhimuls    r3,r2            // x2   => r3 = bhi * alolsls    r2,r4,#16        // r2 = (x1 << 16)lsrs    r4,r4,#16        // r4 = x1 >> 16adds    r0,r4,#0adcs    r1,r2pop     {r4}lsls    r2,r3,#16        // r2 = x2 << 16lsrs    r3,r3,#16        // r3 = x2 >> 16adds    r0,r3,#0adcs    r1,r2bx      lr

Statistics: Posted by omenie — Thu Sep 11, 2025 4:50 pm — Replies 3 — Views 262

SDK • Anyone got assembly code for 32x32 => 64 that works?

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

Neem Baba Extra Questions Answer Class 6 English Poorvi

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Lowe faces four theft charges

Practice Sheet of Right form of verbs for HSC Students

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

The 10 Tennessee Cities With The Largest Black Population For 2021

Materials Around Us Class 6 Worksheet Science Chapter 6

デスクトップヒープの枯渇

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Kanulanu Thaake Lyrics and translation | Manam (2014)

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Teen Shot In Miami Drive-By Dies From Injuries

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

Mahakal Attitude Status

Property developer set up cannabis factory to help pay off debts...

♡

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...