Add fexpa learning path by ClaudioMartino-arm · Pull Request #2632 · ArmDeveloperEcosystem/arm-learning-paths

ClaudioMartino-arm · 2025-12-10T15:37:21Z

Before submitting a pull request for a new Learning Path, please review Create a Learning Path

I have reviewed Create a Learning Path

Please do not include any confidential information in your contribution. This includes confidential microarchitecture details and unannounced product information.

I have checked my contribution for confidential information

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of the Creative Commons Attribution 4.0 International License.

Added draft status and cascade settings to the index file.

blapie

Looking good and reads very well.

blapie · 2025-12-17T07:39:35Z

+FEXPA can be used to rapidly perform the table lookup. With this instruction a degree-2 polynomial is sufficient to obtain the same accuracy of the implementation we have seen before:
+
+```C
+svfloat32_t lane_consts = svld1rq(pg, ln2_lo); // Load only ln2_lo


Do you need ld1rq here? It's confusing because the comment says that only ln2_lo is loaded.
If ld1rq is used, why is it not loading ln2_hi too?

blapie · 2025-12-17T09:32:52Z

+---
+
+## Conclusion
+The SVE2 FEXPA instruction can speed-up the computation of the exponential function by implementing Look-Up and bit manipulation. 


SVE rather than SVE2?

blapie · 2025-12-17T09:34:28Z

+
+Arm introduced in SVE an instruction called FEXPA: the Floating Point Exponential Accelerator. 
+
+Let’s segment the IEEE754 floating-point representation fraction part into several sub-fields (Index, Exp and Remaining bits) with respective length of Idxb, Expb and Remb bits.


blapie · 2025-12-17T09:34:38Z

+
+Let’s segment the IEEE754 floating-point representation fraction part into several sub-fields (Index, Exp and Remaining bits) with respective length of Idxb, Expb and Remb bits.
+
+| IEEE754 precision       | Idxb | Expb | Remb |


blapie · 2025-12-17T09:37:23Z

+Given what we said in the previous chapters, the exponential function can be implemented with SVE intrinsics in the following way:
+
+```C
+svfloat32_t lane_consts = svld1rq(pg, constants); // Load ln2_lo, c0, c2, c4 in register


Is it worth using ld1rq in the example? It is not the most approachable choice for this audience.
I think using duplication instead would save a few lines and help understanding.
Then you can add a note that further memory-access optimisation is possible, and maybe link to the AOR versions.

Besides, using pg is wrong here: you need to use an all-true predicate.
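To illustrate the predicate point, here is a small scalar model of how a replicating quadword load treats its governing predicate: only the first four predicate elements are consulted, and inactive elements are zeroed before the quadword is replicated. The function name `ld1rq_f32_model` is hypothetical, and the model is a simplified sketch of the instruction's behaviour, not the ACLE intrinsic itself.

```c
#include <stdbool.h>
#include <stddef.h>

/* Scalar model of a replicating quadword load of 32-bit elements:
   load 4 floats under the first 4 predicate elements, zero the inactive
   ones, then replicate the quadword across a vector of `lanes` floats. */
static void ld1rq_f32_model(float *dst, size_t lanes,
                            const bool pg[4], const float src[4])
{
    float quad[4];
    for (size_t i = 0; i < 4; ++i)
        quad[i] = pg[i] ? src[i] : 0.0f;   /* inactive elements become zero */
    for (size_t i = 0; i < lanes; ++i)
        dst[i] = quad[i % 4];              /* replicate the quadword */
}
```

If the loop predicate has fewer than four active elements at the start, as can happen in a tail iteration, some constants are silently loaded as zero. With the intrinsics this is avoided by passing `svptrue_b32()` to `svld1rq`, or by broadcasting each constant with `svdup_f32` as suggested above.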

blapie · 2025-12-17T12:09:31Z

+---
+
+## Conclusion
+The SVE2 FEXPA instruction can speed-up the computation of the exponential function by implementing Look-Up and bit manipulation. 


I would generalise to "exponential functionS" (e^x, 2^x, 10^x, x^y...) by virtue of accelerating the computation of 2^n/N. Up to you
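The generalisation follows from rewriting each function as a power of two. A short derivation, where the table size N and the symbols t, n and r are notation assumed here rather than taken from the learning path:

```latex
b^{x} = 2^{t}, \qquad t = x \log_2 b
\qquad (b \in \{e,\ 2,\ 10\}), \qquad
x^{y} = 2^{\,y \log_2 x} \quad (x > 0)

n = \operatorname{round}(N t), \qquad
r = t - \frac{n}{N}, \qquad
|r| \le \frac{1}{2N}

2^{t} = 2^{n/N} \cdot 2^{r}
      = 2^{\lfloor n/N \rfloor} \cdot 2^{(n \bmod N)/N} \cdot 2^{r}
```

The factor 2^(n/N) is the part the instruction accelerates; only the scaling of the input and the short polynomial for 2^r change between functions.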

blapie · 2025-12-17T12:10:34Z

+- Fewer instructions (no back-and-forth to scalar/SVE code)
+- Potentially higher aggregate throughput (more exponentials per cycle)
+- Lower power & bandwidth (data being kept in SME engine)
+- Cleaner fusion with GEMM/GEMV workloads


Are you intentionally not mentioning SoftMax and AI applications? Mentioning them could help readers understand the use for such fusion.

Add fexpa learning path

3b9e64a

annietllnd self-assigned this Dec 16, 2025

pareenaverma added the ACM Arm Cloud Migration label Dec 16, 2025

pareenaverma added this to Arm Learning Paths Roadmap Dec 16, 2025

pareenaverma moved this to In Progress in Arm Learning Paths Roadmap Dec 16, 2025

pareenaverma added awaiting_tech_review tech_review and removed awaiting_tech_review labels Dec 16, 2025

Set draft status for FEXPA learning path

57b4588

Added draft status and cascade settings to the index file.

pareenaverma merged commit 9a0026d into ArmDeveloperEcosystem:main Dec 16, 2025
1 check failed

blapie reviewed Dec 17, 2025

View reviewed changes

pareenaverma assigned madeline-underwood Jan 5, 2026

pareenaverma added editorial_review and removed tech_review labels Jan 5, 2026

madeline-underwood assigned jasonrandrews and unassigned annietllnd and madeline-underwood Jan 10, 2026

madeline-underwood added ready_to_publish Ready for final review and publish and removed editorial_review labels Jan 10, 2026

jasonrandrews moved this from In Progress to Done in Arm Learning Paths Roadmap Jan 17, 2026

jasonrandrews added publish and removed ready_to_publish Ready for final review and publish labels Jan 17, 2026

jasonrandrews removed their assignment Jan 17, 2026


pareenaverma merged 2 commits into ArmDeveloperEcosystem:main from ClaudioMartino-arm:main

6 participants

