900 – Continue work on Atomics extension

Bug 900 - Continue work on Atomics extension

Summary: Continue work on Atomics extension

Status:	DEFERRED

Alias:	None

Product:	Libre-SOC's first SoC
Classification:	Unclassified
Component:	Specification (show other bugs)
Version:	unspecified
Hardware:	PC Linux

Importance:	--- enhancement
Assignee:	Jacob Lifshay

URL:	https://libre-soc.org/openpower/atomics

Depends on:	236
Blocks:
	Show dependency tree / graph

Reported:	2022-08-02 06:18 BST by Jacob Lifshay
Modified:	2023-10-29 07:00 GMT (History)
CC List:	3 users (show)

See Also:	236
NLnet milestone:	Future
total budget (EUR) for completion of task and all subtasks:	0
budget (EUR) for this task, excluding subtasks' budget:	0
parent task for budget allocation:
child tasks for budget allocation:
The table of payments (in EUR) for this task; TOML format:

Attachments
Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Jacob Lifshay 2022-08-02 06:18:14 BST

Continuation of work started in https://bugs.libre-soc.org/show_bug.cgi?id=236

Blocked on obtaining more budget for work.

Comment 1 Jacob Lifshay 2023-07-25 00:51:30 BST

I just thought of a way to add atomic ops that don't need memory model work: we simply define them in terms of the existing instruction sequences for the atomic ops. this also means they are trivially interoperable with all existing atomic ops.

e.g. relaxed fetch_add is currently:
https://rust.godbolt.org/z/eP974v3cf
# addr in r3, addend in r4, result in r5
.LBB0_1:
        lbarx 5, 0, 3
        add 6, 4, 5
        stbcx. 6, 0, 3
        bne     0, .LBB0_1

so amoadd r5, r4, r3 would be defined to be equivalent to the above code (except not modifying CR0 or r6)

Comment 2 Jacob Lifshay 2023-07-25 00:54:41 BST

(In reply to Jacob Lifshay from comment #1)
> I just thought of a way to add atomic ops that don't need memory model work:
> we simply define them in terms of the existing instruction sequences for the
> atomic ops.

this saves us a lot of work and saves OPF and IBM a lot of work verifying the new instructions work in the memory model, the only problem is they may be slower than necessary for C++11's memory model.

Comment 3 Jacob Lifshay 2023-10-29 06:51:25 GMT

LLVM may be changing how it lowers relaxed loads to include a dummy branch:
https://discourse.llvm.org/t/rfc-strengthen-relaxed-atomics-implementation-behind-mstrict-rlx-atomics-flag/74473

We should take the outcome of that into account when designing our atomic instructions.