Add synchronization for multicore #782

soyersoyer · 2025-01-03T02:15:50Z

There is currently no memory synchronization between the processor cores, which could cause problems in theory. I added memory barriers and atomic access to the cores' status.

I also suspend the cores when waiting. I thought this would make the PI a little cooler, but I haven't been able to measure it. Can someone measure it with a USB power meter?

Maybe not all of these changes are necessary. I'll get to know the barriers better.

github-actions · 2025-01-03T02:22:58Z

Build for testing:
MiniDexed_2025-01-03-0b26d6c
Use at your own risk.

probonopd · 2025-01-03T07:36:51Z

Thanks @soyersoyer.
@rsta2, maybe you could have a quick glance at these changes to advise us whether we are on the right track here? Thank you very much!

github-actions · 2025-01-03T10:37:39Z

Build for testing:
MiniDexed_2025-01-03-40e62e2
Use at your own risk.

github-actions · 2025-01-06T23:21:35Z

Build for testing:
MiniDexed_2025-01-06-9e9d74e
Use at your own risk.

soyersoyer · 2025-01-06T23:52:55Z

Barriers are not needed because they are already in the Acquire () / Release () of the spinlock of the CDexedAdapter getSamples () function.

Volatile variables are written and read with STR/LDR ARM instructions, which are atomic, and it would be enough if there were no other m_nFramesToProcess variable, for which there is no guarantee that its new value would be available to the other cores sooner than the new value of m_CoreStatus. If I understand correctly?
For std::atomic, the STLR/LDAR instruction is used for release/acquire and seq_cst (default) modes, which already ensures that the new value of m_nFrameToProcess is also available.

If volatiles remain, I think that another solution could be to omit the nFrames variable in ProcessSound (use m_nFramesToProcess instead)?
Or pass n_samples as a pointer to CDexedAdapter::getSamples and dereference it after m_SpinLock.Acquire () (DataMemoryBarrier)?

The other thing I don't know is whether Core1 can wait indefinitely if m_CoreStatus[nCore] != CoreStatusIdle is read, then it starts waiting for the interrupt, but between the read and the wait for the interrupt, m_CoreStatus changes (by Core2) and it has already received its Interrupt. Or, how can this be ensured so that it doesn't happen.

I haven't been able to measure yet whether it really consumes less this way and whether it's really worth it, I'll let you know when the meter arrives.

soyersoyer force-pushed the synch branch from 593d5d3 to b71a426 Compare January 3, 2025 10:30

soyersoyer added 2 commits January 6, 2025 23:54

add synchronization for multcore operations

b0e8f55

suspend processors while waiting

b063c93

soyersoyer force-pushed the synch branch from b71a426 to b063c93 Compare January 6, 2025 23:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add synchronization for multicore #782

Add synchronization for multicore #782

soyersoyer commented Jan 3, 2025

github-actions bot commented Jan 3, 2025

probonopd commented Jan 3, 2025

github-actions bot commented Jan 3, 2025

github-actions bot commented Jan 6, 2025

soyersoyer commented Jan 6, 2025

Add synchronization for multicore #782

Are you sure you want to change the base?

Add synchronization for multicore #782

Conversation

soyersoyer commented Jan 3, 2025

github-actions bot commented Jan 3, 2025

probonopd commented Jan 3, 2025

github-actions bot commented Jan 3, 2025

github-actions bot commented Jan 6, 2025

soyersoyer commented Jan 6, 2025