Fix and optimize StateMonitor template #201
denisalevi added commits referencing this issue on Apr 14, 2021, Aug 4, 2021, Aug 6, 2021, and Mar 31, 2022, including:
- "Fixes bug in #201, without optimizing it yet."
- "Now it actually tests something instead of just failing to notify about issue #201 (which is solved now)."
StateMonitor only works for <1024 recorded neurons, and the implementation seems rather inefficient.
This is a follow-up issue from #50. Here are my relevant comments from that issue:
The StateMonitor is also not working for many neurons. Currently the StateMonitor kernel is called with 1 block and as many threads as there are neurons (or synapses etc.) to record. So if there are more neurons than the maximum number of threads per block, the kernel fails to launch.

Originally posted by @denisalevi in #50 (comment)
And the global memory writes are not coalesced. Currently we have a 2D data structure of dimensions indices x record_times (a vector of vectors) for each variable monitor, and we fill it in the kernel with one thread per recorded index. For coalesced writes we could just "transpose" the monitor data structure so that threads with consecutive indices write to consecutive memory addresses.
We might have to re-sort the monitor at the end, though, since it might then not fit the format that Brian expects to read back.
Originally posted by @denisalevi in #50 (comment)
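A sketch of the "transposed" layout idea, using a flat array instead of the vector-of-vectors structure for simplicity; the kernel name, parameters, and indexing scheme are assumptions for illustration, not brian2cuda's generated code:

```cuda
// Current layout (conceptually): monitor_data[recorded_index][time_step],
// i.e. thread i writes monitor_data[i * n_timesteps + t], so consecutive
// threads hit addresses n_timesteps apart -> uncoalesced writes.
//
// Transposed layout: monitor_data[time_step][recorded_index], so consecutive
// threads write consecutive addresses within one time step -> coalesced.
__global__ void record_transposed(int n_indices, int t,
                                  const int* indices,
                                  const double* source_var,
                                  double* monitor_data)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n_indices)
        return;
    // row t holds the values of all recorded indices at this time step;
    // threads i and i+1 write adjacent addresses -> coalesced write
    monitor_data[t * n_indices + i] = source_var[indices[i]];
}
```

With this layout, getting back to the indices x record_times format that Brian expects would require transposing (or re-sorting) the recorded data once at the end of the run, which is the caveat mentioned above.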