Skip to content

Commit

Permalink
add a bit more on debuggin with precise memory
Browse files Browse the repository at this point in the history
  • Loading branch information
zingale committed Nov 21, 2024
1 parent abc7917 commit 65fffd5
Showing 1 changed file with 12 additions and 3 deletions.
15 changes: 12 additions & 3 deletions sphinx_docs/source/olcf-workflow.rst
Original file line number Diff line number Diff line change
Expand Up @@ -208,7 +208,16 @@ If it doesn't crash with the trace, then try:
interrupt
bt

It might say that the memory location is not precise, to enable precise
memory, in the debugger, do:

.. prompt::
:prompts: (gdb)

set amdgpu precise-memory on
show amdgpu precise-memory

and rerun.



Expand All @@ -222,11 +231,11 @@ Workaround to prevent hangs for collectives:
export FI_MR_CACHE_MONITOR=memhooks


Some AMReX reports are that it hangs if the initial Arena size is too big, and we should do
Some AMReX reports are that it hangs if the initial Arena size is too
big, and we should do

::

amrex.the_arena_init_size=0

The arena size would then grow as needed with time. There is a suggestion that if the size is
larger than
The arena size would then grow as needed with time.

0 comments on commit 65fffd5

Please sign in to comment.