Support multithreaded profiling #353

georgeharker · 2024-12-04T23:12:48Z

As per #352 I have modified pyinstrument to support multiple threads.

It does this by separating out both timing and frames for the different threads - as such it modifies a substantial amount of the data model (usually in the form of List[frames] -> Dict[str, List[Frames]] and the equivalents in typescript.

I don't do much typescript so it's possible there are prettier ways of doing the modifications on that side.

I've modified most of(but not all) of the renderers (I don't use / have speedscope so I'm not sure of the best fix there).
I'm sure the ux could be tidied, the threads come out in arbitrary order in the timeline view and it would ideally be possible to easily toggle various threads off etc).

I'd be interested in discussing how this looks to you

lucamuscat · 2024-12-21T08:48:39Z

Hey 👋

Good work!

It seems that you still need to insert the thread_start_times field in Session.from_json & Session.combine

joerick

Thanks for sending this over. It's certainly ambitious! But it would be a good improvement to get this working. I'm still a bit confused about how this works. Do you have a sample script that illustrates how it's supposed to work?

joerick · 2025-01-01T15:18:30Z

pyinstrument/stack_sampler.py

+@dataclass
+class StackSamplerSubscriberTarget:
+    call_stack: SubscriberCallstackFn
+    event: SubscriberEventFn


I wonder if we can combine this event and async_state change concepts into one interface. It seems that they're pretty similar to me.

joerick · 2025-01-01T15:22:52Z

pyinstrument/profiler.py

+    def record_thread_start(self, thread_id: str, time: float) -> None:
+        if not self.thread_start_times:
+            self.first_start_time = time
+        self.thread_start_times[thread_id] = time - self.first_start_time


Why not store the raw start time in each, rather than the offset from the first_start_time? Generally I find it better to store truth over computed values.

I initially tried raw time but it had some complexity when trying to view the traces on the same view, i'd be happy to look at that again.

joerick · 2025-01-01T15:32:27Z

pyinstrument/profiler.py

+            )
+        if event == 'thread_start':
+            self._active_session.record_thread_start(thread_id, time)
+
    # pylint: disable=W0613
    def _sampler_saw_call_stack(


I'm confused why this method _sampler_saw_call_stack doesn't need to change? Don't we need to separate out the storage of the different threads?

The frame records get a root thread id added which allows us to reconstruct what is separate - this is done in build_call_stack

(different threads will never clash in the pythons code due to the GIL)

georgeharker · 2025-01-15T19:07:31Z

Done!

Hey 👋

Good work!

It seems that you still need to insert the thread_start_times field in Session.from_json & Session.combine

Done!

georgeharker added 6 commits December 3, 2024 10:51

Support profiling child threads

276a533

WIP threading support

94a85c0

functional threaded profiling

d8b6948

update resources

6a52a08

regen js

cd42766

update some renderers

6fa7e01

joerick reviewed Jan 1, 2025

View reviewed changes

Add thread_start_times to Session.from_json and Session.combine

88d77a5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support multithreaded profiling #353

Support multithreaded profiling #353

georgeharker commented Dec 4, 2024

lucamuscat commented Dec 21, 2024

joerick left a comment

joerick Jan 1, 2025

joerick Jan 1, 2025

georgeharker Jan 15, 2025

joerick Jan 1, 2025 •

edited

Loading

georgeharker Jan 15, 2025

georgeharker Jan 15, 2025

georgeharker commented Jan 15, 2025

Support multithreaded profiling #353

Are you sure you want to change the base?

Support multithreaded profiling #353

Conversation

georgeharker commented Dec 4, 2024

lucamuscat commented Dec 21, 2024

joerick left a comment

Choose a reason for hiding this comment

joerick Jan 1, 2025

Choose a reason for hiding this comment

joerick Jan 1, 2025

Choose a reason for hiding this comment

georgeharker Jan 15, 2025

Choose a reason for hiding this comment

joerick Jan 1, 2025 • edited Loading

Choose a reason for hiding this comment

georgeharker Jan 15, 2025

Choose a reason for hiding this comment

georgeharker Jan 15, 2025

Choose a reason for hiding this comment

georgeharker commented Jan 15, 2025

joerick Jan 1, 2025 •

edited

Loading