[XLA:MSA] Fix the AllocationRequest for window prefetch #21736

copybara-service · 2025-01-23T00:37:50Z

[XLA:MSA] Fix the AllocationRequest for window prefetch

We currently have two performance issues in window prefetch.

The first is that we specified the created WindowPrefetchedAllocation to consume CopyResources by using non-zero shape. This could consume all resources and interfere with the prefetching decision of other tensors. In fact, we don't have prefetching implemented yet, so we could specify using zero CopyResource.

Another issue is that the generated allocation from window prefetch currently spans too long in time. Its earliest prefetch time is set to the operand's define time, and its end is the use time. We should keep the earliest prefetch time to be as close as the use time as possible. This is for keeping the interference to the prefetching of other tensors to be minimal.

We updated WindowPrefetch() to simply just allocate chunk for the exposed span vmem at this moment. Since we don't call Prefetch() from WindowPrefetch(), we can simplify the data structure of AllocationRequest and PrefetchContext a bit.

We currently have two performance issues in window prefetch. The first is that we specified the created WindowPrefetchedAllocation to consume CopyResources by using non-zero shape. This could consume all resources and interfere with the prefetching decision of other tensors. In fact, we don't have prefetching implemented yet, so we could specify using zero CopyResource. Another issue is that the generated allocation from window prefetch currently spans too long in time. Its earliest prefetch time is set to the operand's define time, and its end is the use time. We should keep the earliest prefetch time to be as close as the use time as possible. This is for keeping the interference to the prefetching of other tensors to be minimal. We updated WindowPrefetch() to simply just allocate chunk for the exposed span vmem at this moment. Since we don't call Prefetch() from WindowPrefetch(), we can simplify the data structure of AllocationRequest and PrefetchContext a bit. PiperOrigin-RevId: 718439738

copybara-service bot force-pushed the test_718439738 branch 3 times, most recently from 0e7d18d to 4de2457 Compare January 24, 2025 23:59

copybara-service bot force-pushed the test_718439738 branch from 4de2457 to 2f1d65c Compare January 25, 2025 04:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[XLA:MSA] Fix the AllocationRequest for window prefetch #21736

[XLA:MSA] Fix the AllocationRequest for window prefetch #21736

copybara-service bot commented Jan 23, 2025 •

edited

Loading

[XLA:MSA] Fix the AllocationRequest for window prefetch #21736

Are you sure you want to change the base?

[XLA:MSA] Fix the AllocationRequest for window prefetch #21736

Conversation

copybara-service bot commented Jan 23, 2025 • edited Loading

copybara-service bot commented Jan 23, 2025 •

edited

Loading