
Custom implementation of weighted sampling #20

Closed
wants to merge 30 commits

Conversation

@Tortar (Collaborator) commented Oct 27, 2024

This helps the simulation time, shaving another 15% off the timings of https://github.com/bancaditalia/BeforeIT.jl/blob/main/examples/benchmark_w_matlab.jl#L24. Unfortunately it seems to slightly change the results; this should only be because the implementation differs from StatsBase.wsample.
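
For reference, here is a minimal sketch of the idea (illustrative only, not the exact code in this PR): draw a uniform threshold in [0, sum(w)) and scan the weights until the running sum passes it, so no cumulative-weight vector has to be allocated.

# Draw one item with probability proportional to its weight, without allocating.
function wsample_single_sketch(items, w)
    t = rand() * sum(w)            # uniform threshold over the total weight
    acc = zero(eltype(w))
    for i in eachindex(items, w)
        acc += w[i]
        t < acc && return items[i] # first index whose cumulative weight exceeds t
    end
    return items[end]              # guard against floating-point round-off
end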

@Tortar (Collaborator, Author) commented Oct 27, 2024

Fixed the issue with the tests; now everything works the same as before.

@Devetak self-assigned this Oct 28, 2024
@Devetak (Collaborator) commented Oct 28, 2024

Great!

@Devetak mentioned this pull request Oct 28, 2024
@AldoGl (Collaborator) commented Oct 28, 2024

Before merging I would actually make sure we find a non-negligible speedup.
I ran the benchmark_w_matlab script in the examples folder and I could not measure one.

@Tortar (Collaborator, Author) commented Oct 28, 2024

The problem is that the GC runs unreliably between runs, so to measure reliably I think we should use @benchmark or @btime. I will come back with the results using them.

@Tortar (Collaborator, Author) commented Oct 28, 2024

With

using BeforeIT
using BenchmarkTools

# We will run the model without any output to avoid the overhead of printing the results.
function run(parameters, initial_conditions, T; multi_threading = false)
    model = BeforeIT.init_model(parameters, initial_conditions, T)
    data = BeforeIT.init_data(model);
    
    for _ in 1:T
        BeforeIT.one_epoch!(model; multi_threading = multi_threading)
        BeforeIT.update_data!(data, model)
    end
    return model, data
end

parameters = BeforeIT.AUSTRIA2010Q1.parameters
initial_conditions = BeforeIT.AUSTRIA2010Q1.initial_conditions
T = 1

@benchmark run($parameters, $initial_conditions, $T) 

This is what I see

julia> @benchmark run(parameters, initial_conditions, T) # this PR
BenchmarkTools.Trial: 135 samples with 1 evaluation.
 Range (min … max):  35.383 ms … 41.835 ms  ┊ GC (min … max): 4.44% … 3.82%
 Time  (median):     36.941 ms               ┊ GC (median):    4.67%
 Time  (mean ± σ):   37.141 ms ± 944.112 μs  ┊ GC (mean ± σ):  5.08% ± 1.09%

           ▂  ▂▁▅▇ ▅▄ ▁▅▁ █▁                                    
  ▃▁▁▁▅▅▅▃▃█▆▆███████████▆██▁█▃▁▃▃▃▅▅▆▃▁▁▆▃▃▁▁▁▃▃▁▁▃▁▁▁▁▃▁▁▁▁▃ ▃
  35.4 ms         Histogram: frequency by time         40.4 ms <

 Memory estimate: 122.47 MiB, allocs estimate: 130124.

julia> @benchmark run(parameters, initial_conditions, T) # master
BenchmarkTools.Trial: 117 samples with 1 evaluation.
 Range (min … max):  40.476 ms … 47.697 ms  ┊ GC (min … max): 6.15% … 14.14%
 Time  (median):     42.503 ms              ┊ GC (median):    7.65%
 Time  (mean ± σ):   42.956 ms ±  1.570 ms  ┊ GC (mean ± σ):  8.75% ±  2.40%

              █      ▂  ▃                                      
  ▃▁▁▁▁▆▅▅▆▇▆███▇▇▅█▆█▇▅█▇▅▅▁▃▃▃▅▃▁▁▁▁▆▅▁▃▁▁▁▇▅▆▁▁▅▃▃▁▃▃▁▃▅▁▅ ▃
  40.5 ms         Histogram: frequency by time        46.7 ms <

 Memory estimate: 139.23 MiB, allocs estimate: 678914.

@Devetak (Collaborator) commented Oct 28, 2024

Confirmed, @Tortar. With the example from main.jl, without printing, the speedup for me with a single thread is 50%.
One thing before merging: in general, the function you provide should work best for small-to-medium sized weight vectors. @AldoGl do we have a larger test case from Poledna's paper?

@AldoGl (Collaborator) commented Oct 28, 2024

Unfortunately there is no medium-size default parametrisation yet. The standard one has around 8000 agents. We probably need to add one with around 100,000, which btw could be a target size for all experimentations. To generate one, it would be sufficient to use the calibration script with a custom "scale" parameter.

@Tortar (Collaborator, Author) commented Oct 28, 2024

I guess this should always be the best algorithm: I don't think there can be a case where this function is slower when the simulation is dynamic and we can't reuse the sum of the weights between different draws (which is the case here, as far as I can tell).

@Devetak (Collaborator) commented Oct 28, 2024

You are absolutely correct! I suggest we merge, since it makes the code faster.
I benchmarked the custom weighted sampling against the one from StatsBase and they take the same amount of time, so I am not sure how the BeforeIT version ends up so much faster.

using BenchmarkTools, StatsBase

function test(N)
    a = rand(N)
    b = rand(N)
    @btime wsample_single($a, $b)   # custom sampler from this PR
    @btime wsample($a, $b)          # StatsBase equivalent
end

@Tortar (Collaborator, Author) commented Oct 28, 2024

I think it is because this version is non-allocating, so the GC is triggered less often. It is probably possible to improve the StatsBase version, but I'm a bit unsure about the time it would take, because maintenance is rather slow there (I have had some PRs floating around there for months).
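
A quick way to check the allocation behaviour directly (a sketch; it assumes wsample_single from this PR is in scope):

using BenchmarkTools, StatsBase

a, w = rand(10_000), rand(10_000)
@btime wsample($a, $w)          # StatsBase version: allocates on each call
@btime wsample_single($a, $w)   # if the custom version is non-allocating, this should report 0 allocations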

@Tortar (Collaborator, Author) commented Oct 28, 2024

I have actually pushed some more improvements exploiting the unique properties that hold in this case; now it should be even a bit faster.

@Tortar (Collaborator, Author) commented Nov 1, 2024

The last commit broke the tests, but it shouldn't introduce any logical difference from the previous method: it just selects differently, with the same probability structure.

@Tortar (Collaborator, Author) commented Nov 2, 2024

@Devetak what do you think? It seems to me that we can merge this one after adjusting the tests.

@Devetak (Collaborator) commented Nov 2, 2024

> @Devetak what do you think? It seems to me that we can merge this one after adjusting the tests.

A few sparse comments:
(a) See also the comment on Issue #25; I think some of the later changes make the code less readable, so I would keep deleteat! or add a comment.
(b) The tests that fail are the ones that compare with MATLAB, hence the most important ones. We should not change those.

Btw, I think the bug is the change away from deleteat!. Since

deleteat!([a, b, c, d], 2) == [a, c, d], but

F_g[e], F_g[end] = F_g[end], F_g[e] # [a, d, c, b]
pop!(F_g)                           # [a, d, c]

In practice they are equivalent, but making it impossible to compare to MATLAB is in my opinion not worth it. That is really the most important check! How much speedup do we get from that change?
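
For reference, a runnable version of the comparison above with concrete values:

v = ["a", "b", "c", "d"]
deleteat!(v, 2)                 # ["a", "c", "d"]  order preserved, O(n), matches MATLAB

v = ["a", "b", "c", "d"]; e = 2
v[e], v[end] = v[end], v[e]     # ["a", "d", "c", "b"]  swap the element to the end...
pop!(v)                         # ["a", "d", "c"]  ...then drop it in O(1); order not preserved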

@Tortar (Collaborator, Author) commented Nov 2, 2024

It has a big impact (something like 30 seconds in the big models with scale = 0.01), but let's keep this simple and discuss possible improvements later, so I will just remove that improvement. I will add a function for it for the record. I think that nonetheless we could reproduce the results "in expectation".

@Tortar (Collaborator, Author) commented Nov 2, 2024

Now it should be okay

@Devetak (Collaborator) commented Nov 2, 2024

An idea could be to switch from swap-and-pop to deleteat! for the deterministic run. This would then mirror the MATLAB code.
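
A sketch of that idea (the helper name and keyword are hypothetical, not code from this PR): keep the fast swap-and-pop by default, but use the order-preserving deleteat! when a deterministic, MATLAB-comparable run is requested.

function remove_element!(v::Vector, i::Integer; deterministic::Bool = false)
    if deterministic
        deleteat!(v, i)          # O(n), preserves order: comparable to the MATLAB code
    else
        v[i] = v[end]            # overwrite slot i with the last element...
        pop!(v)                  # ...and drop the duplicate tail: O(1), order changes
    end
    return v
end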

@Tortar (Collaborator, Author) commented Nov 2, 2024

You are totally right. We will have to change some of the search_and_matching tests anyway, but they are not as critical, I believe, right?

@Devetak (Collaborator) commented Nov 2, 2024

Which ones? Also make a test for swap_pop.

@Tortar (Collaborator, Author) commented Nov 2, 2024

@Devetak (Collaborator) commented Nov 2, 2024

I would ask @AldoGl how the tests were constructed and go from there. It is definitely important to have comprehensive tests if we are going to work on performance.

@Tortar (Collaborator, Author) commented Nov 17, 2024

Closing in favour of #30

@Tortar closed this Nov 17, 2024