merge Fourier method into main #23
Conversation
The macos-14 runner image doesn't support Python 3.9 anymore.
```diff
@@ -29,7 +29,7 @@ lto = true
 codegen-units = 1

 [dependencies]
-pyo3 = { version = "0.20", features = ["abi3-py38", "extension-module"] }
+pyo3 = { version = "0.20", features = ["abi3-py310", "extension-module"] }
```
It is not trivial due to the new `Bound` API, but you may want to consider upgrading to pyo3 0.22 as it significantly reduces Python-to-Rust call overheads.
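For context, a hedged sketch of the kind of change the `Bound` migration involves (the function is illustrative, not from this crate):

```rust
use pyo3::prelude::*;

// Under the 0.21+ Bound API, GIL-refs like `&PyAny` become
// `Bound<'py, PyAny>`, carrying the GIL lifetime explicitly.
#[pyfunction]
fn passthrough<'py>(obj: Bound<'py, PyAny>) -> Bound<'py, PyAny> {
    obj
}
```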
Hi Adam, it's great to hear from you! Yeah, I realized that updating the version created a few problems, and I wanted to publish the new function as quickly as possible. But now I should have some time to look into upgrading pyo3.
The benchmarks are inconclusive, but there might actually be a slight improvement 👍
```rust
rayon::ThreadPoolBuilder::new()
    .num_threads(num_threads.unwrap_or(rayon::current_num_threads()))
    .build()
```
This will add a significant cost for initializing a new thread pool on each and every call, i.e. the thread start-up cost. Maybe this could be changed to use the global/default thread pool if `num_threads` is `None`? Especially as `rayon::current_num_threads` is not the default (that would be `RAYON_NUM_THREADS` or `std::thread::available_parallelism` otherwise), but the number of threads in the global thread pool, in contrast to the function documentation. (So basically it will use the number of threads chosen when the global pool was initialized, which of course will be the default if it was initialized implicitly/automatically.)
Maybe something like

```rust
use rayon::ThreadPoolBuilder;

// Build a dedicated pool only when an explicit thread count is requested;
// otherwise run on the global pool.
fn with_num_threads<F: FnOnce() -> R + Send, R: Send>(num_threads: Option<usize>, f: F) -> R {
    if let Some(num_threads) = num_threads {
        ThreadPoolBuilder::new().num_threads(num_threads).build().unwrap().install(f)
    } else {
        f()
    }
}
```
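For illustration, a hypothetical call site (the closure body is a stand-in, not code from this PR):

```rust
use rayon::prelude::*;

// The parallel sum here is a placeholder for the actual kernel.
let total: f64 = with_num_threads(num_threads, || {
    (0..1_000_000).into_par_iter().map(|i| i as f64).sum()
});
```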
Thanks for pointing that out and thanks for the suggestion. I'll take a closer look soon.
```rust
    .and(spectrum_factor)
    .and(z1)
    .and(z2)
    .fold(0.0, |sum, k, &spectrum_factor, &z1, &z2| {
```
Since the accumulator is a scalar and hence cheap to copy/reduce, I wonder whether this would actually benefit from using a `par_fold` instead of the sequential fold?
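For reference, a hedged sketch of what that might look like with ndarray's `Zip::par_fold` (assuming the crate's `rayon` feature; the chain head is cut off in the diff, so `Zip::from(&k)` and the kernel body are placeholders):

```rust
use ndarray::Zip;

// `par_fold` takes an identity closure, a per-element fold, and an
// associative reduce; the `+` kernel stands in for the real computation.
let sum = Zip::from(&k)
    .and(&spectrum_factor)
    .and(&z1)
    .and(&z2)
    .par_fold(
        || 0.0,
        |sum, &k, &spectrum_factor, &z1, &z2| sum + k * spectrum_factor * z1 * z2,
        |a, b| a + b,
    );
```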
And thanks again :-) I'll run a few benchmarks soon to see if you are right.
Interesting, the `par_fold` version is about 50% slower in my benchmarks.
Quite possible that the overhead is unhelpful for the inner loop, which would otherwise benefit from auto-vectorization. Using `with_min_len` might be a nice compromise between granularity and overhead, but it does add another tunable.
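A hedged sketch of the idea (the data and the chunk size of 4096 are illustrative, not tuned for this crate):

```rust
use rayon::prelude::*;

// `with_min_len` stops rayon from splitting the work below 4096 elements,
// keeping each task large enough for the auto-vectorized inner loop.
let data = vec![1.0f64; 1 << 20];
let sum: f64 = data
    .par_iter()
    .with_min_len(4096)
    .map(|&x| x * x) // placeholder kernel
    .sum();
```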
Oh man, the weeks before my holidays were pretty stressful and I had to constantly jump between different things. But still, creating the tag from this branch?! Thanks for catching this!
Since the 1.0 tag was created from this branch, it should be merged into main.