run benchmarks for 24h simulation time #844

juliasloan25 · 2024-06-08T02:50:28Z

Purpose

The benchmark runs have been set to run for 12 hours, but it looks like there's still some variability in SYPD at that point. Since we're iterating on the table output itself less now, it's probably worth it to increase the simulation length from 12 hours to 1 day.

The 2 original runs, AMIP and ClimaAtmos with diagnostic EDMF, seem to have a stable SYPD after 12 hours, but the newest ClimaAtmos without diagnostic EDMF seems to still have variability at 12 hours.

To-do

Content

I have read and checked the items on the review checklist.

Sbozzolo · 2024-06-10T16:05:51Z

Could you please quantify the variability?

There shouldn't be much for a 12h run and we should understand why things fluctuate if they do.

juliasloan25 · 2024-06-10T19:30:30Z

Could you please quantify the variability?

There shouldn't be much for a 12h run and we should understand why things fluctuate if they do.

Here are the SYPDs for the 3 runs [coupled, atmos with diag. edmf, atmos without diag. edmf], in 3 builds from the last week all using the same package versions and no performance changes in ClimaCoupler:

build #160: [1.0674, 1.1146, 4.85]
build #164: [1.0844, 1.0839, 4.972]
build #165: [1.0693, 1.0973, 4.8936]

In Atmos, @szy21 has seen that the output SYPDs sometimes take up to 24 hours of simulation time to converge to the number that we take to be the accurate measurement, so we may need to increase our runtime here too

Sbozzolo · 2024-06-10T20:16:14Z

The difference is less than 2 %. It is very reasonable variability to have and I don't think we should be concerned with reducing it further. Such variability could even be due to the physical temperature of the device or with how processes are distributed by the operating system.

If we want to have a really accurate measurament, we would have to run a statistically significant number of runs (as we do for the bucket in ClimaLand) and take statistics out of it.

szy21 · 2024-06-10T20:18:41Z

In the atmos now we run most of the simulations for 12 hours for scaling. The difference I saw is also within 2% so I think it's ok?

juliasloan25 · 2024-06-12T00:54:25Z

Okay, I won't change it here then

run benchmarks for 24h

b78f2f4

juliasloan25 closed this Jun 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

run benchmarks for 24h simulation time #844

run benchmarks for 24h simulation time #844

juliasloan25 commented Jun 8, 2024 •

edited

Loading

Sbozzolo commented Jun 10, 2024

juliasloan25 commented Jun 10, 2024

Sbozzolo commented Jun 10, 2024 •

edited

Loading

szy21 commented Jun 10, 2024

juliasloan25 commented Jun 12, 2024

run benchmarks for 24h simulation time #844

run benchmarks for 24h simulation time #844

Conversation

juliasloan25 commented Jun 8, 2024 • edited Loading

Purpose

To-do

Content

Sbozzolo commented Jun 10, 2024

juliasloan25 commented Jun 10, 2024

Sbozzolo commented Jun 10, 2024 • edited Loading

szy21 commented Jun 10, 2024

juliasloan25 commented Jun 12, 2024

juliasloan25 commented Jun 8, 2024 •

edited

Loading

Sbozzolo commented Jun 10, 2024 •

edited

Loading