Read volume vars from file #83

twsearle · 2024-03-18T16:29:31Z

Description

We resolved #70 by reducing the memory usage of each process. Unfortunately, we found that the program spent most of its time in IO. This change reads the entirety of a volume variable in one go to reduce the number of file reads, rather than reading and broadcasting one horizontal level at a time.

Dependencies

Impact

The second cycle of the global ocean runs in MetOffice/sith/pull/271 should hopefully complete.

Checklist

twsearle · 2024-03-20T10:34:12Z

So far in my limited testing it appears this change might actually be making JOPA slower! I might withdraw this PR unless you can see anything wrong with my results. It is puzzling though, as I was pretty sure a change like this had delivered improvements in the past.

spice gl_ocn:

spice gl_ocn on this ticket takes 955 secs with 4 MPI tasks
spice gl_ocn on develop takes 835 secs with 4 MPI tasks

xc ocnd:

xc ocnd on this ticket takes 2178 secs with 16 MPI tasks
xc ocnd on develop takes 1497 secs on 16 MPI tasks

xc ocnd orca12:

xc ocnd orca12 on this ticket takes 1502 secs on 12 MPI tasks
xc ocnd orca12 on develop takes 1492 secs on 12 MPI tasks

There can be a lot of variability in runtimes on xc depending on the load of the file system I think, but still it seems that this change has either neutral impact or makes things worse.

s-good · 2024-03-20T11:21:58Z

It does seem like this change should speed things up!

This is a complete guess, but I'm wondering about the changes in src/orca-jedi/nemo_io/ReadServer.cc, where it is doing the distribution of data in memory by level previously and now all at once. Is it possible that there is some optimisation happening that makes the old version go faster? I.e. could it be fastest to read all data from file in one go and then distribute in memory by level?

twsearle · 2024-05-03T13:36:41Z

Closing this for now as we couldn't demonstrate any impact.

Read volume vars from file

f6e3bc7

twsearle self-assigned this Mar 18, 2024

twsearle requested a review from s-good March 20, 2024 10:34

Merge branch 'develop' into feature/reduce-read-io-for-volume-vars

c3e0f1b

twsearle closed this May 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Read volume vars from file #83

Read volume vars from file #83

twsearle commented Mar 18, 2024 •

edited

Loading

twsearle commented Mar 20, 2024

s-good commented Mar 20, 2024

twsearle commented May 3, 2024

Read volume vars from file #83

Read volume vars from file #83

Conversation

twsearle commented Mar 18, 2024 • edited Loading

Description

Dependencies

Impact

Checklist

twsearle commented Mar 20, 2024

s-good commented Mar 20, 2024

twsearle commented May 3, 2024

twsearle commented Mar 18, 2024 •

edited

Loading