Improvement to Samudra v2: Larger effective resolutions via increased dilation schedule #753
alxmrs wants to merge 1 commit into
…ts to dilations schedule.
Thanks, @alxmrs. My first impression is that this is a pretty mixed bag for 1-degree, and I don't understand why we'd want a dilation of … The 1/2-degree results do look better, but the coarsest half-degree level is still 22x45, which (with a 33x33 receptive field) still gets mostly vertical padding. Basically, I would feel better about one of:
What do you think?
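The receptive-field concern above can be checked with a little arithmetic. The following is a minimal sketch, not code from this PR: it assumes 3x3 stride-1 convolutions stacked in sequence, a 360x720 half-degree grid, and four 2x downsampling steps with floor division (which reproduces the 22x45 coarsest level quoted above). The dilation schedule `(1, 2, 4, 8, 1)` is a hypothetical one chosen only because it yields a 33-wide receptive field; the schedule actually used in the PR is not stated in this thread.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field along one axis of stride-1 convolutions applied in sequence.

    Each layer with dilation d widens the field by (kernel_size - 1) * d.
    """
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def coarsest_grid(shape, levels, factor=2):
    """Grid shape after `levels` downsampling steps (floor division assumed)."""
    height, width = shape
    for _ in range(levels):
        height, width = height // factor, width // factor
    return height, width


# Hypothetical schedule (an assumption, not taken from the PR).
schedule = (1, 2, 4, 8, 1)
rf = receptive_field(3, schedule)

# Assumed half-degree input grid and number of downsampling levels.
lat, lon = coarsest_grid((360, 720), levels=4)

print(f"receptive field: {rf}x{rf}, coarsest grid: {lat}x{lon}")
print("window taller than the grid:", rf > lat)
```

Under these assumptions the 33-cell window is taller than the 22-row coarsest level, so every output position at that level has part of its window in latitude padding.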
I like your analysis, Jesse. Yeah, I'd be comfortable landing this change for 1/2 and 1/4 degree only. Maybe it'd be in our interest to tune the last hparam to be specific to each resolution. My Claude did warn me of the vertical degree padding vs receptive field issue, but it passed it off with the idea that the NN would be able to learn to deal with these boundary conditions.
I'd like to double down on this point: My plan for landing this will be to pause my experiments until I can rigorously evaluate these hparam changes with @YuanYuan98's new evaluation system. I see this change as a useful test case for seeing how well we can automate the metrics system.
I also agree with this pushback in principle. I was experimenting with a really wide breadth of levers in my initial investigation, and I could see this branch becoming a narrower search just on the optimal dilation hparams for each resolution. I mentioned this somewhere in chat, but one experiment I am especially curious about is to see if we could tune the dilations to mimic GraphCast's hierarchical icosahedral grid (in receptive field coverage). In sum, thanks for your review, and I'm happy to postpone landing this change until later. If you'd like this PR to be merged sooner, I could make a narrower contribution to make this specific to 1/2 degree Samudra.
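One way a narrower per-resolution search could be framed is sketched below. This is only an illustration under stated assumptions, not the method used in this PR: the candidate schedules are hypothetical, the coarsest grid shapes assume four 2x downsampling steps from 180x360, 360x720, and 720x1440 inputs, and the selection rule (largest receptive field that still fits within the shorter grid side) is just one possible heuristic motivated by the padding concern raised in the review.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field along one axis of stride-1 convolutions applied in sequence."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def largest_fitting_schedule(grid, candidates, kernel_size=3):
    """Pick the candidate with the largest receptive field not exceeding
    the shorter side of `grid`. Returns None if no candidate fits."""
    limit = min(grid)
    fitting = [c for c in candidates if receptive_field(kernel_size, c) <= limit]
    if not fitting:
        return None
    return max(fitting, key=lambda c: receptive_field(kernel_size, c))


# Hypothetical candidate dilation schedules (assumptions for illustration).
CANDIDATES = [
    (1, 1, 1, 1),     # receptive field 9
    (1, 2, 2, 1),     # receptive field 13
    (1, 2, 4, 2, 1),  # receptive field 21
    (1, 2, 4, 8, 1),  # receptive field 33
]

# Assumed coarsest-level grid shapes (lat, lon) per resolution.
COARSEST = {
    "1 degree": (11, 22),
    "1/2 degree": (22, 45),
    "1/4 degree": (45, 90),
}

for name, grid in COARSEST.items():
    choice = largest_fitting_schedule(grid, CANDIDATES)
    print(name, grid, "->", choice)
```

A real search would rank schedules by rollout metrics rather than by this geometric rule; the rule only narrows the candidates worth evaluating at each resolution.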
OK, thanks for the detailed response. I think I'm in favor of using this as a test for our new metrics, so let's pause for now and we can move forward once we have those results.
This hyper-parameter config improves rollouts in the deep ocean, with some regression at upper-level depths.
Full experiment results
_These are ranked by Niño R²._
(Full one degree viz comparison in this folder)
I have not yet run this at quarter-degree resolution, but I can. Happy to add more results or collect more data to see if this is something we want to merge! Full experiment configurations can be found in #727. This was the most effective result, so we'll only merge this one. My goal is to land the learnings from these experiments, for which I am now beyond my time box.