-
Notifications
You must be signed in to change notification settings - Fork 64
For discussion: initial configuration of Codee Fortran formatter with examples (src/*.F90)
#707
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
| ColumnLimit: 120 | ||
| CommentDirectivePrefixes: [] | ||
| DisabledDirectivePrefixes: [] | ||
| IndentSize: 2 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
To be discussed
NEPTUNE and UFS use 2 spaces for indentation. The CCPP framework code in src uses 3 at the moment. This particular change is responsible for most of the differences in this PR.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We currently have issues with lines that are too long so while this will not help a huge amount but it's at least in the correct direction.
| FirstLineFit: FitIfPossible | ||
| BreakBeforeBinaryOperators: true | ||
| Casing: | ||
| Identifiers: Lowercase # Preserve |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
NEPTUNE uses Lowercase, "Preserve" is the codee default
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Six of one, half of the other in the UFS.
My OCD is suggesting that we should adopt a convention and apply across UFS, but "Preserve" is fine for now
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm fine with whatever so perhaps lowercase is best to appease the NEPTUNE?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lowercase is nice, but whatever works for SIMA and the UFS will do. Within NEPTUNE, we had to make an exception for a nuopc subdirecty, because esmf/nuopc uses CamelCase extensively, and the notorious long names in esmf code become unreadable unless you add underscores. Long story short, no strong preference here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this just for the Fortran bits of the framework? If so, we can just make sure it is readable in lowercase.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, only Fortran.
| FixedFormLabelAlignment: Right | ||
| ContinuationIndentSize: DoubleIndentSize | ||
| DoubleColonSeparator: AddAlways | ||
| EndOfLineNormalization: Unix # Autodetect |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unix is NEPTUNE, Autodetect is Codee default
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I looked at the documentation and don't fully understand this one?
(Maybe you or someone could explain (slowly) to me at out meeting tomorrow?)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I need to look this up, but as far as we all are concerned, Unix should be the correct choice.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't actually know if I've ever seen a case that mixes line endings in one file (as described in the codee documentation). If you use "auto-detect", it will use the first line ending across the entire file. We should definitely use "Unix" here so that all line endings, including Windows line endings for entire files/parts of files are converted to Unix.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unix please!
| LeftParenthesisKeyword: OnlyLeading | ||
| RightParenthesisExpression: NoLeading | ||
| RightParenthesisGeneric: NoLeading | ||
| RightParenthesisKeyword: OnlyTrailing # NoLeading |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
OnlyTrailing is NEPTUNE, NoLeading is Codee default
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As expected, UFS is all over the place wrt "SpacesAroundOperators".
(I'm super impressed that NEPTUNE is following a standard!)
I do love the idea of adopting this (more granular?) level of code formatting.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No strong feeling but squeezing spaces makes lines shorter.
| "src/ccpp_types.F90:free" | ||
| ) | ||
|
|
||
| for entry in "${files[@]}"; do |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is overly complicated and not really needed for ccpp-framework. Just for initial testing/demonstration purposes. This file will be either before this PR is merged, or at the latest once codee is integrated in GitHub actions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is the idea that we will pass any file modified in the current PR through the formatter as part of the PR?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My suggestion is to format all existing Fortran files in the repository as part of the PR. In a next step, implement GitHub actions. In a third step, integrate with capgen and make sure that the auto-generated code also complies with the formatting rules.
| # requested 2025/12/08 using the Codee Online Form | ||
| IndentExceptions: | ||
| ModuleContains: IndentBeforeAndAfter | ||
| Comments: IndentIfAlreadyIndented # Indent |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
NEPTUNE is "Indent", here I am using "IndentIfAlreadyIndented"
Also note the comment above for ModuleContains exceptions and the missing support for TypeContains/FunctionContains etc exceptions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It's because of the CCPP metadata hooks in the Fortran code that are typically not indented. I think capgen's parser is fine if they get indented, though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm for consistency so I prefer 'Indent'.
dustinswales
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@climbfuji Thanks for incorporating the code formatter here!
I love the idea, we just need to decide on a few details and incorporate this into our workflows.
As a github action, running the formatter could be invoked either diagnostically (report format violations) or prognostically (fix format violations).
| FirstLineFit: FitIfPossible | ||
| BreakBeforeBinaryOperators: true | ||
| Casing: | ||
| Identifiers: Lowercase # Preserve |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Six of one, half of the other in the UFS.
My OCD is suggesting that we should adopt a convention and apply across UFS, but "Preserve" is fine for now
| # requested 2025/12/08 using the Codee Online Form | ||
| IndentExceptions: | ||
| ModuleContains: IndentBeforeAndAfter | ||
| Comments: IndentIfAlreadyIndented # Indent |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| FixedFormLabelAlignment: Right | ||
| ContinuationIndentSize: DoubleIndentSize | ||
| DoubleColonSeparator: AddAlways | ||
| EndOfLineNormalization: Unix # Autodetect |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I looked at the documentation and don't fully understand this one?
(Maybe you or someone could explain (slowly) to me at out meeting tomorrow?)
| LeftParenthesisKeyword: OnlyLeading | ||
| RightParenthesisExpression: NoLeading | ||
| RightParenthesisGeneric: NoLeading | ||
| RightParenthesisKeyword: OnlyTrailing # NoLeading |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As expected, UFS is all over the place wrt "SpacesAroundOperators".
(I'm super impressed that NEPTUNE is following a standard!)
I do love the idea of adopting this (more granular?) level of code formatting.
| #!/usr/bin/env bash | ||
|
|
||
| files=( | ||
| "src/ccpp_constituent_prop_mod.F90:free" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@climbfuji Is the "free" suffix here to use "free formatted" version of Codee?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, I had some logic in that script previously that would use this, it's no longer there because not needed.
Since the script won't be merged, I didn't bother taking this out.
| "src/ccpp_types.F90:free" | ||
| ) | ||
|
|
||
| for entry in "${files[@]}"; do |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is the idea that we will pass any file modified in the current PR through the formatter as part of the PR?
| FirstLineFit: FitIfPossible | ||
| BreakBeforeBinaryOperators: true | ||
| Casing: | ||
| Identifiers: Lowercase # Preserve |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm fine with whatever so perhaps lowercase is best to appease the NEPTUNE?
| Comma: OnlyTrailing | ||
| Concat: Both | ||
| DoubleColon: Both |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we remove the enforcement of Comma and DoubleColon white spacing?
We do this kind of thing (lining up intents, function arguments, etc) in SIMA quite often (which we prefer for ease of readability):
subroutine whatever(argument1, arg2, a3)
integer, intent(in) :: argument1
real(kind_phys), intent(in) :: arg2
integer, intent(out) :: a3
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is a very good reason to not line up intents. If the longest line changes, then every single line has to change, even though there are no code changes. This increases the possibility of merge conflicts and make changes/pull requests unnecessarily longer. This is the reason why the code default is single whitespace, and why we've adopted it in NEPTUNE.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dustinswales @gold2718 do you have a preference on this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While I am sympathetic to the argument that the one-space rule minimizes potential merge conflicts and PR simplicity (one of the goals of these tools), at the end of the day, the code exists for its human users and they need to be able to read it to understand it. I know at NCAR, the users (scientists and RSEs) strongly preferred lining up the colons to make it easy to pick out the variables. Who is the customer here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We talked about this today in the meeting; we agreed that aligning the colons doesn't need to be done for the auto-generated code, it's enough to do this for the few handwritten Fortran files in the repository. Thus, we just "preserve" the whitespaces on a per-file basis.
And importantly, host models can do whatever they want for the rest of their code ...


Initial configuration of Codee Fortran formatter with examples (
src/*.F90) - for discussionThis PR adds an initial
.codee-formatconfiguration file for the free Codee Fortran formatter and a temporary scriptrun_codee_tmp.shto get started with the tool (note that the day-to-day usage will be much simpler than in the script). The PR also demonstrates the effect of the formatting rules for the four Fortran source files insrc/.The goal of this PR is to provide the necessary information to decide on the formatting rules we want to use. Once we have agreed on the format, we can update the
src/*.F90files as needed and merge this PR. The integration with GitHub actions and the simplified usage on the command line will come after that. In a 3rd PR, we will work on a tighter integration of the codee format config with the capgen Fortran write to make sure that the auto-generated code also complies with the formatting rules.The majority of the diffs are because NEPTUNE (and UFS) use two whitespaces as indentation, whereas the files in
src/currently use 3 whitespaces. We can change the codee config, but every space we save makes the lines shorter. To see the remaining differences, please select "Hide whitespaces" when looking at the files changed.User interface changes?: No
Working toward #703
Testing: No changes to the tests - GitHub actions CI tests passed
test removed:
unit tests:
system tests:
manual testing: