`PMD` Comments Removal #889

Malmahrouqi3 · 2025-06-16T21:11:51Z

Description

This is subsequent to (#882) with the mere addition of filtering out inline comments and commented lines. For the PMD details, check out the original issue (#646)

To show off the difference:
Filter-off: pmd-old branch https://github.com/Malmahrouqi3/MFC-mo2/actions/runs/15693861383/job/44214858621
Filter-on: pmd-new branch https://github.com/Malmahrouqi3/MFC-mo2/actions/runs/15694477828/job/44216673771

.github/workflows/pmd.yml

sbryngelson · 2025-06-16T22:13:14Z

.github/workflows/pmd.yml

+                  else
+                      # Overwrite the original file with the processed content
+                      mv "$TMP_FILE" "$file"
+                      echo -e "Successfully processed $file"


Malmahrouqi3 · 2025-06-16T22:15:22Z

@sbryngelson the number of violations is identical (63) whether the filter is on or off.

sbryngelson · 2025-06-16T22:20:27Z

strange

sbryngelson · 2025-06-16T23:00:41Z

idk about the number of violations, but the violations are different. This looks like an improvement to me.

sbryngelson · 2025-06-16T23:04:28Z

.github/workflows/pmd.yml

+                      -e '/^[ \t]*[cC*dD]/s/.*//' \
+                      -e 's/([^"'\'']*("[^"]*"[^"'\'']*|'\''[^'\'']*'\''[^"'\'']*)*[^"'\'']*)[!].*$/\1/' \
+                      "$file" > "$TMP_FILE"
+


also remove line breaks: https://chatgpt.com/s/t_6850a2d138c0819187272dc044cfb208

sed -E ' :a # ① label for looping /&$/ { # ② if the *current* line ends with “&” N # read the *next* line into the pattern space s/&[[:space:]]*\n[[:space:]]*&/ / # ③ delete the trailing “&”, # the following newline, the # leading “&” on the next line, # and join them with one space ba # ④ repeat until the next line no longer starts # with “&” (handles runs of many “&” lines) } s/&//g # ⑤ finally strip any stray “&” left in the file ' input.txt > output.txt

(i haven't checked to see if this works myself). PS this will mess up the line numbers but whatever.

this might break for openacc line continuations which have something like

!$acc <......> & !$acc <......>

idk what it will do to code that looks like this.

It deems them as comments as far as I know

supposedly this will work for the acc statements but the first command might break acc w/ line continuation. i would run the acc one first, then the one above

sed -E ' :a /[[:space:]]*&[[:space:]]*$/ { N; s/[[:space:]]*&[[:space:]]*\n[[:space:]]*/ /; ba } s/^[[:space:]]*!\$acc[[:space:]]*/!$acc / :b s/[[:space:]]+!\$acc[[:space:]]*/ /g tb ' source.f90 > clean.f90

It deems them as comments as far as I know

it has no notion of comments, it just looks at text. i'm trying to make the text as simple as possible, and having line continuations in different places is not a meaningful code difference so we should remove them before CPD

There were some syntax errors and I got rid off em. Also I lowered the number of tokens=20 kinda semi-helpful cuz at 40 we only see big blocks mostly that are barely affected by comments. Further, I added these flags to avoid dummy errors/violations. --no-fail-on-violation \--no-fail-on-error so the test will fail iff pmd.yml itself is messed up - syntax wise.

Current implementation

sed -E ' :a /&$/ { N s/&[[:space:]]*\n[[:space:]]*&/ / ba } s/&//g ' "$file" | \ sed -E \ -e '/^\s*!/s/.*//' \ -e '/^[cC*dD]/s/.*//' \ -e '/^[ \t]*[cC*dD]/s/.*//' \ -e 's/([^"'\'']*("[^"]*"[^"'\'']*|'\''[^'\'']*'\''[^"'\'']*)*[^"'\'']*)[!].*$/\1/' \ > "$TMP_FILE"

(i haven't checked to see if this works myself). PS this will mess up the line numbers but whatever.

You can just copy and paste the redundant lines and search them up. It ain't gonna be a wild hunting job.

sbryngelson · 2025-06-17T02:27:22Z

something isn't right with concatenating the line continuations because this is a repeated pattern in pmd:

                                q_sf(j, k, l) = q_sf(j, k, l) 
                                                + q_prim_vf0(mom_idx%beg)%sf(j, k, l)*fd_coeff_x(r, j)* 
                                                q_prim_vf0(mom_idx%beg)%sf(r + j, k, l) 
                                                + q_prim_vf0(mom_idx%beg + 1)%sf(j, k, l)*fd_coeff_y(r, k)* 
                                                q_prim_vf0(mom_idx%beg)%sf(j, r + k, l)

if they were concatted then this would be on one line

Malmahrouqi3 · 2025-06-17T22:08:35Z

I guess it should now presumably detect more duplicate lines if they are exactly the same operation but split out into lines differently.

q_sf(j, k, l) = q_sf(j, k, l)+q_prim_vf0(mom_idx%beg)%sf(j, k, l)*fd_coeff_x(r, j)*q_prim_vf0(mom_idx%beg)%sf(r+j, k, l)+q_prim_vf0(mom_idx%beg+1)%sf(j, k, l)*fd_coeff_y(r, k)*q_prim_vf0(mom_idx%beg)%sf(j, r+k, l)+q_prim_vf0(mom_idx%end)%sf(j, k, l)*fd_coeff_z(r, l)*q_prim_vf0(mom_idx%beg)%sf(j, k, r+l)/y_cc(k)

sbryngelson · 2025-06-17T22:19:44Z

yes that's what i was thinking. you could even strip spaces (maybe?)

sbryngelson · 2025-06-17T22:19:57Z

this is actually a very valuable tool!

Malmahrouqi3 · 2025-06-17T22:47:34Z

I left behind = .or. .and. and few subtle things.
Other than that, all spaces should be taken off around math/comparison operators, inside indexing parentheses and brackets.

sbryngelson · 2025-06-17T22:52:14Z

I left behind = .or. .and. and few subtle things. Other than that, all spaces should be taken off around math/comparison operators, inside indexing parentheses and brackets.

yes agreed. so you already did this or not yet?

Malmahrouqi3 · 2025-06-17T22:54:01Z

yup, you can check out the last commit PMD check.

Malmahrouqi3 · 2025-06-17T22:55:11Z

Filter Full Implementation

                  sed -E '
                    # First handle & continuation style (modern Fortran)
                    :ampersand_loop
                    /&[[:space:]]*$/ {
                      N
                      s/&[[:space:]]*\n[[:space:]]*(&)?/ /g
                      tampersand_loop
                    }

                    # Handle fixed-form continuation (column 6 indicator)
                    :fixed_form_loop
                    /^[[:space:]]{0,5}[^[:space:]!&]/ {
                      N
                      s/\n[[:space:]]{5}[^[:space:]]/ /g
                      tfixed_form_loop
                    }

                    # Remove any remaining continuation markers
                    s/&//g

                    # Normalize spacing - replace multiple spaces with single space
                    s/[[:space:]]{2,}/ /g

                    # Remove spaces around mathematical operators
                    s/[[:space:]]*\*[[:space:]]*/*/g
                    s/[[:space:]]*\+[[:space:]]*/+/g
                    s/[[:space:]]*-[[:space:]]*/-/g
                    s/[[:space:]]*\/[[:space:]]*/\//g
                    s/[[:space:]]*\*\*[[:space:]]*/\*\*/g

                    # Remove spaces in common Fortran constructs (array indexing, function calls)
                    s/\([[:space:]]*([^,)[:space:]]+)[[:space:]]*,/(\1,/g      # First argument
                    s/,[[:space:]]*([^,)[:space:]]+)[[:space:]]*,/,\1,/g       # Middle arguments
                    s/,[[:space:]]*([^,)[:space:]]+)[[:space:]]*\)/,\1)/g      # Last argument
                    s/\([[:space:]]*([^,)[:space:]]+)[[:space:]]*\)/(\1)/g     # Single argument

                    # Remove spaces around brackets and parentheses
                    s/\[[[:space:]]*/</g
                    s/\[[[:space:]]*/>/g
                    s/\[[[:space:]]*/</g
                    s/[[:space:]]*\]/]/g
                    s/\([[:space:]]*/(/g
                    s/[[:space:]]*\)/)/g

                    # Remove spaces around comparison operators
                    s/[[:space:]]*<=[[:space:]]*/</g
                    s/[[:space:]]*>=[[:space:]]*/>/g
                    s/[[:space:]]*<[[:space:]]*/</g
                    s/[[:space:]]*>[[:space:]]*/>/g
                    s/[[:space:]]*==[[:space:]]*/==/g

                    # Remove full-line comments
                    /^\s*!/d
                    /^[cC*dD]/d
                    /^[ \t]*[cC*dD]/d

                    # Remove end-of-line comments, preserving quoted strings
                    s/([^"'\''\\]*("[^"]*")?('\''[^'\'']*'\''?)?[^"'\''\\]*)[!].*$/\1/
                  ' "$file" > "$TMP_FILE"

Malmahrouqi3 and others added 23 commits June 12, 2025 16:57

integrated pmd into CI (MFlowCode#646)

de3a040

create rulset file

a1fe811

corrected directory

8defa9a

changed ruleset pattern typo

1fde2bc

added rules to python and fortran

0332cf1

ruleset for py

9f46e71

individual rules

8c3fb08

java rules - errorprone

cd8e2a5

java rules

4db4277

old school integration of PMD into workflow

d6d3bc8

removed Detect File Changes

54a6fc9

changed to cat to display reports

515c32a

added java compiler as dependency

4f4134a

removed something

b3ae8fa

just checking syntax

085eaa5

set env var pmd=/pmd/bin/pmd

e3626f6

quick syntax correction

f022a85

made PMD_COMMAND globally recognized

4fdb0c3

corrected package path

c7c1bdb

moved alias command under Running PMD

393d69b

comments removal

7448609

comments removal 2

36fb29e

comments removal 3

76352dd

Malmahrouqi3 requested a review from sbryngelson as a code owner June 16, 2025 21:11

Malmahrouqi3 force-pushed the CI-pmd branch from 9e425be to 76352dd Compare June 16, 2025 21:52

sbryngelson reviewed Jun 16, 2025

View reviewed changes

.github/workflows/pmd.yml Show resolved Hide resolved

sbryngelson reviewed Jun 16, 2025

View reviewed changes

Malmahrouqi3 force-pushed the CI-pmd branch from 4ca8da1 to d355280 Compare June 16, 2025 22:20

reduced number of tokens

0aa7abe

Malmahrouqi3 force-pushed the CI-pmd branch from d355280 to 0aa7abe Compare June 16, 2025 22:49

Merge branch 'master' into CI-pmd

d47b810

sbryngelson reviewed Jun 16, 2025

View reviewed changes

Malmahrouqi3 added 6 commits June 16, 2025 19:17

Update pmd.yml

52aeefc

Update pmd.yml

037c2e6

Update pmd.yml

26f3228

Update pmd.yml

9605a29

Update pmd.yml

3c01cfa

Update pmd.yml

d2fff04

Malmahrouqi3 and others added 10 commits June 17, 2025 15:40

Update pmd.yml

3bd04f3

Update pmd.yml

d4b77fb

Update pmd.yml

502ab2c

Merge branch 'master' into CI-pmd

23d44d3

Update pmd.yml

31b873c

Update pmd.yml

8cf02c0

Update pmd.yml

1008246

Update pmd.yml

cd8f908

more cleanup

ae4cbf5

tokens=20

27c9932

strip out majority of spaces

3c0682d

PMD Comments Removal #889

Are you sure you want to change the base?

PMD Comments Removal #889

Conversation

Malmahrouqi3 commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

Uh oh!

sbryngelson Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

Malmahrouqi3 commented Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sbryngelson commented Jun 16, 2025

Uh oh!

sbryngelson commented Jun 16, 2025

Uh oh!

sbryngelson Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sbryngelson Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

Malmahrouqi3 Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

sbryngelson Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

sbryngelson Jun 16, 2025

Choose a reason for hiding this comment

Uh oh!

Malmahrouqi3 Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Malmahrouqi3 Jun 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

sbryngelson commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

Malmahrouqi3 commented Jun 17, 2025

Uh oh!

Uh oh!

`PMD` Comments Removal #889

`PMD` Comments Removal #889

Malmahrouqi3 commented Jun 16, 2025 •

edited

Loading

Malmahrouqi3 commented Jun 16, 2025 •

edited

Loading

sbryngelson Jun 16, 2025 •

edited

Loading

Malmahrouqi3 Jun 16, 2025 •

edited

Loading

Malmahrouqi3 Jun 16, 2025 •

edited

Loading