Add QAQC auto reporting tool #184

bemcdonnell · 2018-05-27T12:13:08Z

As we work toward making releases, I believe it would be valuable to have a tool that automatically runs the benchmarks and plots/docs the results for the new version. It doesn’t need to be exhaustive for all objects / results - maybe the top differences.

What this tool should use:

PyLaTex

Python / Matplotlib
LaTeX

What we should show:

scatter plots of totalized and average values for all object / results.
time series plots for object results (top X number of differences)
numerical fitness between benchmark and new release (some examples: Nash-Sutcliffe, Logaritmic Nash-Sutcliffe, Percent Bias, Kling-Gupta).

dickinsonre · 2018-05-27T12:20:22Z

@bemcdonnell Good ideas, it might be also good to have the min and max of all elements. It also needs better example files, the bulk of the sample files are from Extran3 and SWMM4 converted to SWMM5. The sample files, IMHO, should have more than a dozen elements. It would be nice to have sample file with everything possible in SWMM5 as a hundred nodes, links and Subcatchments so that it at least looks like a decent sample model.

bemcdonnell · 2018-05-27T12:29:53Z

@dickinsonre, for previous "official" releases of SWMM, how many benchmark models were used? And what was the process for cherry-picking results to present in the SWMM users manual?

dickinsonre · 2018-05-27T12:47:42Z

@bemcdonnell We did have about 700 test models for SWMM 5. You can see most of them on my Box account (I just made you and editor). However, most of the files on Box suffer from the same problems as these test files (the user example files do look like real networks as they came from real networks).

Cherry picking, i don't know how Lew picked them but they seem to be most of the Extran3/4 files along with some PCSWMM files. I used the PCSWMM files for SWMM4 testing back in the day.

I just checked and we about 200 test models for SWMM4, 130 hydraulics, 70 Hydrology and Water Quality. SWMM4 also had a nice Graph Block that compared the statistics of the model simulation for selected flows and depths to a user data file. Lew added may more calibration graphs but we lost the nice Statistical comparison possible in SWMM4. It is a good idea to use Python to add that back to the software for testing purposes.

Example 1: pass
Example 2: pass
Example 3: pass
Example 4: pass
Example 5: pass
Example 6: pass
extran 1: pass
extran 10: pass
extran 2: pass
extran 3: pass
extran 4: pass
extran 5: pass
extran 6: pass
extran 7: pass
extran 8a: pass
extran 8b: pass
extran 9: pass
test 1: pass
test 2: pass
test 3: pass
test 4: pass
test 5: pass
user 1: pass
user 2: pass
user 3: pass
user 4: pass
user 5: pass

dickinsonre · 2018-05-31T08:52:00Z

Hello @bemcdonnell just to follow up on your cherry picking comment. I went to the QA report on the EPA Download site and converted it to a DOC file. The testing Lew Rossman did in the really nice STORM WATER MANAGEMENT MODEL QUALITY ASSURANCE REPORT: Dynamic Wave Flow Routing
involved three sets of data (1) the ten data sets from Extran3 and SWMM4, (2) SWMM5 test data sets (Challenge Test Cases) and (3) User Test Cases (for the most part these come from CDM Cini). I will also mention that all of these cases are included on this GITHUB as part of the NRTEST suite. Does this answer your question about Cherry Picking?

Routing Models 6
Testing Procedure 13
Extran Manual Test Cases 15
Challenge Test Cases 53
User-Supplied Test Cases 72
Summary and Conclusions 110
References 111
Appendix A. SWMM 4 Routing Models 112
Appendix B. Test Data Sets 115

bemcdonnell · 2018-05-31T10:43:49Z

@dickinsonre, excellent info! Thanks for diving into that!

dickinsonre · 2018-05-31T10:47:27Z

@bemcdonnell My pleasure, however, it looks like a lot of work done by Lew to make that document. It will be nice to have a complete automated testing with stats for every node, link, lid, gw, subcatchment etc etc. that can be saved as log files for future backwards and forwards testing.

michaeltryby · 2018-05-31T12:40:07Z

@dickensonre cool idea!

michaeltryby · 2018-05-31T12:55:01Z

What if we develop this then morphed it into a feature. Like a calibration report that SWMM users could run as part of their QA. That would save Engineers tons of grunt work. Perhaps help them to be less fixated on changing model results.

dickinsonre · 2018-06-05T21:09:40Z

Link for Nash-Sutcliffe, Logarithmic Nash-Sutcliffe
https://www.adv-geosci.net/5/89/2005/adgeo-5-89-2005.pdf
Link for Kling-Gupta
http://www.meteo.mcgill.ca/~huardda/articles/gupta09.pdf

dickinsonre · 2018-07-09T12:27:08Z

I have a conceptual question @bemcdonnell and @michaeltryby say you had a NS value for each node, link and subcatchment. If you have a large model this is still many numbers to view and absorb. How can one come up with one overall index of model comparison? A mean of the NS values even though the node depths and link flows while related are from two different sections of the code? Report the mean node, link and subcatchment NS value?

bemcdonnell · 2018-07-09T13:36:45Z

@dickinsonre, could always report the top offenders above some threshold in a table as well as MEAN and STD?

dickinsonre · 2018-08-18T20:31:10Z

A few other references on KGE
https://www.rforge.net/doc/packages/hydroGOF/KGE.html
http://www.smhi.net/hype/wiki/doku.php?id=start:hype_file_reference:info.txt:criteria_equations

dickinsonre · 2018-10-22T05:15:10Z

Okay, for some reason your @bemcdonnell cherry picking comment got stuck in my head! We have implemented comparison tests for all nodes, links and subcatchments. I will update my dickinsonre GitHub soon after I learn more about GIT. My son was complaining today I am using Git wrong. Thanks for your patience in this matter.

bemcdonnell added this to the v5.2.0.dev3 milestone May 27, 2018

bemcdonnell added type:tech-debt type:task Easy Easy Issue labels May 27, 2018

bemcdonnell modified the milestones: v5.2.0.dev5, v5.3.0.dev0 Jul 30, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add QAQC auto reporting tool #184

Add QAQC auto reporting tool #184

bemcdonnell commented May 27, 2018 •

edited

Loading

dickinsonre commented May 27, 2018

bemcdonnell commented May 27, 2018

dickinsonre commented May 27, 2018 •

edited

Loading

dickinsonre commented May 31, 2018

bemcdonnell commented May 31, 2018

dickinsonre commented May 31, 2018

michaeltryby commented May 31, 2018

michaeltryby commented May 31, 2018

dickinsonre commented Jun 5, 2018 •

edited

Loading

dickinsonre commented Jul 9, 2018

bemcdonnell commented Jul 9, 2018

dickinsonre commented Aug 18, 2018

dickinsonre commented Oct 22, 2018

Add QAQC auto reporting tool #184

Add QAQC auto reporting tool #184

Comments

bemcdonnell commented May 27, 2018 • edited Loading

What this tool should use:

What we should show:

dickinsonre commented May 27, 2018

bemcdonnell commented May 27, 2018

dickinsonre commented May 27, 2018 • edited Loading

dickinsonre commented May 31, 2018

bemcdonnell commented May 31, 2018

dickinsonre commented May 31, 2018

michaeltryby commented May 31, 2018

michaeltryby commented May 31, 2018

dickinsonre commented Jun 5, 2018 • edited Loading

dickinsonre commented Jul 9, 2018

bemcdonnell commented Jul 9, 2018

dickinsonre commented Aug 18, 2018

dickinsonre commented Oct 22, 2018

bemcdonnell commented May 27, 2018 •

edited

Loading

dickinsonre commented May 27, 2018 •

edited

Loading

dickinsonre commented Jun 5, 2018 •

edited

Loading