Code Validator and Reference Validator enhancements #17

LakshmiDintakurty · 2017-08-02T20:33:20Z

Code Enhancements: Below are the list of code changes made for reference-ccda-validator (and code-validator). Please review the changes and let us know if you need any additional information and we can setup a meeting to discuss.

Async thread pool for code validation

VocabularyValidationService uses a pool of ValidateWorker objects to perform code validation in parallel.
Fixed Thread safety issues in Vocab validator
VTD – Object pool pre-compiled XPaths

a. VTD is an alternate high performance XML Parser.
b. The auto pilot package (org.sitenv.vocabularies.validation.pool) encapsulates a pool of precompiled XPaths based on ccdaReferenceValidatorConfig.xml file
c. The pool configuration is specified in CodeValidator.properties.
Switched from HSQL to H2 DB

a. Configurable DatabaseConnection pool in CodeValidator.properties.
b. Replaced sql.Connection with sql.DataSource.
c. All the loader classes now use DataSource to load the valuesets/codesets from files to H2 DB and at the time of loading from H2 into HashSets.
Vocab lookup using Java HashSets/HashMaps instead of JPA queries

a. Reimplemented the Repository classes to respective DAO classes (CodeSystemCodeDAO and ValueSetDAO)
b. References to the CodeRepository and VsacValueSetRepository in CodeSystemCodeValidator & ValueSetCodeValidator are replaced with respective DAOs (CodeSystemCodeDAO and ValueSetDAO)
DB Cleanup after loading codes and valuesets to HashSets

a. In order to reduce the memory footprint, after loading all the codes and valuesets to HashSets, all the data is deleted from H2DB.
b. VocabularyLoadRunner performs the cleanup based on the flag ‘cleanUpDatabaseAfterLoadingHashSets’ set in CodeValidator.properties.
c. Additional comments provided on the side effects/conditions w.r.t to setting the flag to true vs false in VocabularyLoadRunner class in the finally block of afterPropertiesSet().
ReferenceCCDAValidationService : Added new overloaded service methods to handle document validation

Ex., support optional REST request parameter SeverityLevel (allowable values: error, warning, info). if SeverityLevel=”error”, don’t evaluate “warning” and “info” Vocab Conformance Rules
Added new functionality to support R1.1 CCDA doc Vocabulary Validation (VocabularyValidationService)
Ability to upload a compressed file for validation in addition to plain xml file to improve network latency

Added functionality to support .zip format in addition to .xml file format in ReferenceCCDAValidationService. Useful when CCDA documents are large in size.
Perform MDHT & Vocab Validation of a document regardless whether the doc has schemaError or not, to better handle MU3 CERT Validator negative test cases
Support 100% accurate source CCDA file Line Numbers for each of the Vocabulary Validation error/warning/info results.
For supportability, added the Vocabulary Conformance Rule ID feature

a. A unique Vocabulary Conformance Rule ID is defined for each configured in ccdaReferenceValidatorConfig.xml. Such as < validator id="1" >
b. In each Vocabulary Validation error/warning/info result, output the conformance Rule ID it's violating. Such as "ruleID": "140"

Improved validationObjective defaulting logic for both R1.1 and R2.1 documents to use CCDATypes.NON_SPECIFIC_CCDAR2 so there's no need to parse ccdaFile up front.

…th) to CDAValidationController

Plow74 · 2017-08-08T13:23:34Z

Thanks! We will review the changes. Async is a welcomed improvement.

drbgfc · 2017-08-14T14:41:19Z

Hi Haiwen and the rest of Cerner. Sounds like a lot of great work as we had discussed on some of the MDHT calls. I am especially interested (as I suspect all would be) in the performance improvements. Good idea to submit a PR. I haven't had time to review specifically, but a note that this, "Perform MDHT & Vocab Validation of a document regardless whether the doc has schemaError or not, to better handle MU3 CERT Validator negative test cases" might be an issue. Not performing validation when there is/are schema error(s) was an ONC requirement so it's a feature not a bug. If you could expand on the reasoning I could see if they would be interested in the change.

drbgfc · 2017-12-15T18:52:56Z

Hi, what is the status on this? Last we had spoken you were going to create a new PR without conflicts and I think which defaulted to not run vocab validation if there were schema errors (via an implemented switchable config). I would suggest that this happen directly after a release. Releases happen on the last Monday of every month with the exception of Decemeber. So, the next release is Jan 22nd 2018. So a great time for the PR would be directly after that. However, I don't expect many changes in addition to what is in the repo already, so, submitting mid development phase could be fine too. Especially since the main developers, including me, will be off until after the New Year. Let me know what you think, no rush. Thanks, Dan.

LakshmiDintakurty added 3 commits July 28, 2017 11:53

Cerner's initial performance changes and other tweaks.

43e606a

Improved validationObjective defaulting logic for both R1.1 and R2.1 documents to use CCDATypes.NON_SPECIFIC_CCDAR2 so there's no need to parse ccdaFile up front.

Code Cleanup - remove commented lines of code

037c641

Moving the method doValidation_NoOAuth (/validateDocumentByFile_NoOAu…

2394184

…th) to CDAValidationController

onc-healthit locked and limited conversation to collaborators Aug 8, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code Validator and Reference Validator enhancements #17

Code Validator and Reference Validator enhancements #17

LakshmiDintakurty commented Aug 2, 2017

Plow74 commented Aug 8, 2017

drbgfc commented Aug 14, 2017

drbgfc commented Dec 15, 2017

Code Validator and Reference Validator enhancements #17

Are you sure you want to change the base?

Code Validator and Reference Validator enhancements #17

Conversation

LakshmiDintakurty commented Aug 2, 2017

Plow74 commented Aug 8, 2017

drbgfc commented Aug 14, 2017

drbgfc commented Dec 15, 2017