Skip to content

Commit c8e5155

Browse files
committed
PropertyComparisonProcessor: use deduplicated values for determining deviations and omissions to avoid duplicated reporting
1 parent 0addb4c commit c8e5155

File tree

2 files changed

+9
-6
lines changed

2 files changed

+9
-6
lines changed

CHANGELOG.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,10 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
66
## [Unreleased]
77

88
### Fixed
9-
* fix `PopulationComparisonProcessor`: fix measurement of **Absolute Coveredness** to not increase in case of a duplicated resource in one dataset without a corresponding resource in any other dataset
9+
* fix `PopulationComparisonProcessor`: fix measurement of **Absolute Coveredness** to not increase in case of a duplicated resource in one dataset without a corresponding resource in any other dataset
10+
11+
### Changed
12+
* changed `PropertyComparisonProcessor`: use deduplicated values for determining deviations and omissions to avoid duplicated reporting
1013

1114
## [3.1.1] - 2024-11-13
1215

abecto-core/src/main/java/de/uni_jena/cs/fusion/abecto/processor/PropertyComparisonProcessor.java

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -380,16 +380,16 @@ protected void reportDeviationsAndOmissions() {
380380

381381
protected void reportDeviationsAndOmissionsForDatasetPair(ResourcePair datasetPair) {
382382
for (String variable : variables) {
383-
Map<Resource, Map<RDFNode, Set<Resource>>> resourcesByNonDistinctValueByDataset = resourcesByNonDistinctValueByDatasetByVariable.get(variable);
383+
Map<Resource, Map<RDFNode, Set<Resource>>> resourcesByDistinctValueByDataset = resourcesByDistinctValueByDatasetByVariable.get(variable);
384384
if (theAspect.variableCoveredByDatasets(variable, datasetPair.first, datasetPair.second)) {
385-
Map<RDFNode, Set<Resource>> resourceByNonDistinctValuesOfFirstDataset = resourcesByNonDistinctValueByDataset.get(datasetPair.first);
386-
Map<RDFNode, Set<Resource>> resourceByNonDistinctValuesOfSecondDataset = resourcesByNonDistinctValueByDataset.get(datasetPair.second);
385+
Map<RDFNode, Set<Resource>> resourceByDistinctValuesOfFirstDataset = resourcesByDistinctValueByDataset.get(datasetPair.first);
386+
Map<RDFNode, Set<Resource>> resourceByDistinctValuesOfSecondDataset = resourcesByDistinctValueByDataset.get(datasetPair.second);
387387
for (Resource firstResource : correspondingResourcesByDataset.get(datasetPair.first)) {
388388
for (Resource secondResource : correspondingResourcesByDataset.get(datasetPair.second)) {
389389
Set<RDFNode> uncoveredValuesOfFirstResource =
390-
getUncoveredValuesOfResource(firstResource, secondResource, resourceByNonDistinctValuesOfFirstDataset, resourceByNonDistinctValuesOfSecondDataset);
390+
getUncoveredValuesOfResource(firstResource, secondResource, resourceByDistinctValuesOfFirstDataset, resourceByDistinctValuesOfSecondDataset);
391391
Set<RDFNode> uncoveredValuesOfSecondResource =
392-
getUncoveredValuesOfResource(secondResource, firstResource, resourceByNonDistinctValuesOfSecondDataset, resourceByNonDistinctValuesOfFirstDataset);
392+
getUncoveredValuesOfResource(secondResource, firstResource, resourceByDistinctValuesOfSecondDataset, resourceByDistinctValuesOfFirstDataset);
393393

394394
// deviation: a pair of resources with each having a value not present in the
395395
// other resource

0 commit comments

Comments
 (0)