Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add enumeration for component_derivation_type #10

Open
kaiiam opened this issue Jun 10, 2024 · 4 comments
Open

Add enumeration for component_derivation_type #10

kaiiam opened this issue Jun 10, 2024 · 4 comments
Labels
new content request Adding or modifying content of MIFC requires discussion

Comments

@kaiiam
Copy link
Owner

kaiiam commented Jun 10, 2024

The attribute measured_compound_derivation_type (can be renamed if necessary) is likely to be an important part of the MIFC standard, denoting of a value is analytical, calculated, inferred/estimated, sourced from literature, labelled data, etc. As such I think it should be a required field.

See the FDC SR legacy csv export's food_nutrient_derivation.csv file with a longer list of possibilities (79). Some of those could probably be broken up with extra additional supporting fields to capture other metadata. For example we may want to have another source field (e.g. measured_compound_derivation_source) for if the data was from literature or external source to say where it was from doi or name of institution perhaps? Other metadata fields like specific retention factors uses/not used for concentration adjustments).

For the moment the enumeration could look something like the following:

analytical
calculated (break this down into from other foods and from a linear regression etc)
literature sourced (this could also just be analytical or another type which is sourced from elsewhere)
inferred/estimated (from ingredient list/recipe, foods physical composition, another/similar foods etc) 
label claim
assumed zero

Could also break it down to have a field from if the measured_compound value is original or if it's externally sourced. Not sure how that would fit with calculated/inferred values. Much more to be done here. Could use additional expertise.

@kaiiam
Copy link
Owner Author

kaiiam commented Jun 11, 2024

Note measured_compound_derivation_type is now renamed to component_derivation_type.

@kaiiam kaiiam changed the title Add enumeration for measured_compound_derivation_type Add enumeration for component_derivation_type Jun 20, 2024
@kaiiam kaiiam added the new content request Adding or modifying content of MIFC label Jun 26, 2024
@kaiiam
Copy link
Owner Author

kaiiam commented Jul 3, 2024

In order to accommodate food label data, component_derivation_type should probably be a required field and be used to differentiate analytical data from label claims, or other things like calculated values etc.

@kaiiam
Copy link
Owner Author

kaiiam commented Oct 2, 2024

@esteliotes Can you have a look at this and let me know if you have any thoughts on this?

@kaiiam
Copy link
Owner Author

kaiiam commented Oct 8, 2024

Example from the USDA MAFCL special interest iodine database, we encounter examples of borrowed data from another institution/organization.

This is maybe similar to literature sourced, perhaps this can be captured with an additional enum value here such as externally sourced data record. Or possibly this might require a new MIFC slot for compound_analytical_organization_name?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new content request Adding or modifying content of MIFC requires discussion
Projects
None yet
Development

No branches or pull requests

1 participant