Add food affordability data #1381

pabloarosado · 2023-07-25T13:20:34Z

Add steps for World Bank's Food prices for nutrition dataset.
I also made a small improvement to etl.harmonize: the region code is now included as another alias.
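
As a rough illustration of the harmonize change (a sketch only, not the actual etl.harmonize code; `build_alias_lookup` and the sample region rows are hypothetical), treating the code as one more alias could look like this:

```python
def build_alias_lookup(regions):
    """Map every alias, plus the region code itself, to the harmonized name."""
    lookup = {}
    for region in regions:
        # The name and its listed aliases resolve to the harmonized name.
        candidates = [region["name"], *region.get("aliases", [])]
        # The improvement described above: the code is treated as another alias.
        candidates.append(region["code"])
        for alias in candidates:
            lookup[alias.strip().lower()] = region["name"]
    return lookup


# Hypothetical sample rows, for illustration only.
regions = [
    {"code": "FRA", "name": "France", "aliases": ["French Republic"]},
    {"code": "SAS", "name": "South Asia (WB)", "aliases": []},
]
lookup = build_alias_lookup(regions)
print(lookup["fra"])
print(lookup["sas"])
```

With the code in the lookup, a dataset that identifies entities only by code can be matched without adding each code as an alias by hand.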

paarriagadap

Looks good! I mostly added comments in the metadata about changing the units of the monetary measures to constant international-$.

etl/steps/data/meadow/wb/2023-07-24/food_prices_for_nutrition.py

paarriagadap · 2023-07-25T14:34:03Z

etl/steps/data/garden/wb/2023-07-24/food_prices_for_nutrition.countries.json

+  "NAC": "North America (WB)",
+  "SAS": "South Asia (WB)",
+  "SSF": "Sub-Saharan Africa (WB)",
+  "UMC": "Upper-middle-income countries",


The assignment of (WB) seems inconsistent across the WB income groups: Low-income countries has it, but High-income, Upper-middle-income and Lower-middle-income do not.

Good catch! By the way, I'm assuming that their income groups are consistent (and up-to-date) in all World Bank datasets (and hence consistent with OWID), let me know if that's not a good assumption.

I assume that as well, but it will depend on when that dataset was released. If it's sufficiently old it can have a previous definition of income groups. I see they change every year.

Now that you mention it, their groups probably change dynamically from year to year, which would be consistent. I'm not 100% certain of that, though.

If they assume a definition of income groups that changes over the years, then those income groups are inconsistent with ours, because we assume the current definition of income groups. But for this dataset (with just a few years of data) I suppose it should not matter much.

Right, I wasn't sure about that. I would assume theirs is dynamic, but I don't know
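
To make the distinction in this thread concrete, here is a minimal sketch of a dynamic (year-specific) classification versus a fixed (current) one. The country, years and classifications are invented for illustration, and `group_for` and `current_group` are hypothetical helpers, not part of the ETL:

```python
# Hypothetical year-by-year classification: (country, year) -> income group.
YEARLY_GROUPS = {
    ("Atlantis", 2017): "Lower-middle-income countries",
    ("Atlantis", 2021): "Upper-middle-income countries",
}


def current_group(country, yearly_groups):
    """Return the group from the most recent year available for the country."""
    latest_year = max(year for (name, year) in yearly_groups if name == country)
    return yearly_groups[(country, latest_year)]


def group_for(country, year, yearly_groups, dynamic):
    """Dynamic: use the group valid in `year`. Fixed: always use the current group."""
    if dynamic:
        return yearly_groups[(country, year)]
    return current_group(country, yearly_groups)


dynamic_2017 = group_for("Atlantis", 2017, YEARLY_GROUPS, dynamic=True)
fixed_2017 = group_for("Atlantis", 2017, YEARLY_GROUPS, dynamic=False)
print(dynamic_2017)
print(fixed_2017)
```

Under the two conventions the same country-year lands in different groups, so aggregates for an income group only agree between sources if both use the same convention.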

paarriagadap · 2023-07-25T14:41:18Z

etl/steps/data/garden/wb/2023-07-24/food_prices_for_nutrition.countries.json

+  "BON": "Bonaire (WB)",
+  "EAS": "East Asia & Pacific (WB)",
+  "ECS": "Europe & Central Asia (WB)",
+  "HIC": "High-income countries",


In general we use the hyphen only for "lower-middle" or "upper-middle". I don't think I have seen "high-income", "low-income" or "lower-middle-income" much.

This is currently how we name income groups:

MAPPING_CLASSIFICATION = {
    "..": np.nan,  # no available classification for country-year (maybe country didn't exist yet/anymore)
    "L": "Low-income countries",
    "H": "High-income countries",
    "UM": "Upper-middle-income countries",
    "LM": "Lower-middle-income countries",
    "LM*": "Lower-middle-income countries",
}

For consistency, those are the names we should use. We could consider changing them (but it would be a significant refactor, similar to the one of East Timor).
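
A small check along these lines could flag income-group names in a countries mapping that deviate from the names quoted above. This is only a sketch: `nonstandard_income_names` is a hypothetical helper, the mapping is a hand-written excerpt rather than the real countries.json, and the "LIC" entry is an assumed example of the "(WB)" inconsistency mentioned earlier in the review.

```python
# Income group names as quoted from MAPPING_CLASSIFICATION in the comment above.
OWID_INCOME_GROUPS = {
    "Low-income countries",
    "Lower-middle-income countries",
    "Upper-middle-income countries",
    "High-income countries",
}


def nonstandard_income_names(mapping):
    """Return entries that look like income groups but do not use the standard names."""
    return {
        code: name
        for code, name in mapping.items()
        if "income" in name.lower() and name not in OWID_INCOME_GROUPS
    }


# Illustrative excerpt; the "LIC" entry is an assumption made for this example.
mapping = {
    "HIC": "High-income countries",
    "UMC": "Upper-middle-income countries",
    "LIC": "Low-income countries (WB)",
    "SAS": "South Asia (WB)",
}
flagged = nonstandard_income_names(mapping)
print(flagged)
```

Regions such as "South Asia (WB)" are left alone because the check only looks at names containing "income".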

paarriagadap · 2023-07-25T15:29:04Z

etl/steps/data/garden/wb/2023-07-24/food_prices_for_nutrition.meta.yml

+          The indicator expresses the total number of people who cannot afford an energy-sufficient diet in a given country and year. The indicator is computed by multiplying the percentage of the population in a country unable to afford a healthy diet by population data taken from the World Development Indicators (WDI) of the World Bank. A value of zero indicates a null or a small number rounded down at the current precision level.
+      cost_of_a_healthy_diet:
+        title: Cost of a healthy diet
+        unit: current PPP$/person/day


It's weird that they use these units here. It won't be of much use to compare countries, unless only the year 2017 is shown.

I see they use the ratio between this and the cost of an energy-sufficient diet below (and many other costs afterwards), which is only not wrong because the latter has data just for 2017.

This one is unclear to me. Currently in our chart we assume it's 2017$ (like all other variables), but that may not be accurate. I'll merge for now and consider restricting the chart to 2017.
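
A minimal sketch of the two ideas in this thread: restricting the data to 2017, and computing the cost ratio only where both costs exist. The rows and values are invented, `cost_of_an_energy_sufficient_diet` is an assumed column name (only `cost_of_a_healthy_diet` appears in the diff above), and both helpers are hypothetical:

```python
def restrict_to_year(rows, year=2017):
    """Keep only the rows for the given year."""
    return [row for row in rows if row["year"] == year]


def healthy_to_energy_ratio(row):
    """Ratio of the healthy-diet cost to the energy-sufficient-diet cost, or None if unavailable."""
    healthy = row.get("cost_of_a_healthy_diet")
    energy = row.get("cost_of_an_energy_sufficient_diet")  # assumed column name
    if healthy is None or energy is None or energy == 0:
        return None
    return healthy / energy


# Invented values, for illustration only.
rows = [
    {"country": "A", "year": 2017, "cost_of_a_healthy_diet": 3.0, "cost_of_an_energy_sufficient_diet": 1.5},
    {"country": "A", "year": 2018, "cost_of_a_healthy_diet": 3.2, "cost_of_an_energy_sufficient_diet": None},
]
rows_2017 = restrict_to_year(rows)
ratios = [healthy_to_energy_ratio(row) for row in rows]
print(rows_2017)
print(ratios)
```

Because the ratio is only defined for years where both costs are present, it never mixes a cost in current PPP$ from one year with a cost expressed in 2017 PPP$.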

pabloarosado added 3 commits July 24, 2023 16:11

chore: Install WB python package

999c987

feat(harmonize): Include country code as another alias in harmonize tool

9069840

feat(wb): Add snapshot, meadow, garden and grapher steps for food aff…

a54453f

…ordability dataset

pabloarosado requested a review from paarriagadap July 25, 2023 13:20

github-actions bot assigned pabloarosado Jul 25, 2023

pabloarosado marked this pull request as ready for review July 25, 2023 13:20

paarriagadap requested changes Jul 25, 2023

Implement Pablo As suggestions

4e32135

pabloarosado requested a review from paarriagadap July 26, 2023 09:49

paarriagadap approved these changes Jul 26, 2023

pabloarosado merged commit b71be83 into master Jul 26, 2023
3 checks passed

pabloarosado deleted the add-food-affordability-data branch July 26, 2023 09:54
