Explore additional datasets for inclusion on the Education Dashboard #318

Open
taz-chiles opened this issue Jan 9, 2024 · 4 comments

@taz-chiles
Collaborator

taz-chiles commented Jan 9, 2024

YFF will provide links to the datasets.

Explore whether we can visualise the datasets below on the Education Dashboard. There may be some data quality issues and suppressed values, so we'll need to assess how much data processing is required and then confirm with YFF whether the datasets will be added now or pushed into the next phase of work.

  • Pupil destinations by KS4 - at local authority level, published by DfE (to add link)
  • Children in need / children in care outcomes - local authority level (to add link)
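
One rough way to size up the processing effort is to count how many cells in each value column are suppressed. This is only a sketch: the marker set ('c', 'z', 'x', empty), the `la_name` column and the sample rows are assumptions for illustration and are not taken from the actual files, so check the publication's own guidance for the markers it uses.

```python
import csv
import io

# Assumed suppression markers; confirm against the publication's guidance.
SUPPRESSION_MARKERS = {"c", "z", "x", ""}

# Made-up rows standing in for a downloaded CSV (not real data).
SAMPLE = """la_name,avg_att8,avg_p8score
Area A,46.2,0.05
Area B,c,c
Area C,44.8,z
"""


def suppression_counts(text, value_columns):
    """Count suppressed cells in each of the given columns."""
    counts = {col: 0 for col in value_columns}
    for row in csv.DictReader(io.StringIO(text)):
        for col in value_columns:
            if row[col].strip().lower() in SUPPRESSION_MARKERS:
                counts[col] += 1
    return counts


counts = suppression_counts(SAMPLE, ["avg_att8", "avg_p8score"])
print(counts)
```

A high count in a column would be a signal that the measure may need aggregating, footnoting, or deferring to the next phase.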
@taz-chiles taz-chiles added this to the Phase 3 Deadline milestone Jan 9, 2024
@gilesdring
Member

gilesdring commented Jan 25, 2024

KS4

https://explore-education-statistics.service.gov.uk/find-statistics/key-stage-4-performance

Dataset: '2223_la_data_provisional' - I think standard achievement rates come from this dataset.

Variables I think we are interested in:

  • Local authority geography
  • 'version' - Final
  • 'avg_att8' - 'Average Attainment 8 score of all pupils'
  • 'avg_p8score' - 'Average Progress 8 score of all pupils'

Dataset: '2223_la_char_data_provisional'

Variables:

  • Local authority geography
  • 'version' - Final
  • 'avg_att8' & 'avg_p8score' broken down by:
    • 'gender'
    • 'ethnicity_major'
    • 'free_school_meals'
    • 'Sen_status'
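
As a starting point, the selection above could be sketched like this: keep only rows where 'version' is Final, then pull 'avg_att8' and 'avg_p8score' per local authority for one breakdown column. The `la_name` column and the sample rows are placeholders I've made up; the real file's geography columns and category labels will need checking.

```python
import csv
import io

# Made-up rows for illustration; 'la_name' is an assumed column name.
SAMPLE = """la_name,version,gender,avg_att8,avg_p8score
Area A,Final,Boys,44.0,-0.10
Area A,Final,Girls,48.5,0.12
Area A,Provisional,Boys,43.8,-0.11
Area B,Final,Boys,45.1,0.01
"""


def final_by_breakdown(text, breakdown):
    """Map (local authority, breakdown value) to (avg_att8, avg_p8score).

    Only rows whose 'version' is 'Final' are kept.
    """
    result = {}
    for row in csv.DictReader(io.StringIO(text)):
        if row["version"] != "Final":
            continue
        key = (row["la_name"], row[breakdown])
        result[key] = (float(row["avg_att8"]), float(row["avg_p8score"]))
    return result


by_gender = final_by_breakdown(SAMPLE, "gender")
for key, scores in sorted(by_gender.items()):
    print(key, scores)
```

The same function would work for 'ethnicity_major', 'free_school_meals' or 'Sen_status' by passing a different breakdown name, assuming suppressed values have been dealt with first.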

16-18 destinations (to add on to the education dashboard)

https://explore-education-statistics.service.gov.uk/find-statistics/16-18-destination-measures

Dataset: '1618_dm_ud_202122_la_prov'

Potential variables (there may be more but this is a starting point)

  • 'fe' - Further education
  • 'he' - Higher education institution
  • 'notsust' - Education or employment not sustained
  • 'notsust_ben' - Education or employment not sustained and known to be in receipt of out of work benefits
  • 'overall' - Sustained education, apprenticeship or employment
  • 'appl2' - Level 2 apprenticeships
  • 'appl3' - Level 3 apprenticeships
  • 'appl4' - Level 4 (higher and degree level) apprenticeships
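
For the dashboard, the column codes above could be swapped for their readable labels with a simple lookup. A minimal sketch: the labels are copied from the list above, while the `relabel` helper and the sample row values are made up for illustration.

```python
# Column codes and labels as listed above.
DESTINATION_LABELS = {
    "fe": "Further education",
    "he": "Higher education institution",
    "notsust": "Education or employment not sustained",
    "notsust_ben": (
        "Education or employment not sustained and known to be "
        "in receipt of out of work benefits"
    ),
    "overall": "Sustained education, apprenticeship or employment",
    "appl2": "Level 2 apprenticeships",
    "appl3": "Level 3 apprenticeships",
    "appl4": "Level 4 (higher and degree level) apprenticeships",
}


def relabel(row):
    """Keep only the known destination columns, keyed by readable label."""
    return {
        DESTINATION_LABELS[code]: value
        for code, value in row.items()
        if code in DESTINATION_LABELS
    }


# Made-up row; real rows will carry more columns than this.
example_row = {"la_name": "Area A", "fe": "31.0", "he": "40.5", "appl2": "c"}
labelled = relabel(example_row)
print(labelled)
```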

@gilesdring
Member

Will look at this tomorrow, @taz-chiles. I assume I've put this on the right ticket! ;-)

@gilesdring
Member

Have added a download and transform script for the DfE data.

See the notebook for processing and (at the bottom) how to deal with the ks4_char data.

https://github.com/open-innovations/yff-data-pipelines/blob/main/pipelines/education/transform.ipynb

Data is in

https://github.com/open-innovations/yff-data-pipelines/tree/main/data/processed/dfe

There are, for each dataset, *.csv and *.parquet files (hint - parquet is much more efficient in size, and has indexes built in!), as well as a *.metadata.json file which lists the levels in the indexes.
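
The metadata file can be used to check a filter value before querying the data. The sketch below is only illustrative: the JSON layout (a top-level "levels" object keyed by index name), the category labels and the `check_filter` helper are all hypothetical, so adjust them to whatever the real *.metadata.json files contain.

```python
import json

# Hypothetical metadata content; the real *.metadata.json layout may differ.
METADATA_TEXT = """
{"levels": {"gender": ["Boys", "Girls", "Total"],
            "free_school_meals": ["FSM", "FSM all other", "Total"]}}
"""


def check_filter(metadata, index_name, value):
    """Return True if value is a listed level of the named index."""
    return value in metadata.get("levels", {}).get(index_name, [])


metadata = json.loads(METADATA_TEXT)
print(check_filter(metadata, "gender", "Girls"))
print(check_filter(metadata, "gender", "Unknown"))
```

Checking against the listed levels first avoids silently getting an empty result from a mistyped category.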

@gilesdring
Member

For destinations, I've used the KS4 provider level methodology, as this is more likely to capture where the students live (i.e. closer to school rather than closer to college)

@taz-chiles taz-chiles modified the milestones: Phase 3 Deadline , Phase 4 Apr 15, 2024
Labels
None yet
Projects
Status: Done
Status: Backlog
Development

No branches or pull requests

2 participants