Analysis of public comments received on proposed rule on Supporting the Head Start Workforce and Consistent Quality Programming
The purpose of open-sourcing this repository is to be transparent about how AI was used to help analyze public comments efficiently, and to provide a starting point for others who would like to explore using commercial Large Language Models to aid in the public comment analysis process.
How to Run: Outlines how to replicate the project and run the files in this repo.
Technical Documentation: Full technical documentation for this project, including technical considerations for future project iterations and the rationale behind some of our choices.
Cloud Architecture: A detailed outline of how we structured our cloud infrastructure.
Lessons Learned: A collection of lessons learned from the Policy Team and the Data Surge Team.
inputs/: Should hold the pickle file and the file used for bill tagging.
json_outputs/: Holds one JSON output for each chunk of text sent to ChatGPT with a prompt.
logs/: Log files are created when you run data_processing.py and gpt_parallel.py. Logs are timestamped and record any issues with particular comments sent to ChatGPT, as well as the runtime of each script.
outputs/: Holds an "intermediate" and a "final" folder. The "intermediate" folder holds the chunked pickle file created partway through the pipeline. The "final" folder holds the final CSVs exported in long and wide formats, as well as failed_jsons_files.csv and the summary documents.
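To make the pipeline shape concrete, here is a minimal sketch of the chunk-processing step that gpt_parallel.py performs: sending each text chunk to a model in parallel, writing one JSON file per chunk to json_outputs/, and logging failures to a timestamped log. The `send_to_gpt` function below is a hypothetical stand-in for the real API call (the actual script presumably uses the OpenAI API); everything else mirrors the directory layout described above.

```python
import json
import logging
import time
from concurrent.futures import ThreadPoolExecutor, as_completed
from pathlib import Path

def send_to_gpt(chunk_id: int, chunk_text: str, prompt: str) -> dict:
    # Hypothetical stand-in for the real ChatGPT API call; it returns a
    # dict structured like a parsed model response for one chunk.
    return {"chunk_id": chunk_id, "prompt": prompt,
            "analysis": f"summary of: {chunk_text[:30]}"}

def process_chunks(chunks, prompt, out_dir="json_outputs", max_workers=4):
    """Send each chunk to the model in parallel; save one JSON per chunk."""
    Path(out_dir).mkdir(exist_ok=True)
    # Timestamped log file, mirroring the logs/ convention described above.
    logging.basicConfig(
        filename=f"log_{time.strftime('%Y%m%d_%H%M%S')}.log",
        level=logging.INFO,
    )
    results, failed = [], []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {pool.submit(send_to_gpt, i, text, prompt): i
                   for i, text in enumerate(chunks)}
        for fut in as_completed(futures):
            i = futures[fut]
            try:
                result = fut.result()
                # One output file per chunk, as in json_outputs/.
                with open(Path(out_dir) / f"chunk_{i}.json", "w") as f:
                    json.dump(result, f)
                results.append(result)
            except Exception as exc:
                # Failed chunks are logged and tracked rather than crashing
                # the run (they can later feed failed_jsons_files.csv).
                logging.error("chunk %d failed: %s", i, exc)
                failed.append(i)
    return results, failed
```

In the real pipeline the stub would be replaced by an API call with retry handling; the parallel structure and per-chunk JSON output are what matter here.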
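The "long" and "wide" CSV exports in outputs/final/ can be illustrated with a small pandas sketch. The record structure (a `comment_id`/`topic`/`flag` triple) is an assumption made for illustration, not the repo's actual schema: long format holds one row per comment-topic pair, while wide format pivots to one row per comment with one column per topic.

```python
from pathlib import Path

import pandas as pd

# Hypothetical tag records, as might be collected from the per-chunk JSONs.
records = [
    {"comment_id": 1, "topic": "wages", "flag": 1},
    {"comment_id": 1, "topic": "staffing", "flag": 1},
    {"comment_id": 2, "topic": "wages", "flag": 1},
]

# Long format: one row per (comment, topic) pair.
long_df = pd.DataFrame(records)

# Wide format: one row per comment, one 0/1 column per topic.
wide_df = (long_df.pivot_table(index="comment_id", columns="topic",
                               values="flag", fill_value=0)
           .reset_index())

Path("outputs/final").mkdir(parents=True, exist_ok=True)
long_df.to_csv("outputs/final/comments_long.csv", index=False)
wide_df.to_csv("outputs/final/comments_wide.csv", index=False)
```

The long format is convenient for filtering and aggregation; the wide format gives reviewers a one-row-per-comment spreadsheet view.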