A central repository for Princeton Univeristy departmental data, including major, degree, and certificate requirements. Available for Princeton students to develop apps that help with choosing majors and certificates, navigating and understanding departmental requirements, selecting courses, etc.
Currently used by:
Each of the folders named majors
, certificates
, minors
, and degrees
contains requirement files in YAML
format.
The scripts
folder contains a sample script, verifier.py
, with an API that helps to parse and process these requirement files to check which requirements are satisfied by a user's courses. There is also a preprocessor script, preprocess.js
,
that checks the requirement files for errors, standardizes the format, and converts them to both YAML
and JSON
formats.
The json
folder contains the preprocessed JSON
files.
Degrees are either AB or BSE, and are automatically selected depending on the degree type of the user's major. This could in the future be expanded to include graduate degrees.
Majors are listed according to the three-letter course code of their respective departments.
Certificates are also listed according to three-letter codes, but this is less standardized, and the codes are not officially used by the registrar.
The three-letter codes used for naming majors and certificates requirement
files are listed in scripts/university_info.py
.
Each requirements file is structured as a
tree with the full major at
the root, and each requirement containing subrequirements as children.
The leaves of the tree represent the simplest kind of requirement,
which is usually a list of courses (encoded as a course_list
).
Intermediate nodes (which are encoded as a req_list
, which is short for
"requirement list") represent more complex requirement types, and express
some kind of relationship among the subrequirements (for example, it may
require that ALL subrequirements be completed, or it may require a choice of
at least 1 of the subrequirements).
Requirement files are written in YAML
.
YAML
is a superset of JSON
but which
also includes extra features which allow for the possibility to write cleaner
and more human-friendly requirement files. While JSON
is also accepted
by the YAML
parser, the more succinct YAML
form is preferred, since it
is more easily read and maintained by a human.
Note that it is easy to convert between the two using any YAML
parser
(for example, using an online conversion tool).
The expected format of the requirements files, as well as how they are
interpreted, is explained below.
A preprocessing script, preprocess.js
, exists to
check the requirement files for errors, standardize the format, and convert
them to both YAML
and JSON
formats (placed in the out
directory). Please
read the file comment of the script for more information on how to use it.
Note: The script is written in JavaScript and requires Node.js to run.
At the bottom of the requirement trees, the leaves are nearly always
course_list
s. A course_list
is just a list of possible courses that
can count towards a requirement.
Most course codes in these lists are just simply the registrar's course code
for that course, such as AAA 111
.
This is optionally followed by a colon (:
) and the name or title of the course.
Note that everything after the colon is not parsed, and is
in fact just like a comment, used to aid a human reader in following along and
checking the list of courses.
There are several placeholders/wildcard codes that can capture many courses using a single list item. These are used to capture broader categories of courses, such as "300-level courses from the AAA" department, or "200-level language courses".
Here are the possible types of course codes:
code | meaning |
---|---|
AAA 111 AAA 123/BBB 133 |
course listing |
AAA *** AAA * |
any course in department AAA |
AAA 2** AAA 2* |
any 200 level course in department AAA |
LANG 101 | the 101 course in any language department |
STL STN EC, EM, HA, etc |
Distribution requirements |
AAA 101: Intro to Aardvarks | course listing with title (title is ignored and only for human readability) |
A *
in the comment describing a field signifies that it is required.
In YAML
format, the requirements files should be formatted to look like this:
---
#* Major, Certificate, or Degree
type: Major
#* major/certificate name (If separate files for same program, specify AB/BSE.
# For instance, "Computer Science - BSE")
name: Name Studies
#* Three-letter dept code, as listed in university_info.py
# For Spanish/Portuguese and French/Italian, use SPO and FIT, respectively.
code: NST
#* the degree type of a major. AB or BSE for majors, or null otherwise
degree: AB
description: |-
These are a few sentences describing the major/certificate.
It is not strictly required, but should be copied from an official source.
It can contain <i>html-type formatting</i>, and is displayed to the user in
a tooltip when they hover over the major's name.
# allowed_majors is only relevant for certificates and should not appear for
# majors and degrees. It is a list of majors that are allowed to be taken with
# the certificate. Default if empty / not present is that all majors are allowed
allowed_majors:
- NST
# excluded_majors is only relevant for certificates and should not appear for
# majors and degrees. It is a list of majors that are not allowed to be taken with
# the certificate. Default if empty / not present is that no majors are excluded
excluded_majors:
- XXX
# excluded_minors is only relevant for certificates and should not appear for
# majors and degrees. It is a list of minors that are not allowed to be taken with
# the certificate. Default if empty / not present is that no minors are excluded
excluded_minors:
- XXX
# declaration_limit is only relevant for certificates and should not appear for
# majors and degrees. It is the latest semester by which a student must declare
# the certificate. Default if empty / not present is that there is no limit
declaration_limit:
- 6
# every source of information on the listed requirements should be linked to here
# and the info in the links should back up every detail of the requirements file
urls: #* links to requirements pages
- "https://ua.princeton.edu/academic-units/name-studies"
contacts: #* departmental office contacts for the department or certificate
- type: Departmental Representative
name: Dr. Professor
email: [email protected]
#* requirement lists contain requirements and/or subrequirements
req_list: # the highest level **must** contain a req_list
- name: Prerequisites #* requirement name
max_counted: 1 # > 0 or null: max units passed up to the parent requirement. ALL if null or missing
min_needed: 4 #* >= 0 or "ALL": min units demanded of children (subrequirements)
explanation: |- #* long human readable description of the requirement
This text should be copied almost word-for-word from the department website.
It can contain <i>html-type formatting</i> and can include the explicit list of courses if this is how the department website lists it.
This text is displayed to the user in a tooltip when they hover over the requirement.
double_counting_allowed_local: false # whether courses may count for multiple subrequirements of this requirement
# should only be explicitly listed for the root of the subtree to which it applies
# if empty/null/missing false is assumed
double_counting_allowed: false # whether courses may count for multiple subrequirements of this requirement
# if empty/null/missing false is assumed
max_common_with_major: 0 # number of courses that can be in common with major
# only relevant for certificates
pdfs_allowed: false # whether student is allowed to take the courses SPDF (student-elected Pass/D/Fail)
# can be false, true, or a number indicating how many courses can
# if empty/null/missing false/0 is assumed
completed_by_semester: 4 # 1-8: semester by the end of which the requirement must be complete
# optional and usually only needed for some prerequisites
#* the requirement may contain only one of: a course_list, a req_list, a dist_req, or a num_courses
# each has a different purpose and function
req_list: #* a req_list defines this requirement as composed of a list of smaller subrequirements
- name: First Prerequisite #* a subreq. is only revealed to user if its name field exists and is non-null
# otherwise, it is a 'hidden subrequirement'
max_counted: 1 #* see above
min_needed: 1 #* see above
no_crosslist: #* course_list does not include crosslistings
explanation: Take an interesting course in the department #*
#* this again may be any one of: a course_list, a req_list, a dist_req, or a num_courses
course_list: #* a course_list defines this requirement as an explicit or implicit list of courses
- NST 100 # course lists contain the course codes
- NST 2** # of courses satisfying the requirement
- NST 312C # and, optionally, a colon-sparated course name which is ignored
- NST 96 # by the parser (only for human reference)
- NST 482/ACR 382 # (see the course code conventions table above)
- NST 487: The Study of Modern Names
# an optional list of courses excluded from counting for this requirement
excluded_course_list: # the format is the same as for a course_list
- NST 221 # this prevents NST 221 from counting, despite NST 2** listed above
# any requirement that the app cannot possibly verify,
# such as a Senior`` Thesis or internship, should contain a no_req
# instead of a req_list or course_list. Here is an example:
- name: Unverifiable Requirement
max_counted: #*
min_needed: #*
explanation: All students must visit New York City three times. #*
iw_relationship: combined | hybrid | separate #* whether the requirement is combined, hybrid, or separate with the major IW requirement
no_req: #* The value of the no_req field is ignored and may be empty/null
Note: For the root node of the requirement tree, min_needed
is always
implicitly "ALL"
and max_counted
is irrelevant, so neither is
explicitly present. In rare cases, min_needed
may be provided explicitly at
the root to override the default (for example if the top level of the tree
happens to be an OR of different options).
For legacy requirement files, many of which are still written in JSON
,
the format looks instead something like the following. Note that these are
both equivalent and equally valid YAML
formats.
{ # Note: JSON does not normally allow comments, but this is JSON parsed as YAML
"type": "Major", # for descriptions of these fields, see the YAML version above
"name": "Name Studies",
"code": "NST",
"degree": "AB",
"description": "These are at most a couple sentences describing the Name Studies major/certificate.\nIt is not strictly required, but should be copied from an official source.",
"allowed_majors": [
"NST"
],
"urls": [
"https://ua.princeton.edu/academic-units/name-studies"
],
"contacts": [
{
"type": "Departmental Representative",
"name": "Dr. Professor",
"email": "[email protected]"
}
],
"req_list": [
{
"name": "Prerequisites",
"max_counted": 1,
"min_needed": 4,
"explanation": "Long text\nfrom dept website",
"double_counting_allowed": false,
"max_common_with_major": 0,
"pdfs_allowed": false,
"completed_by_semester": 4,
"req_list": [
{
"name": "First Prerequisite",
"max_counted": 1,
"min_needed": 1,
"explanation": "Long description",
"course_list": [
"NST 100",
"NST 2**",
"NST 312C",
"NST 96",
"NST 482/ACR 382",
"NST 487: The Study of Modern Names"
]
}
]
},
{
"name": "Unverifiable Requirement",
"max_counted": null,
"min_needed": null,
"explanation": "All students must visit New York City three times.",
"no_req": null
}
]
}
Additional subrequirement types are available for use in the requirement files for degree types (AB: "Bachelor of Arts", and BSE: "Bachelor of Science in Engineering"), but will normally not be used in major or certificate requirement files, except perhaps in rare scenarios.
These include num_courses
, which verifies that users have taken at
least a certain prescribed number of courses by different checkpoints in
their 4 years (used to track degree progress), and dist_req
, which allows
any courses that fall under the specified
distribution area
to count.
Here is an example of their usage:
---
type: Degree
name: A.B. #* "A.B." or "B.S.E."
code: AB #* "AB" or "BSE"
urls:
- "https://ua.princeton.edu/contents/program-study-degree-bachelor-arts"
contacts:
- type: Dean
name: Dr. Professor
email: [email protected]
req_list:
- name: Degree Progress
max_counted: 1
min_needed: ALL
explanation: Explanation of degree progress requirements
req_list:
- name: By 6th semester
max_counted: 1
explanation:
completed_by_semester: 6 #* semester by which these courses must be taken
num_courses: 25 #* number of course credits that must be completed
# note: min_needed is not present and is automatically set to num_courses
- name: Total courses
max_counted: 1
explanation:
completed_by_semester: 8
num_courses: 31
- name: Epistemology and Cognition
max_counted: 1
min_needed: 1
explanation: Explanation
dist_req: # list of distribution area codes: EC, EM, HA, etc.
- EC # usually, the list will contain only one code
- name: Quantitative and Computational Reasoning
max_counted: 1
min_needed: 1
explanation: Explanation
dist_req: # for codes that changed names, include both old and new codes
- QR
- QCR
The requirements file format supports another primitive called year_switch
,
which decides from a list of possibilities depending on the user's class year.
This is useful when the requirements differ for students in different class
years.
For example, the requirements for a certain major might change, but then
only apply to the incoming class and later, or an exception might be made
for a specific class year (for instance some requirements were suspended or
modified during the COVID19 pandemic).
A year_switch
is formatted almost like a req_list
, in that its value is
a list of subrequirements.
Each subrequirement must have a year_code
field, which is a string or integer
which specifies which class year range that the subrequirement applies to
(refer to the table below for possible values).
Style dictates that, for readability, the year_code
field should always
appear at the top of the subrequirement.
The first requirement in the list whose year_code
matches the student's
class year is selected, at which point its explicitly listed fields override
those of the parent requirement (the one that contains the year_switch
),
and the year_switch
and year_code
fields are dropped.
Interpretation of the requirements file then proceeds as usual (with any
remaining year_switch
s evaluated recursively).
As an example, this requirement
---
name: A Transitioning Requirement
max_counted: 1
min_needed:
explanation: Students from the class of 2021 should take any one NST course,
while students from the classes of 2022 and later should take one 300-level
and one 400-level NST course.
year_switch:
- year_code: 2021
min_needed: 1 # note that this overrides the existing min_needed if selected
course_list:
- NST *
- year_code: ">=2022"
min_needed: ALL
double_counting_allowed: true # any additional fields can be added here too
req_list: # the structure need not be the same among the different cases
- name: One 300-level course
max_counted: 1
min_needed: 1
explanation:
course_list:
- NST 3**
- name: One 400-level course
max_counted: 1
min_needed: 1
explanation:
course_list:
- NST 4**
- year_code: default # equivalently, null or blank
max_counted:
min_needed:
explanation: This requirement only applies to classes of 2021 and after,
so lucky you, you don't have to do anything!
no_req:
will appear differently to students of different class years.
To a student of the class of 2021, it will appear as if it were
---
name: A Transitioning Requirement
max_counted: 1
min_needed: 1
explanation: Students from the class of 2021 should take any one NST course,
while students from the classes of 2022 and later should take one 300-level
and one 400-level NST course.
course_list:
- NST *
while to a student from the classes of 2022 and later, it will appear as
---
name: A Transitioning Requirement
max_counted: 1
min_needed: ALL
explanation: Students from the class of 2021 should take any one NST course,
while students from the classes of 2022 and later should take one 300-level
and one 400-level NST course.
double_counting_allowed: true
req_list:
- name: One 300-level course
max_counted: 1
min_needed: 1
explanation:
course_list:
- NST 3**
- name: One 400-level course
max_counted: 1
min_needed: 1
explanation:
course_list:
- NST 4**
To a student of any earlier class, it would fall to the default case and look like
---
name: A Transitioning Requirement
max_counted:
min_needed:
explanation: This requirement only applies to classes of 2021 and after,
so lucky you, you don't have to do anything!
no_req:
Note that if no default case is specified, and a user's class year does not
fall under any of the cases, then the year_switch
is just dropped without
modifying the parent requirement.
This means that whatever fields are specified explicitly in the parent are
used.
In the example above, this would result in an incomplete requirement, since
the parent requirement is not fully specified by itself, but this doesn't
have to be the case.
For example, the following requirement is fully specified with or without the
contents of the year_switch
.
---
name: Any NST Course
max_counted: 1
min_needed: 1
explanation: Any one NST course. From the class of 2023, NST 300 doesn't count.
course_list:
- NST *
year_switch:
- year_code: ">=2023"
excluded_course_list:
- NST 300
The possible values for the year_code
field are as follows:
code | meaning |
---|---|
"<XXXX", "<=XXXX", ">XXXX", ">=XXXX", "==XXXX", "!=XXXX" | less than, less than or equal to, etc. the year XXXX |
"XXXX" (a str) or XXXX (an int) | same as ==XXXX |
"XXXX-YYYY" | inclusive range of years: >=XXXX and <=YYYY |
"default", null, empty or missing | default case, applied when no other case matches |
Course lists sometimes appear multiple times in the same requirements file,
and maintaining the duplicated lists separately can become a source of
headache. Fortunately, YAML
provides a way to alias and reference previously
defined course lists.
For instance, in the following requirement the "Electives" subrequirement
excludes all the courses from the course_list
of the "Core" subrequirement.
Any time the list of core courses is modified, it must be edited in both
locations and they must be manually checked for consistency.
req_list:
- name: Core courses
max_counted: 1
min_needed: ALL
explanation: Take all the required core courses
course_list:
- NST 237
- NST 244
- NST 253
- NST 273
- name: Electives
max_counted: 1
min_needed: 2
explanation: Two 200-level courses beyond the core courses
course_list:
- NST 2**
excluded_course_list: # core courses do not count
- NST 237
- NST 244
- NST 253
- NST 273
Instead, the first course list may be assigned an alias, which is then referenced the second time. The following is identical to the requirement above, but the course list is deduplicated, which allows it to be maintained in a single location.
req_list:
- name: Core courses
max_counted: 1
min_needed: ALL
explanation: Take all the required core courses
course_list: &core_courses_list # an alias is defined for the following list
- NST 237
- NST 244
- NST 253
- NST 273
- name: Electives
max_counted: 1
min_needed: 2
explanation: Two 200-level courses beyond the core courses
course_list:
- NST 2**
excluded_course_list: *core_courses_list # references the list defined above
Note that this feature of YAML
may not be intuitive to human readers,
so always include comments explaining the situation whenever using this
feature. For example, when referencing a previously defined alias, add
a comment specifying which course list this is referencing and where to
find the original definition. Furthermore, always use a descriptive name
for the alias (i.e. "core_courses_list"
as opposed to something generic
like "id001"
).