data
output/main-results
run
scripts
.gitignore
README.md
common.py
contradiction_sentence.py
contradiction_structured.py
extraction.py
get_classes.py
metrics.py
paraphrase.py
pyproject.toml

README.md

ChatGPT scripts

Scripts:

extraction.py: extract cause, effect and relations from text.
- Uses the same data format as the genqa_joint model
- Uses some instances as demonstration examples
contradiction_sentence.py: generates a simple contradiction to the input. No examples.
contradiction_structured.py: creates a contradiction from structured input.
- The contradiction is obtained by swapping cause and effect
- Uses the same data format as the reconstruct model
- Uses some instances as demonstration examples
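
As an illustration of the cause/effect swap that contradiction_structured.py describes, here is a minimal sketch. The `Instance` class, its field names and the `swap_cause_effect` helper are hypothetical and exist only for this example; the script's real data format (the one shared with the reconstruct model) may look different.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Instance:
    """Hypothetical structured instance; the real data format may differ."""

    cause: str
    effect: str
    relation: str = "cause"


def swap_cause_effect(instance: Instance) -> Instance:
    """Return a copy with cause and effect exchanged; the relation is kept."""
    return replace(instance, cause=instance.effect, effect=instance.cause)


if __name__ == "__main__":
    original = Instance(cause="heavy rain", effect="flooding")
    print(swap_cause_effect(original))
```

Swapping twice returns the original instance, which makes for a quick sanity check on the transformation.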

Run python <script>.py -h for options.

For convenience, there's a run_extraction.fish script that sets up common options. There's also experiment_extraction.fish, which runs the extraction script through various combinations of options. Both require the Fish shell.

Tools used:

pyright (basic)
ruff (lint and format)
sourcery

The scripts assume a specific location for the python binary. Set up the virtualenv with:

uv venv
source .venv/bin/activate.fish # remove .fish if $SHELL is bash/zsh
uv pip install -r requirements.txt

uv is recommended because it's a lot faster, but the standard Python venv and pip work too.

The data is already preprocessed in the data folder.

Main GPT extraction results

The results are in the output/main-results folder.

The real output file is output.tags.json and the real metrics one is output.tags.metrics.json.

The other files are the original ones, but they used the LINES format, which is better for GPT but worse for the rest of the project, which uses TAGS. I preserved them, but gzipped them to save space.

They were converted from LINES to TAGS with the following command:

chatgpt/scripts/convert_lines_to_tags.py output.json > output.tags.json
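
To give a rough idea of what a LINES to TAGS conversion could involve, here is a sketch under assumed formats. This README does not define either format, so the layouts below (one `Key: value` pair per line for LINES, bracketed markers on a single line for TAGS) and the `lines_to_tags` function are assumptions made for illustration. This is not the code in convert_lines_to_tags.py.

```python
def lines_to_tags(text: str) -> str:
    """Convert an assumed LINES-style string into an assumed TAGS-style string.

    Assumed LINES layout (hypothetical):
        Cause: heavy rain
        Effect: flooding
        Relation: cause
    Assumed TAGS layout (hypothetical):
        [Cause] heavy rain [Relation] cause [Effect] flooding
    """
    fields: dict[str, str] = {}
    for line in text.splitlines():
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip().lower()] = value.strip()

    order = ("cause", "relation", "effect")
    return " ".join(f"[{name.capitalize()}] {fields.get(name, '')}" for name in order)


if __name__ == "__main__":
    print(lines_to_tags("Cause: heavy rain\nEffect: flooding\nRelation: cause"))
```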

The metrics were calculated using the following command:

self_critique/scripts/eval_std.py output.tags.json

Both of these commands were run in the root of the project and require the venv from self_critique to be active.

Git commit from these runs: ce8f008
