This repository contains code for the paper Reverse-engineering NLI: A study of the meta-inferential properties of Natural Language Inference.
By meta-inferential, we mean properties like the transitivity of entailment:
Given SNLI examples
$(P_1, H_1, E)$ and $(P_2, H_2, E)$, both with label $E$ for entailment: if $P_2 = H_1$, then we can conclude that $(P_1, H_2, E)$ is also a valid example.
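The transitivity pattern above can be sketched in a few lines of Python. The function name and tuple representation here are illustrative, not part of this repository's API:

```python
def compose_entailments(ex1, ex2):
    """If (P1, H1, E) and (P2, H2, E) overlap with P2 == H1,
    transitivity of entailment licenses the new example (P1, H2, E)."""
    p1, h1, l1 = ex1
    p2, h2, l2 = ex2
    if l1 == l2 == "entailment" and p2 == h1:
        return (p1, h2, "entailment")
    return None  # the pattern does not apply

ex1 = ("A man is playing a guitar.", "A man is playing an instrument.", "entailment")
ex2 = ("A man is playing an instrument.", "A man is making music.", "entailment")
print(compose_entailments(ex1, ex2))
# composes into ("A man is playing a guitar.", "A man is making music.", "entailment")
```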
More generally, we are interested in what can be inferred from NLI examples with any overlap between their premises and hypotheses.
One reason this is interesting is that the valid meta-inferential patterns depend on how the NLI labels of entailment, contradiction and neutral
are interpreted.
By observing which meta-inferential patterns models trained on a particular NLI dataset validate,
we can reverse-engineer which reading of inference labels the model learned from the data.
We exploit three sources of overlapping sentences to test meta-inferential patterns for models trained on SNLI:
- Each SNLI premise is used to construct up to three different examples (one for each label). Given examples $(P, H_1, L_1)$ and $(P, H_2, L_2)$, we can consider how the model deals with $(H_1, H_2)$ and $(H_2, H_1)$.
- We also use LLMs to generate new examples. Given $(P, H, L)$, we ask the model to generate an example $(H, H', L')$ for each label and test model predictions for $(P, H')$ and $(H', P)$.
- Given an example $(P, H, L)$, we can get model predictions for $(H, P)$.
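The first and third schemes can be sketched as simple enumerations over SNLI-style `(premise, hypothesis, label)` triples. This is a hypothetical illustration of the pairing logic, not the repository's exact code:

```python
from collections import defaultdict
from itertools import permutations

def shared_premise_pairs(examples):
    """Scheme 1: hypotheses that share a premise yield test items
    (H1, H2) and (H2, H1), carrying along their original labels."""
    by_premise = defaultdict(list)
    for p, h, l in examples:
        by_premise[p].append((h, l))
    for hyps in by_premise.values():
        # all ordered pairs of distinct hypotheses for this premise
        for (h1, l1), (h2, l2) in permutations(hyps, 2):
            yield (h1, h2, l1, l2)

def reversal_items(examples):
    """Scheme 3: each (P, H, L) yields the reversed test item (H, P)."""
    for p, h, l in examples:
        yield (h, p, l)
```

What label the model *should* predict for each derived pair depends on the reading of the NLI labels, which is exactly what the paper's meta-inferential analysis probes.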
... more details in the paper!
The main data used is SNLI, which can be downloaded here. The code expects to find it at `data/snli_1.0`.
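For reference, SNLI ships as JSONL files with the standard field names `sentence1`, `sentence2`, and `gold_label`; a minimal loader sketch (function names are illustrative, not the repository's API) might look like:

```python
import json
import os

def parse_snli_line(line):
    """Parse one SNLI JSONL record into a (premise, hypothesis, label) triple.
    Records with gold_label "-" (no annotator consensus) are dropped."""
    ex = json.loads(line)
    if ex["gold_label"] == "-":
        return None
    return (ex["sentence1"], ex["sentence2"], ex["gold_label"])

def load_snli_split(data_dir="data/snli_1.0", split="train"):
    path = os.path.join(data_dir, f"snli_1.0_{split}.jsonl")
    with open(path) as f:
        return [ex for ex in map(parse_snli_line, f) if ex is not None]
```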
- `data.py` - data loader for SNLI and extended SNLI datasets
- `find_prompt_examples.py` - searches SNLI for examples with "perfect" agreement among re-annotators and saves them for prompting the LLM
- `generate_nli_items.py` - uses an LLM to generate new items based on SNLI hypotheses
- `infer_nli_items.py` - creates a dataset of new items according to the schemes described above
We test a vanilla BERT model with a 3-way classifier on the pooler output, and RoBERTa + Self-Explaining, a recent state-of-the-art model on SNLI.
The self-explaining code is adapted from the paper repository found here.
- `nli.py` - train the NLI models
- `evaluate.py` - produce model predictions for the SNLI, generated, and inferred test sets
- `analysis.py` - create the tables found in the paper