update readme
AndreFCruz committed Jun 12, 2024
1 parent b361ed0 commit e074d6b
Showing 2 changed files with 13 additions and 13 deletions.
10 changes: 5 additions & 5 deletions README.md
@@ -106,15 +106,15 @@ clf.predict(dataset)

# Compute a variety of evaluation metrics on calibration and accuracy
from folktexts.benchmark import CalibrationBenchmark
-benchmark_results = CalibrationBenchmark(clf, dataset, results_dir="results").run()
+benchmark_results = CalibrationBenchmark(clf, dataset).run(results_root_dir=".")
```

## Benchmark options

```
usage: run_acs_benchmark.py [-h] --model MODEL --task-name TASK_NAME --results-dir RESULTS_DIR --data-dir DATA_DIR [--few-shot FEW_SHOT] [--batch-size BATCH_SIZE] [--context-size CONTEXT_SIZE] [--fit-threshold FIT_THRESHOLD]
-[--subsampling SUBSAMPLING] [--seed SEED] [--dont-correct-order-bias] [--chat-prompt] [--direct-risk-prompting] [--reuse-few-shot-examples] [--logger-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}]
-[--use-feature-subset [USE_FEATURE_SUBSET ...]] [--use-population-filter [USE_POPULATION_FILTER ...]]
+[--subsampling SUBSAMPLING] [--seed SEED] [--dont-correct-order-bias] [--chat-prompt] [--direct-risk-prompting] [--reuse-few-shot-examples] [--use-feature-subset [USE_FEATURE_SUBSET ...]]
+[--use-population-filter [USE_POPULATION_FILTER ...]] [--logger-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}]
Run an LLM as a classifier experiment.
@@ -143,12 +143,12 @@ options:
[bool] Whether to directly prompt for risk-estimates instead of multiple-choice Q&A
--reuse-few-shot-examples
[bool] Whether to reuse the same samples for few-shot prompting (or sample new ones every time)
---logger-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}
-[str] The logging level to use for the experiment
--use-feature-subset [USE_FEATURE_SUBSET ...]
[str] Optional subset of features to use for prediction
--use-population-filter [USE_POPULATION_FILTER ...]
[str] Optional population filter for this benchmark; must follow the format 'column_name=value' to filter the dataset by a specific value.
+--logger-level {DEBUG,INFO,WARNING,ERROR,CRITICAL}
+[str] The logging level to use for the experiment
```
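
The `--use-population-filter` help text above requires each entry to follow the `column_name=value` format. A minimal sketch of how such entries could be split into a column-to-value mapping follows. `parse_population_filter` is a hypothetical helper written for illustration (it is not taken from folktexts), and the column names `SEX` and `RAC1P` are example values only.

```python
from __future__ import annotations


def parse_population_filter(filters: list[str]) -> dict[str, str]:
    """Hypothetical helper: turn ['COL=value', ...] into {'COL': 'value', ...}."""
    parsed: dict[str, str] = {}
    for item in filters:
        # Split on the first '=' only, so values may themselves contain '='.
        column, sep, value = item.partition("=")
        if not sep or not column or not value:
            raise ValueError(f"Expected 'column_name=value', got {item!r}")
        parsed[column] = value
    return parsed


# Example values only; any column of the dataset could be used.
example = parse_population_filter(["SEX=2", "RAC1P=1"])
```

Values are kept as strings in this sketch. Whether and how they are cast before filtering is not specified by the help text.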


16 changes: 8 additions & 8 deletions folktexts/cli/run_acs_benchmark.py
@@ -69,14 +69,6 @@ def setup_arg_parser() -> ArgumentParser:
         default=False,
     )

-    parser.add_argument(
-        "--logger-level",
-        type=str,
-        help="[str] The logging level to use for the experiment",
-        choices=["DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"],
-        required=False,
-    )
-
     # Optionally, receive a list of features to use (subset of original list)
     parser.add_argument(
         "--use-feature-subset",
@@ -97,6 +89,14 @@ def setup_arg_parser() -> ArgumentParser:
         required=False,
     )

+    parser.add_argument(
+        "--logger-level",
+        type=str,
+        help="[str] The logging level to use for the experiment",
+        choices=["DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"],
+        required=False,
+    )
+
     return parser
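
The hunks above only move the `--logger-level` registration below the other optional arguments. As a small illustration of why that changes the README usage text but not behavior: argparse lists optional arguments in its help output in the order they were added, while parsing does not depend on that order. The toy parser below is an assumption for demonstration, not the real `setup_arg_parser`. `build_parser`, its `logger_level_last` flag, and the `AGEP`/`SCHL` values are hypothetical.

```python
from argparse import ArgumentParser

LEVELS = ["DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"]


def build_parser(logger_level_last: bool) -> ArgumentParser:
    """Toy parser with two of the options from the diff (not the folktexts parser)."""
    parser = ArgumentParser(prog="demo")

    def add_logger_level() -> None:
        parser.add_argument("--logger-level", type=str, choices=LEVELS, required=False)

    if not logger_level_last:
        add_logger_level()  # old position: before the feature-subset option
    parser.add_argument("--use-feature-subset", type=str, nargs="*", required=False)
    if logger_level_last:
        add_logger_level()  # new position: registered last
    return parser


before, after = build_parser(False), build_parser(True)
argv = ["--logger-level", "INFO", "--use-feature-subset", "AGEP", "SCHL"]

# Parsing gives the same result regardless of registration order.
assert vars(before.parse_args(argv)) == vars(after.parse_args(argv))

# The help text, however, lists the options in registration order.
help_before, help_after = before.format_help(), after.format_help()
```

Comparing `help_before` and `help_after` shows `--logger-level` moving from before `--use-feature-subset` to after it, which matches the reordering in the README usage block.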

