Implemented remaining ACS columns and prediction tasks (#1)

* readme update
* added remaining ACS columns
* added ACS target columns
* created Threshold class
* fixing type annotations
* added healthinsurance task
* minor bug fixes
* hash is now deterministic :)
* fixed random hash changes
* remaining ACS tasks seem to be working
* fixed ACSDataset assignment of new task
* minor fix to setting new task on ACSDataset
* minor change
* minor updates
* minor updates
socialfoundations · Jun 24, 2024 · 78556f0
1 parent cc65e1b

Showing 22 changed files with 1,124 additions and 391 deletions.
diff --git a/README.md b/README.md
@@ -5,14 +5,12 @@
 ![Documentation status](https://github.com/socialfoundations/folktexts/actions/workflows/python-docs.yml/badge.svg)
 ![PyPI version](https://badgen.net/pypi/v/folktexts)
 ![PyPI - License](https://img.shields.io/pypi/l/folktexts)
-<!-- ![OSI license](https://badgen.net/pypi/license/folktexts) -->
-<!-- [![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE) -->
 ![Python compatibility](https://badgen.net/pypi/python/folktexts)
 
-Folktexts is a python package to evaluate and benchmark calibration of large
-language models.
-It enables using any transformers model as a classifier for tabular data tasks, 
-and extracting risk score estimates from the model's output log-odds.
+
+Folktexts is a python package to compute and evaluate classification risk scores
+using large language models.
+It enables using any transformers model as a classifier for tabular data tasks.
 
 Several benchmark tasks are provided based on data from the American Community Survey.
 Namely, each prediction task from the popular 

diff --git a/docs/_static/PUMS_Data_Dictionary_2018.pdf b/docs/_static/PUMS_Data_Dictionary_2018.pdf
diff --git a/folktexts/__init__.py b/folktexts/__init__.py
@@ -1,4 +1,5 @@
 from ._version import __version__, __version_info__
-from .acs import ACSDataset, ACSTaskMetadata
+from .task import TaskMetadata
 from .benchmark import BenchmarkConfig, CalibrationBenchmark
 from .classifier import LLMClassifier
+from .acs import ACSDataset, ACSTaskMetadata
diff --git a/folktexts/_utils.py b/folktexts/_utils.py
@@ -8,6 +8,7 @@
 from datetime import datetime
 from functools import partial, reduce
 from pathlib import Path
+from contextlib import contextmanager
 
 import numpy as np
 
@@ -91,7 +92,13 @@ def standardize_path(path: str | Path) -> str:
     return Path(path).expanduser().resolve().as_posix()
 
 
-def get_thresholded_column_name(column_name: str, threshold: float | int) -> str:
-    """Standardizes naming of thresholded columns."""
-    threshold_str = f"{threshold:.2f}".replace(".", "_") if isinstance(threshold, float) else str(threshold)
-    return f"{column_name}_binary_{threshold_str}"
+@contextmanager
+def suppress_logging(new_level):
+    """Suppresses all logs of a given level within a context block."""
+    logger = logging.getLogger()
+    previous_level = logger.level
+    logger.setLevel(new_level)
+    try:
+        yield
+    finally:
+        logger.setLevel(previous_level)
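
Usage note on the new `suppress_logging` helper: it temporarily sets the level of the root logger, so records below `new_level` are dropped inside the `with` block and the previous level is restored on exit, even if the block raises. Loggers that have their own explicit level set are not affected by the root level. The snippet below is a minimal, self-contained sketch: it copies the helper from the diff (with the `logging` import it relies on) and uses a hypothetical `ListHandler` class, which is not part of folktexts, to record which messages get through.

```python
import logging
from contextlib import contextmanager


@contextmanager
def suppress_logging(new_level):
    """Copy of the helper added in this commit (root-logger level swap)."""
    logger = logging.getLogger()
    previous_level = logger.level
    logger.setLevel(new_level)
    try:
        yield
    finally:
        logger.setLevel(previous_level)


class ListHandler(logging.Handler):
    """Hypothetical handler for illustration: stores emitted messages."""

    def __init__(self):
        super().__init__()
        self.messages = []

    def emit(self, record):
        self.messages.append(record.getMessage())


root = logging.getLogger()
handler = ListHandler()
root.addHandler(handler)
root.setLevel(logging.INFO)
level_before = root.level

with suppress_logging(logging.ERROR):
    logging.warning("hidden")   # below ERROR: dropped by the root logger
    logging.error("shown")      # at ERROR: still emitted
    level_inside = root.level

logging.warning("visible again")  # previous level (INFO) is restored
root.removeHandler(handler)
```

Because the level is restored in a `finally` clause, the same pattern should be safe around calls that may raise, for example when silencing noisy warnings during a data-loading step.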