Skip to content

Releases: klarna-incubator/mleko

v0.8.0

06 Sep 21:33
Compare
Choose a tag to compare

Feature

  • model: ✨ Add LGBMModel along with base class which can be extended for all types of future models. (b47a241)
  • ✨ Add DataSchema which tracks dataset features throughout the pipeline and methods. (e03bd2c)
  • feature selection: ✨ Update BaseFeatureSelector and children to use the fit, transform and fit_transform pattern. (62e4dd1)
  • transformer: ✨ Add fit, transform and fit_transform to all Transformers, along with API and caching simplificatons. (5cc4ebc)
  • cache: ✨ Add CacheHandler which allows customization of read/write functions for each cached return value individually. (609e084)

Fix

  • feature selection: 🐛 Add DataSchema as partial return from all fit methods in feature selectors. (ebf2484)

Refactor

  • cache: 🚸 Replace disable_cache with a check if cache_size=0 for LRUCacheMixin. (cfd7592)

Chore

  • deps: Bump pypa/gh-action-pypi-publish from 1.8.8 to 1.8.10 (#76) (d6dc41f)
  • deps: Bump pypa/gh-action-pypi-publish from 1.8.8 to 1.8.10 (a7363ee)
  • deps: Bump sphinx-autodoc-typehints from 1.23.3 to 1.24.0 in /docs (#75) (beece1c)
  • deps: Bump sphinx-autodoc-typehints in /docs (aea624c)
  • deps: Bump pip from 23.2 to 23.2.1 in /.github/workflows (#74) (b86840f)
  • deps: Bump pip from 23.2 to 23.2.1 in /.github/workflows (35f6828)
  • deps: Bump pip from 22.3.1 to 23.2 in /.github/workflows (#73) (a0083bd)
  • deps: Bump pip from 22.3.1 to 23.2 in /.github/workflows (de871ea)
  • deps: Bump pypa/gh-action-pypi-publish from 1.8.7 to 1.8.8 (#72) (144a702)
  • deps: Bump pypa/gh-action-pypi-publish from 1.8.7 to 1.8.8 (2747ead)

v0.7.0

11 Jul 17:49
Compare
Choose a tag to compare

Feature

  • ✨ Add fit transform support to all FeatureSelector along with refactoring the LRUCacheMixin. (3df0601)
  • ✨ Add support for separate fitting and transforming inside the pipeline. (bb9b7a4)

Fix

  • data cleaning: 🐛 Switched to HDF5 as file format for faster I/O and better SageMaker support. (61f9e42)

Chore

  • deps: Bump nox-poetry from 1.0.2 to 1.0.3 in /.github/workflows (#70) (5aa535d)
  • deps: Bump nox-poetry from 1.0.2 to 1.0.3 in /.github/workflows (b55b7e2)

v0.6.1

30 Jun 10:51
Compare
Choose a tag to compare

Fix

  • data cleaning: 🐛 Fix date32/64[day] not converted to datetime. (98f4b26)
  • data source: 🐛 Fix bug where S3 buckets with no manifest caused crash. (9078845)

Build

  • config: 🔧 Switch mypy for pyright and update configuration. (5631aed)

Chore

  • deps: Bump pypa/gh-action-pypi-publish from 1.5.0 to 1.8.7 (#67) (74315f4)
  • deps: Bump sphinx-autodoc-typehints from 1.23.2 to 1.23.3 in /docs (#68) (54e0845)
  • deps: Bump sphinx-autodoc-typehints in /docs (5c285be)
  • deps: Bump pypa/gh-action-pypi-publish from 1.5.0 to 1.8.7 (b8f0f29)

v0.6.0

26 Jun 21:29
Compare
Choose a tag to compare

Feature

  • cache: ✨ Add cache_group that can segment an instance cache into different isolated parts. (b5c3de5)

Chore

  • deps: Bump sphinx-autodoc-typehints in /docs (de2e720)

v0.5.0

17 Jun 16:54
Compare
Choose a tag to compare

Feature

  • transformer: ✨ Add MinMaxScalerTransformer for normalizing numerical features. (9b26c00)
  • transformer: ✨ Add MaxAbsScalerTransformer that scales numerical features. (1fd2a93)
  • transformer: ✨ Add CompositeTransformer for chaining together multiple transformers sequentially. (006d741)
  • transformer: ✨ Add LabelEncoderTransformer for ordinal encoding. (41a4c45)
  • transformer: ✨ Add FrequencyEncoderTransformer along with support for pipeline. (465e6db)

Refactor

  • 💫 Switch to tqdm.auto to prevent breaking in Jupyter notebooks. (dc139cf)

Test

  • ✅ Now _get_local_filenames returns a sorted list of filenames to ensure stability. (774e8eb)

Chore

  • deps: Fix issue with boolean logic in last commit. (3832911)
  • deps: Update release workflow to ignore chore(deps) commits. (a7d5f62)
  • deps: Bump sphinx-autoapi from 2.1.0 to 2.1.1 in /docs (2cb82d1)

v0.4.2

11 Jun 08:22
Compare
Choose a tag to compare

Performance

  • ⚡️ Optimize VarianceFeatureSelector when threshold is 0. (906dde3)

Refactor

  • ➖ Remove pandas dependency. (40e264c)

Ci

  • semantic versioning: 👷 Add more sections to changelog based on conventional commit categories. (e5b1594)

v0.4.1

04 Jun 20:58
Compare
Choose a tag to compare

Fix

  • feature selection: 🐛 Fix FeatureSelector cache to use tuple instead of frozenset to have stable fingerprint. (cd82417)

v0.4.0

03 Jun 21:08
Compare
Choose a tag to compare

Feature

  • feature selection: ✨ Add that filters out invariant features. (798c261)
  • feature selection: ✨ Add PearsonCorrelationFeatureSelector which drops highly correlated features. (66e5cd2)
  • feature selection: ✨ Add CompositeFeatureSelector, for chaining multiple feature selection steps on the same DataFrame. (3d75079)
  • feature selection: ✨ Add standard deviation feature selector. (c56177b)
  • feature selection: ✨ Add missing rate feature selector. (d5ba8b5)

Fix

  • 🐛 Fix typeguard breaking changes causing build to fail. (66c6a8e)

v0.3.1

21 May 14:50
Compare
Choose a tag to compare

Fix

  • 🐛 Added notes to pipeline step docstrings. (d94f899)

v0.3.0

21 May 14:11
Compare
Choose a tag to compare

Feature

Fix

  • data splitting: 🐛 Added notes and examples to splitters docstrings. (d162c86)
  • pipeline: 🐛 Updated some docstrings. (56b36fd)