Releases: jrzaurin/pytorch-widedeep
Fixed the implementation of the Additive Attention
Simple minor release fixing the implementation of the additive attention (see #110).
Self-Supervised Pre-Training for Tabular models
There are a number of changes and new features in this release. Here is a summary:
- Refactored the code related to the 3 forms of training in the library:
    - Supervised Training (via the `Trainer` class)
    - Self-Supervised Pre-Training: we have implemented two methods or routines for self-supervised pre-training. These are:
        - Encoder-Decoder Pre-Training (via the `EncoderDecoderTrainer` class): this is inspired by the TabNet paper
        - Contrastive-Denoising Pre-Training (via the `ContrastiveDenoisingTrainer` class): this is inspired by the SAINT paper
    - Bayesian or Probabilistic Training (via the `BayesianTrainer` class): this is inspired by the paper Weight Uncertainty in Neural Networks
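In spirit, encoder-decoder pre-training amounts to fitting an encoder on unlabelled rows by asking a decoder to reconstruct the input, then reusing the encoder for supervised training. A deliberately tiny, self-contained sketch of that idea (a plain NumPy linear autoencoder; this is not the library's implementation, which wraps real deep tabular models):

```python
import numpy as np

# Toy encoder-decoder pre-training: learn an encoder on unlabelled
# tabular data by minimising a decoder's reconstruction error.
rng = np.random.default_rng(0)
X = rng.normal(size=(256, 8))                # "unlabelled" tabular data

W_enc = rng.normal(scale=0.1, size=(8, 4))   # encoder weights
W_dec = rng.normal(scale=0.1, size=(4, 8))   # decoder weights

lr = 0.05
for _ in range(500):
    Z = X @ W_enc                            # encode
    X_hat = Z @ W_dec                        # decode (reconstruct)
    err = X_hat - X
    g_dec = (Z.T @ err) / len(X)             # gradient w.r.t. decoder
    g_enc = (X.T @ (err @ W_dec.T)) / len(X) # gradient w.r.t. encoder
    W_dec -= lr * g_dec
    W_enc -= lr * g_enc

mse = float(np.mean((X @ W_enc @ W_dec - X) ** 2))
print(f"reconstruction mse after pre-training: {mse:.3f}")
```

After this stage the encoder (`W_enc` here) is kept and a supervised head is trained on top of its outputs, which is exactly the hand-off the refactor above formalises.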
Just as a reminder, the current deep learning models for tabular data available in the library are:
- Wide
- TabMlp
- TabResNet
- TabNet
- TabTransformer
- FTTransformer
- SAINT
- TabFastformer
- TabPerceiver
- BayesianWide
- BayesianTabMlp
- The text-related component now has 3 available models, all based on RNNs. There are reasons for that, although integration with the Hugging Face Transformers library is the next step in the development of the library. The 3 models available are:
- BasicRNN
- AttentiveRNN
- StackedAttentiveRNN
The last two are based on Hierarchical Attention Networks for Document Classification. See the docs for details.
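The core of those attentive models is attention pooling over the RNN's hidden states: score each timestep against a learned context vector, softmax the scores, and take the weighted sum. A minimal NumPy sketch (the context vector `u` and the shapes are illustrative, not the library's code):

```python
import numpy as np

def attention_pool(H, u):
    """Pool hidden states H (seq_len, hidden) into one vector using a
    learned context vector u (hidden,), as in the HAN paper."""
    scores = H @ u                     # one relevance score per timestep
    a = np.exp(scores - scores.max())  # numerically stable softmax
    a /= a.sum()
    return a @ H                       # convex combination of states

rng = np.random.default_rng(0)
H = rng.normal(size=(5, 3))            # 5 timesteps, hidden size 3
doc = attention_pool(H, rng.normal(size=3))
print(doc.shape)                       # (3,)
```

Because the weights form a convex combination, the pooled vector always lies within the span of the hidden states it summarises.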
- The image-related component is now fully integrated with the latest torchvision release and its new Multi-Weight Support API. Currently, the model variants supported by our library are:
- resnet
- shufflenet
- resnext
- wide_resnet
- regnet
- densenet
- mobilenet
- mnasnet
- efficientnet
- squeezenet
Move docs to mkdocs
Simply updates all documentation.
Probabilistic Models and Label/Feature Distribution smoothing
This release fixes some minor bugs but mainly brings a couple of new functionalities:
- New experimental attentive models, namely `ContextAttentionMLP` and `SelfAttentionMLP`.
- 2 probabilistic models based on Bayes by Backprop (BBP) as described in Weight Uncertainty in Neural Networks, namely `BayesianTabMlp` and `BayesianWide`.
- Label and Feature Distribution Smoothing (LDS and FDS) for Deep Imbalanced Regression (DIR) as described in Delving into Deep Imbalanced Regression.
- Better integration with `torchvision` for the `deepimage` component of a `WideDeep` model.
- 3 available models for the `deeptext` component of a `WideDeep` model, namely `BasicRNN`, `AttentiveRNN` and `StackedAttentiveRNN`.
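As a rough illustration of the LDS idea (a toy NumPy sketch following the recipe in the DIR paper: smooth the empirical label histogram with a Gaussian kernel and re-weight samples by the inverse smoothed density; the function name and defaults below are made up for the example):

```python
import numpy as np

def lds_weights(labels, n_bins=10, sigma=1.0, half_width=2):
    """Toy Label Distribution Smoothing: inverse of the kernel-smoothed
    label density, normalised to mean 1."""
    counts, edges = np.histogram(labels, bins=n_bins)
    idx = np.arange(-half_width, half_width + 1)
    kernel = np.exp(-idx**2 / (2 * sigma**2))   # Gaussian over bin offsets
    kernel /= kernel.sum()
    smoothed = np.convolve(counts, kernel, mode="same")
    # map each sample to its (smoothed) bin density
    bins = np.clip(np.digitize(labels, edges) - 1, 0, n_bins - 1)
    w = 1.0 / np.maximum(smoothed[bins], 1e-8)
    return w * len(w) / w.sum()

labels = np.concatenate([np.zeros(90), np.ones(10)])  # imbalanced target
w = lds_weights(labels)
# rare targets get larger training weights than frequent ones
print(w[labels == 1].mean() > w[labels == 0].mean())  # True
```

The smoothing step is what distinguishes LDS from plain inverse-frequency weighting: neighbouring label bins share density, which suits continuous regression targets.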
Transformers without categorical data
This minor release simply fixes issue #53, related to the fact that `SAINT`, the `FTTransformer` and the `TabFastformer` failed when the input data had no categorical columns.
v1.0.9: The TabFormer Family Grows
Functionalities:
- Added a new functionality called `Tab2Vec` that, given a trained model and a fitted tabular preprocessor, returns an input dataframe transformed into embeddings.
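A toy illustration of what such a transformation produces (the embedding lookup below is invented for the example and is not the `Tab2Vec` API):

```python
import numpy as np

# Hypothetical, hand-made embeddings standing in for what a trained
# model would provide for each categorical value.
embeddings = {
    ("city", "NY"): np.array([0.1, 0.2]),
    ("city", "SF"): np.array([0.3, 0.4]),
}

def row2vec(row, cat_cols, cont_cols):
    """Replace categorical values by their embeddings and append the
    continuous columns, yielding one dense vector per row."""
    parts = [embeddings[(c, row[c])] for c in cat_cols]
    parts.append(np.array([row[c] for c in cont_cols], dtype=float))
    return np.concatenate(parts)

vec = row2vec({"city": "NY", "age": 30}, cat_cols=["city"], cont_cols=["age"])
print(vec.tolist())  # [0.1, 0.2, 30.0]
```

The resulting dense vectors can then be fed to any downstream estimator, e.g. a gradient-boosting model.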
TabFormers: increased the TabFormer (Transformers for Tabular Data) family
- Added a proper implementation of the FT-Transformer with Linear Attention (as introduced in the Linformer paper)
- Added a TabFastFormer model, an adaptation of the FastFormer for Tabular Data
- Added a TabPerceiver model, an adaptation of the Perceiver for Tabular Data
Docs
- Refined the docs to make them clearer and fixed a few typos
SAINT and the FT-Transformer
The two main additions to the library are:
- SAINT from SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training and
- FT-Transformer from Revisiting Deep Learning Models for Tabular Data
In addition:
- New DataLoader for imbalanced datasets. See here.
- Integration with torchmetrics.
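The idea behind a DataLoader for imbalanced data can be sketched with inverse-frequency sampling (a plain-Python stand-in, not the library's implementation):

```python
import random
from collections import Counter

# Sample each row with probability inversely proportional to its class
# frequency, so minority-class rows show up as often as majority ones.
labels = [0] * 95 + [1] * 5
freq = Counter(labels)
weights = [1.0 / freq[y] for y in labels]

random.seed(0)
sampled = random.choices(range(len(labels)), weights=weights, k=1000)
minority_share = sum(labels[i] == 1 for i in sampled) / 1000
print(f"minority share in sampled batches: {minority_share:.2f}")  # ~0.50
```

Each class contributes equal total weight (95 × 1/95 = 5 × 1/5), so batches drawn this way are balanced in expectation.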
TabNet and v1 ready
This release represents a major step forward for the library in terms of functionalities and flexibility:
- Ported TabNet from the fantastic implementation of the guys at dreamquark-ai.
- Callbacks are now more flexible and save more information.
- The `save` method in the `Trainer` is more flexible and transparent.
- The library has been extensively tested via experiments against `LightGBM` (see here).
v0.4.8: WideDeep with the TabTransformer
This release represents an almost-complete refactor of the previous version and I consider the code in this version well tested and production-ready. The main reason why this release is not v1 is because I want to use it with a few more datasets, but at the same time I want the version to be public to see if others use it. Also, I want the changes from the last Beta and v1 to be not too significant.
This version is not backwards compatible (at all).
These are some of the structural changes:
- Building the model and training the model are now completely decoupled
- Added the `TabTransformer` as a potential `deeptabular` component
- Renamed many of the parameters so that they are consistent between models
- Added the possibility of customising almost every single component: model component, losses, metrics and callbacks
- Added R2 metrics for regression problems
v0.4.7: individual components can run independently and image treatment replicates that of Pytorch
The treatment of the image datasets in `WideDeepDataset` replicates that of PyTorch. In particular, this source code:

```python
if isinstance(pic, np.ndarray):
    # handle numpy array
    if pic.ndim == 2:
        pic = pic[:, :, None]
```
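In isolation, that branch simply gives 2-D grayscale arrays an explicit channel axis, so that every image downstream has an H x W x C shape:

```python
import numpy as np

pic = np.zeros((28, 28))        # a 2-D grayscale "image"
if pic.ndim == 2:
    pic = pic[:, :, None]       # add a trailing channel axis
print(pic.shape)                # (28, 28, 1)
```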
In addition, I have added the possibility of using each of the model components in isolation and independently. That is, one could now use the `wide`, `deepdense` (either `DeepDense` or `DeepDenseResnet`), `deeptext` and `deepimage` components independently.