Is it possible to use sparse matrix as input? #297

Closed

albert-ying opened this issue May 15, 2021 · 7 comments
Labels: enhancement (New feature or request)

@albert-ying

Hi, I want to train TabNet on a very large, sparse dataset. I noticed that TabNet only takes dense arrays as input. However, the dataset is too big to be converted to a dense format; my RAM can only hold about 10% of the samples in dense form.

I'm wondering if there is or will be any workaround for this kind of problem?

Thank you so much!

albert-ying added the enhancement (New feature or request) label on May 15, 2021
@Optimox
Collaborator

Optimox commented May 15, 2021

hello @albert-ying,

TabNet can't handle sparse matrices as input.

The thing is, you would have to densify your data before it reaches the dataloader and the network. That is feasible, but the dataloader is not easily accessible in the current implementation of the library, and I'm not sure how efficient it is to randomly sample from a sparse matrix.
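
A minimal sketch of what on-the-fly densification could look like, assuming a scipy CSR matrix and a plain PyTorch `Dataset` (this is not part of pytorch-tabnet's own dataloader):

```python
# Sketch only: densify one row at a time from a scipy CSR matrix, so only a
# single batch ever lives in RAM as a dense float32 array.
import numpy as np
import torch
from torch.utils.data import Dataset, DataLoader

class SparseRowDataset(Dataset):
    def __init__(self, X_csr, y):
        self.X = X_csr.tocsr()          # row slicing is cheap on CSR
        self.y = np.asarray(y, dtype=np.float32)

    def __len__(self):
        return self.X.shape[0]

    def __getitem__(self, idx):
        # densify just this row (shape: [n_features])
        x = self.X[idx].toarray().ravel().astype(np.float32)
        return torch.from_numpy(x), torch.tensor(self.y[idx])

# Example usage, with random sampling handled by the DataLoader:
# from scipy import sparse
# X = sparse.random(100_000, 20_000, density=0.01, format="csr")
# y = np.random.rand(100_000)
# loader = DataLoader(SparseRowDataset(X, y), batch_size=1024, shuffle=True)
```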

Why do you need a sparse matrix? For large files that don't fit in RAM, we should probably implement loading from disk.
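
For the from-disk idea, one workaround available today outside the library is a NumPy memmap, so rows are paged in from disk instead of held in RAM; a rough sketch (the file name and sizes are placeholders, and this is not a pytorch-tabnet feature):

```python
# Rough sketch of "from disk loading" with a NumPy memmap: the dense array
# lives on disk and rows are read lazily on access.
import numpy as np

n_samples, n_features = 100_000, 20_000   # placeholder sizes; ~8 GB on disk at float32
X_disk = np.memmap("X_dense.dat", dtype=np.float32, mode="w+",
                   shape=(n_samples, n_features))

# Fill it chunk by chunk from the sparse source so RAM stays bounded
# (X_sparse is assumed to be a scipy CSR matrix):
# for start in range(0, n_samples, 10_000):
#     X_disk[start:start + 10_000] = X_sparse[start:start + 10_000].toarray()
# X_disk.flush()

# Rows can then be indexed like a normal ndarray without loading the whole file:
# batch = np.asarray(X_disk[row_indices])
```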

@albert-ying
Author

Thank you for the quick response, @Optimox!

The dataset I'm using is very sparse, and in glmnet (which I used before) preserving the sparsity uses less RAM and improves training speed. Maybe that's not the case for a deep learning method, and disk loading is a better workaround here.

I'm also wondering if you have any suggestions on hyperparameter settings for a sparse dataset. It is a regression task with one numeric outcome and ~20,000 numeric input columns. Currently I use the defaults and they work okay, but it would be impractical to fine-tune them because the dataset is so big. It would be great if you could point me in a direction, based on your experience, that I could try.

Thank you so much!

@IchiruTake

You can try the uint8 dtype to lower the memory burden.
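
For scale, the effect of the dtype alone looks roughly like this (toy sizes, assuming a plain NumPy array; note the float32 caveat in the next reply):

```python
# Sketch: memory effect of the dtype alone (values must fit in 0..255 for uint8).
import numpy as np

X = np.random.randint(0, 2, size=(1_000, 20_000), dtype=np.int64)  # toy dense matrix
print(X.nbytes / 1e6)                 # ~160 MB at int64
X_small = X.astype(np.uint8)
print(X_small.nbytes / 1e6)           # ~20 MB, 8x smaller
```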

@Optimox
Collaborator

Optimox commented May 20, 2021

@albert-ying it's hard to give parameters in the dark.

If you want to train faster, I would advise using a GPU. A one-cycle learning-rate scheduler can also help reduce the number of epochs required for convergence, so you'll be able to iterate faster on your parameter tuning.

About the memory burden, @IchiruTake, I think it's not that simple, since in the end you need to feed float32 to the network. I'll probably also implement the half-precision feature #150 in order to reduce the size by half (and speed up training as well).
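
A rough sketch of the one-cycle idea with the model's `scheduler_fn` / `scheduler_params` arguments (all hyperparameter values below are placeholders, not recommendations; whether the scheduler is stepped per epoch or per batch depends on the installed pytorch-tabnet version, so `total_steps` here assumes one step per epoch and should be checked against your version):

```python
# Sketch only: TabNetRegressor on GPU with a OneCycleLR scheduler.
import torch
from pytorch_tabnet.tab_model import TabNetRegressor

MAX_EPOCHS = 100

model = TabNetRegressor(
    optimizer_fn=torch.optim.Adam,
    optimizer_params={"lr": 2e-2},
    scheduler_fn=torch.optim.lr_scheduler.OneCycleLR,
    scheduler_params={"max_lr": 2e-2, "total_steps": MAX_EPOCHS},
    device_name="cuda",          # use the GPU if one is available
)

# model.fit(
#     X_train, y_train.reshape(-1, 1),
#     eval_set=[(X_valid, y_valid.reshape(-1, 1))],
#     max_epochs=MAX_EPOCHS,
#     batch_size=1024,
#     virtual_batch_size=128,
# )
```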

@IchiruTake

Yes, I agree that uint8 alone won't solve the whole problem. In fact, I once handled an input matrix of 581k observations and 7,465 features, 7,421 of which were sparse (2.02% density), with dtype=np.uint8. You can make your network semi-supervised: build two or more sub-models with different network structures (different numbers of layers and neurons), merge them with a concatenation layer, and standardize your predictions. Then use TabNet to fit your target on the output of that last concatenation layer. That was much easier than searching for the associated "importance" features.
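
One possible reading of this idea, sketched with plain PyTorch modules (the module names, sizes, and depths are made up for illustration, and the sub-models would first need to be trained on some auxiliary objective; this is not something pytorch-tabnet provides):

```python
# Sketch: several sub-encoders with different structures, merged by concatenation.
import torch
import torch.nn as nn

class SubEncoder(nn.Module):
    """A small MLP; several of these with different depths/widths act as the sub-models."""
    def __init__(self, n_features, hidden, out_dim):
        super().__init__()
        layers, prev = [], n_features
        for h in hidden:
            layers += [nn.Linear(prev, h), nn.ReLU()]
            prev = h
        layers.append(nn.Linear(prev, out_dim))
        self.net = nn.Sequential(*layers)

    def forward(self, x):
        return self.net(x)

n_features = 20_000
encoders = nn.ModuleList([
    SubEncoder(n_features, hidden=[512, 128], out_dim=32),  # deeper sub-model
    SubEncoder(n_features, hidden=[256], out_dim=32),        # shallower sub-model
])

def concatenated_features(x):
    # "merge them with a concatenation layer": stack the sub-model outputs side by side
    return torch.cat([enc(x) for enc in encoders], dim=1)    # shape (batch, 64)

# The concatenated (and standardized) 64-dim output would then replace the raw
# 20,000 sparse columns as the dense input given to TabNet.
```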

@albert-ying
Author

@IchiruTake Wow, that sounds like a good way to do it and might solve my problem! I'm wondering if there are any examples (a tutorial or notebook) of this that I could refer to? Thank you so much!

@Optimox
Collaborator

Optimox commented May 27, 2021

I'm closing this, as we are not going to allow sparse matrices; the solution might come from #143.

@Optimox Optimox closed this as completed May 27, 2021