A terminal spreadsheet multitool for discovering and arranging data
-
Updated
Jul 1, 2024 - Python
A terminal spreadsheet multitool for discovering and arranging data
End-to-end ML project for tabular data.
Characterization of relational table embeddings (VLDB 2024).
A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue, Solid and Svelte.
Get classification risk scores on tabular tasks using LLMs
A comprehensive toolkit and benchmark for tabular data learning, featuring over 20 deep methods, more than 10 classical methods, and 300 diverse tabular datasets.
a Stellar Dynamics Toolbox (Not Everybody Must Observe)
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
The modern React DataGrid for building apps — faster
MSBoost is a gradient boosting algorithm that improves performance by selecting the best model from multiple parallel-trained models for each layer, excelling in small and noisy datasets.
Fast and Accurate ML in 3 Lines of Code
Benchmarking synthetic data generation methods.
Treeffuser is an easy-to-use package for probabilistic prediction on tabular data with tree-based diffusion models.
Algorithms for outlier, adversarial and drift detection
A Benchmark of Tabular Machine Learning in-the-Wild with real-world industry-grade tabular datasets
Conditional GAN for generating synthetic tabular data.
Repositorio con el código de los experimentos de mi TFM titulado "Transformación de Datos Tabulares a Imágenes Sintéticas: Optimización y Evaluación de la Librería TINTOlib en Python"
Let's settle down, rest our minds, spill the tea of our experience in multiple ai fields [Data Science, Machine Learning, Deep Learning], including many other aspects starting from prorgramming and clean code till design patterns & businness interference. Enjoy the drink, and if you find something interesting here, offer us a cup of tea.
Python library for embedding inference of relational tables.
CSV Lint plug-in for Notepad++ for syntax highlighting, csv validation, automatic column and datatype detecting, fixed width datasets, change datetime format, decimal separator, sort data, count unique values, convert to xml, json, sql etc. A plugin for data cleaning and working with messy data files.
Add a description, image, and links to the tabular-data topic page so that developers can more easily learn about it.
To associate your repository with the tabular-data topic, visit your repo's landing page and select "manage topics."