Skip to content

A distributed column-oriented database built in Rust.

Notifications You must be signed in to change notification settings

iambriccardo/distribuito

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distribuito

Distribuito logo

Introduction

Distribuito is a column-oriented distributed database inspired by the Retriever database (https://www.youtube.com/watch?v=EMFKVimuyhQ) developed by Honeycomb. I decided to create Distribuito during Sentry's internal hackweek, a week-long hackathon where Sentry employees can work on whatever they want.

This project is intended to be a playground for me to learn about database design and implementation. Currently, the implementation lacks many features:

  • Ability to filter by predicates.
  • Ability to order by.
  • Ability to express partitioning of data by columns.

I hope to be able to finish all the distributed query execution to make the project presentable.

Features

  • Column-oriented with nearly infinite scalability for adding new columns.
  • File-based with an efficient custom file format (Apache Parquet support could be added in the future).
  • Distributed query execution across multiple nodes (still TBD).

About

A distributed column-oriented database built in Rust.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages