A big data scenario for practising how to manage and optimize millions of records

Big data scenarios are not exclusive to big companies. We can design a scenario, mock millions or billions of records, and store them in a database.

Features

  1. Very fast data generation: billions of records are generated concurrently in Node.js cluster mode.
  2. A full problem list that tracks the specific problems we will face in a big data scenario.
  3. Data models based on a live website: https://zhihu.com, a Quora-like Q&A product.
  4. Friendly to developers of all levels: easy to set up, with full tutorials to help.
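
As a rough illustration of the cluster-mode generation mentioned in the first feature, here is a minimal sketch. It is not this project's actual generator: `splitRange`, `mockQuestion`, `runCluster`, the `START`/`END` environment variables, and the record shape are all hypothetical names chosen for the example. It assumes a Node.js version where `cluster.isPrimary` is available (16 or later).

```javascript
const cluster = require('node:cluster');
const os = require('node:os');

// Split the ID range [1, total] into contiguous chunks, at most one per worker.
function splitRange(total, workers) {
  const size = Math.ceil(total / workers);
  const chunks = [];
  for (let start = 1; start <= total; start += size) {
    chunks.push({ start, end: Math.min(start + size - 1, total) });
  }
  return chunks;
}

// Hypothetical record shape, for illustration only.
function mockQuestion(id) {
  return { id, title: `Question #${id}`, createdAt: Date.now() };
}

// The primary process forks one worker per chunk; each worker generates its own ID range.
function runCluster(total, workers = os.cpus().length) {
  if (cluster.isPrimary) {
    for (const chunk of splitRange(total, workers)) {
      // Extra environment variables tell the worker which range it owns.
      cluster.fork({ START: String(chunk.start), END: String(chunk.end) });
    }
  } else {
    const start = Number(process.env.START);
    const end = Number(process.env.END);
    let count = 0;
    for (let id = start; id <= end; id++) {
      mockQuestion(id); // a real worker would batch-insert records into the database here
      count++;
    }
    console.log(`worker ${process.pid} generated ${count} records`);
    process.exit(0);
  }
}
```

To try it, add a call such as `runCluster(1000000)` at the bottom of the file and run it with `node`. Workers re-execute the same file, so the call is reached in both the primary and the workers. Giving each worker a disjoint ID range avoids any coordination between processes while they generate data.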

Hardware requirements

You should have a reasonably high-performance computer, which will speed up your practice. Here is my PC as an example:

  • CPU: 4 x Core i5-7500 @ 3.4GHz
  • Memory: 16G
  • Disk: 256G SSD

Larger data sets require more resources, especially disk space, when your practice involves backup, partitioning, replication, and sharding.

Tutorial

Full tutorial to begin your practice

Requirements