mongodb-vs-postgresql-jsonb

The goal of this POC was to test the implementation and performance of storing JSON data in Postgrsql 9.4+ vs using native JSON structures in MongoDB.

We generate 10k documents in both MongoDB and Postgresql using Spring Boot + JPA + Hibernate and add appropriate indexes.

Getting started

Install local Postgres (with database "postgres")
Install local MongoDB (with database "PgPerf")
Install DynamoDB Local
Run mvn compile to build QueryDSL sources
Run mvn test to run performance tests

sql4lite dependencies for Mac

wget https://d1.almworks.com/.files/sqlite4java/sqlite4java-392.zip
unzip sqlite4java-392.zip
sudo cp sqlite4java-392/*.dylib /Library/Java/Extensions/

Results

We test the following indexes on the JSONB columns:

CREATE INDEX ON example USING BTREE ((data->>'stock'));
CREATE INDEX ON example USING HASH ((data->>'stock'));
CREATE INDEX ON example USING GIN ((data));
CREATE INDEX ON example USING BTREE (cast (data->>'stock' as int));

Time taken

Test	Time taken
testPerfMongo	23.519s
testPerfPg	54.798s

Example data

In MongoDB, documents look like:

In Postgresql, rows look like:

Conclusions

Disadvantages of Postgres/JSONB vs MongoDB

Constraints/Validation

While PostgreSQL JSONB type provides flexibility, it should be used just when appropriate. The only check being performed is that stored data is actually in a valid JSON format. You cannot impose any other constraints as with regular columns - such as not null or enforce a particular Data Type (Integer, VarChar, Date). Therefore it is best suited for providing an additional optional set of data to an entity, where you cannot be sure before which data is would contain. And such data would differ a lot among each of the rows. Such example can be a user-provided set of additional data. You should always carefully consider which data is better suited as regular columns and which should be stored as JSON.

Lack of stats

jsonb columns have a flat 1% statistics rate causing poor lookup strategies (unlike MongoDB)

Range queries

Consider

EXPLAIN ANALYZE SELECT *
FROM example
WHERE to_date(data->>'date', 'YYYY-MM-DD') 
       BETWEEN '2018-02-01' 
       AND     '2020-03-01'
AND data->>'name' = 'Name 7';

or even

EXPLAIN analyze  SELECT * FROM example
WHERE ((data->>'stock')::integer)  > 15000;

The results should that neither approach can use indexes for range queries - a filter need to be run.

GIN Index is based on string format of value

Consider

EXPLAIN ANALYZE SELECT * FROM example
WHERE ((data->>'stock')::integer)  = 15000;

The casting as integer means that a table scan and filter is performed - no index.

However, you can do:

CREATE INDEX ON example USING BTREE (cast (data->>'stock' as int));

which does seem to create a usable index for the above query.

Compound queries

WHERE DATE > ? AND FEATURES.Ref = ?

Acknowledgements

Some code forked from https://www.vojtechruzicka.com/postgresqls-jsonb-type-mapping-using-hibernate/

Official docs:

Postgresql articles:

DynamoDB articles:

General Guidelines for Secondary Indexes in DynamoDB

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
src		src
README.md		README.md
example_jsonb.png		example_jsonb.png
example_mongo.png		example_mongo.png
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

mongodb-vs-postgresql-jsonb

Getting started

sql4lite dependencies for Mac

Results

We test the following indexes on the JSONB columns:

Time taken

Example data

Conclusions

Disadvantages of Postgres/JSONB vs MongoDB

Constraints/Validation

Lack of stats

Range queries

GIN Index is based on string format of value

Compound queries

Acknowledgements

About

Uh oh!

Releases

Packages

Languages

niccottrell/mongodb-vs-postgresql-jsonb

Folders and files

Latest commit

History

Repository files navigation

mongodb-vs-postgresql-jsonb

Getting started

sql4lite dependencies for Mac

Results

We test the following indexes on the JSONB columns:

Time taken

Example data

Conclusions

Disadvantages of Postgres/JSONB vs MongoDB

Constraints/Validation

Lack of stats

Range queries

GIN Index is based on string format of value

Compound queries

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages