Skip to content

Conversation

@fatangare
Copy link

Improving performance while saving data in Postgresql in case of large dataset.

to_sql() method is slow and takes times to save data in Postgres. It is replaced with copy_exprt() to save data in Postgres tables fast.

@fatangare
Copy link
Author

I also added one commit to parallelize compute_norm_cond_entropy_corr() method.

With single thread on hospital data, it takes 3.11 sec. With 6 threads, it takes 1.05 sec on my mac (16GB RAM)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant