This repository contains all the work I did for the Large Scale Computing course at CMU. The assignments mainly involved working with large datasets on the Bridges-2 supercomputer at the Pittsburgh Supercomputing Center (PSC).
Languages and tools used: VSCode, Python (pandas, numpy, scikit-learn), SQL, PySpark, PyTorch, TensorFlow
Topics covered:
- Parallel Programming with Advanced MPI (Python)
- SQL (basic and advanced)
- Machine Learning with Spark
- Recommender System
- Big Data
- Deep Learning