Skip to content

Conversation

@anamrasul-123
Copy link

This pull request includes my solutions for Problem 1 and Problem 2 of the Data Engineering assignment. The following changes have been made:

Problem 1: Implemented a fixed-width parser to correctly extract and transform data based on predefined column widths. Implemented a Docker container as-well. Used few test cases to check validity of code.

Problem 2: Developed big data generation using Faker to create synthetic data and applied hashing techniques for data anonymization. Also implemented test cases to check validity of code.

Both solutions have been tested, and the scripts are structured for easy execution.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant