Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Good first issue | Feature]Add a Multi-Table Auto-Modify Processor #108

Open
Z712023 opened this issue Jan 16, 2024 · 2 comments
Open

[Good first issue | Feature]Add a Multi-Table Auto-Modify Processor #108

Z712023 opened this issue Jan 16, 2024 · 2 comments
Labels
difficulty-easy enhancement New feature or request good first issue Good for newcomers

Comments

@Z712023
Copy link
Collaborator

Z712023 commented Jan 16, 2024

Problem

When modeling multi-table data, I've observed that real-world data often doesn't satisfy foreign key constraints. For example: parents_id=[1,2,3], children_id=[1,2], or parents_id=[1,2,3], children_id=[1,2,3,4].

I wish this tool could automatically assist me in cleaning the data, ensuring that foreign keys exist in both the parent and child tables (e.g., parents_id=[1,2], children_id=[1,2]).

In this way, the data used for multi-table simulation modeling can accurately reflect the associative relationships of foreign keys.

Proposed Solution

Retain only the intersection of foreign keys between the parent table and the child table.

$$ID_{remain} = ID_{parent} \Cap ID_{child}$$

Additional context

TBD

@Z712023 Z712023 added enhancement New feature or request good first issue Good for newcomers difficulty-easy labels Jan 16, 2024
@Z712023 Z712023 changed the title Add a Multi-Table Auto-Modify Processor [Good first issue | Feature]Add a Multi-Table Auto-Modify Processor Jan 17, 2024
@c3kimball
Copy link

I'd like to help with this issue, but I have a few questions:

  1. Which file would this be implemented in or is a new file necessary?
  2. What do you mean by multi table data? do you have an example data set in which this issue would be useful for?
  3. I'd like to more about the foreign key constraint, what it is and how to use it.

Thank you

@c3kimball
Copy link

@Z712023 I did some more looking around in the project, and I've found the single_table base class, would the solution for this issue be similar to this file?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
difficulty-easy enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

2 participants