-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to contribute? #3
Comments
Glad to hear that you found the project interesting. |
I have confirmed. We can have external open-source contributions :-) Before starting with any contribution I recommend that you first go through the documentation and understand the design of the package and concepts like fairness tensor and wrappers. Possible contributions can be
These are a few possible things I could think of. There might be even more. Feel free to discuss in case of any suggestions or feedback or issues. I am available on Julia slack workspace as "Ashrya Agrawal". You can join the workspace using https://slackinvite.julialang.org/ |
@Faldict thanks for reaching out - above is quite comprehensive - let us know where you interst and strength lie and we can work something out - happy to jump on a call. |
Thanks for your response! I think I could start with adding the fairness datasets. My question here is, why does the dataset macro return the tuple (X, Y, Y_hat). If I understand correctly, the Y_hat is the prediction and it may need training on the dataset. Why not return the sensitive attributes directly? |
The macro you are talking about is toy-data with only 10 rows. It returns (X, y, ŷ) just to enable users to try out various things like metrics, etc without fitting an algorithm and predicting. But while adding macros for real datasets like COMPAS, German, Adult, etc. we would not need the macro to return ŷ. So we can normally return (X, y). It is going to be very similar to macros available at https://github.com/alan-turing-institute/MLJBase.jl/blob/master/src/data/datasets.jl#L200 . Let me know if you need further clarification on this. |
Thanks for your clarification. I have added the COMPAS and Adult datasets. Do I need to write the test scripts for them? Another question is that when I am install the package for testing, I meet the following errors:
so that I am not able to install the package. |
Thanks a lot for working on the dataset macros. It would be great if you could write the tests (/tests/datasets/datasets.jl) as well for the datasets you add. I am not sure why this version incompatibility issue is coming on your system. But this MLJFlux package is not required for the package. In the commit 9dee330 I have removed the inessential packages like MLJFlux from the dependencies. Please let me know if you still face any setup issues after pulling the changes. |
@Faldict you might want to look at the macro I have added for German credit data : https://github.com/ashryaagr/MLJFair.jl/blob/master/src/datasets/datasets.jl I hope these make it easier for you to add the macros and tests for other fairness datasets. |
@ashryaagr Thanks a lot! I have fixed this problem. |
Hello,
I find it is really an interesting project, and I would like to make some contribution. What can I start with?
The text was updated successfully, but these errors were encountered: