add gender randomizer #229

tk-sugumar · 2021-08-29T06:12:50Z

No description provided.

kaustubhdhole · 2021-09-02T20:40:21Z

transformations/gender_randomizer/README.md

+Author name: Tabitha Sugumar
+Author email: __
+Author Affiliation: __
+


Thanks for your changes @tk-sugumar . Please add your email and affiliation.

timothy22000 · 2021-09-08T00:29:26Z

transformations/gender_randomizer/README.md

+
+## Examples of this transformation
+
+Because this is a randomized transformation, in both the selection of gender and selection of name, test examples are impossible -- the output for a single sentence is expected to be different in each successive run. Instead I've provided some example sentences and outputs for reference.


I believe you can use a default seed in the argument in init of your GenderRandomizer transformation so you can generate consistent results for your test cases so you can include them in your test.json

Quite a few of the PRs use this approach for test cases.

See for example:
https://github.com/GEM-benchmark/NL-Augmenter/pull/164/files

Thanks Timothy! When I tried this, the same name was predicted for each sentence, so for use as intended I think the user would have to modify the code after downloading. Should I still go ahead and do this?

Hi Timothy, I added in the seed in the initializer, the name names does get predicted each time though, I hope it's ok! Test cases are also added in the test.json

sebastianGehrmann · 2021-09-10T14:15:44Z

transformations/gender_randomizer/README.md

+Author Affiliation: Elsevier
+
+## What type of a transformation is this?
+This transformation changes names in English texts, randomizing selection so there's an even chance of male and female names. It modifies pronouns to match the selected name.


Please add an acknowledgement that names are not deterministic identifiers of someones pronouns/gender :)

…Augmenter into gender_randomizer

msobrevillac · 2021-09-19T01:46:32Z

transformations/gender_randomizer/transformation.py

+Randomizes names in text for a 50/50 gender breakdown. Handles pronouns.
+"""
+nlp = spacy.load("en_core_web_sm", disable=["lemmatizer"])
+nlp.add_pipe("coreferee")


You might want to use spacy like this.

Modified as given in example

msobrevillac · 2021-09-19T01:47:48Z

transformations/gender_randomizer/transformation.py

+class GenderRandomizer(SentenceOperation):
+    tasks = [TaskType.TEXT_TO_TEXT_GENERATION]
+    languages = ["en"]
+


Please, add some keywords here.

…itialization, added tests to text.json

kaustubhdhole · 2021-09-30T14:10:44Z

transformations/gender_randomizer/README.md

+## What tasks does it intend to benefit?
+This is intended to avoid gender bias in natural language processing models. Run this transformation on text data prior to using it to train a model.
+
+## Previous Work


Importantly please add a Data and Code Provenance section to your transformation. Also, seems you've added about a 109 files which are hard to evaluate. I would suggest moving this into a separate pip project out of this and then adding it to the requirements.txt.

Thanks! I've expanded on the data and code provenance, and put the description in a Data and Code Provenance section in the Readme.

On the 109 files -- most of them come from the coreferee directory -- this actually already exists as a library installable by pip, but when I was working on this was only installable in python 3.8 and the current version requires python 3.9. Since these transformations are required to be compatible with python 3.7, I downloaded here to make it installable in python 3.7.

kaustubhdhole · 2021-10-09T14:55:08Z

Hi @tk-sugumar, it won't be a good idea to merge all of these in the repository. It would be better to make a pip library out of it in a separate repository and call only the relevant parts here. @AbinayaM02 thoughts

AbinayaM02 · 2021-10-11T05:37:08Z

Hi @tk-sugumar, it won't be a good idea to merge all of these in the repository. It would be better to make a pip library out of it in a separate repository and call only the relevant parts here. @AbinayaM02 thoughts

Agreed. Like @kaustubhdhole mentioned, you should be installing the library (specify it in the reuirements.txt) and use it for your transformation @tk-sugumar. You can check if the library works fine for python 3.7.

add gender randomizer

74fe85e

kaustubhdhole added the transformation label Sep 2, 2021

kaustubhdhole reviewed Sep 2, 2021

View reviewed changes

Update README.md

43ac17c

timothy22000 reviewed Sep 8, 2021

View reviewed changes

sebastianGehrmann reviewed Sep 10, 2021

View reviewed changes

tk-sugumar added 2 commits September 12, 2021 00:33

Read me modifications, adding himself/herself handling

16ed1cb

Merge branch 'gender_randomizer' of https://github.com/tk-sugumar/NL-…

328c910

…Augmenter into gender_randomizer

msobrevillac reviewed Sep 19, 2021

View reviewed changes

Added keywords, changed spacy pipeline import, used random_seed in in…

05fc620

…itialization, added tests to text.json

mille-s self-requested a review September 30, 2021 10:58

mille-s approved these changes Sep 30, 2021

View reviewed changes

kaustubhdhole reviewed Sep 30, 2021

View reviewed changes

Update README.md

7c2c3ce

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add gender randomizer #229

add gender randomizer #229

tk-sugumar commented Aug 29, 2021

kaustubhdhole Sep 2, 2021

tk-sugumar Sep 3, 2021

timothy22000 Sep 8, 2021

tk-sugumar Sep 12, 2021 •

edited

Loading

tk-sugumar Sep 21, 2021

sebastianGehrmann Sep 10, 2021

tk-sugumar Sep 12, 2021

msobrevillac Sep 19, 2021

tk-sugumar Sep 21, 2021

msobrevillac Sep 19, 2021

tk-sugumar Sep 21, 2021

kaustubhdhole Sep 30, 2021

tk-sugumar Sep 30, 2021

kaustubhdhole commented Oct 9, 2021

AbinayaM02 commented Oct 11, 2021


		## Examples of this transformation

		Because this is a randomized transformation, in both the selection of gender and selection of name, test examples are impossible -- the output for a single sentence is expected to be different in each successive run. Instead I've provided some example sentences and outputs for reference.

add gender randomizer #229

Are you sure you want to change the base?

add gender randomizer #229

Conversation

tk-sugumar commented Aug 29, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tk-sugumar Sep 12, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kaustubhdhole commented Oct 9, 2021

AbinayaM02 commented Oct 11, 2021

tk-sugumar Sep 12, 2021 •

edited

Loading