GitHub - feminist-ai/hacking-llms-with-feminism: Using adversarial and hacker thinking to hack feminist AIs from today's LLMs

Hacking LLMs for Feminism

Can you deconstruct oppressive notions in today's LLMs and use hacker/adversarial thinking to build a more feminist AI?

This repository explores the question and introduces ideas for hacking LLMs for feminism. So far it:

Explores simple and advanced prompt engineering
Takes inspiration from successful adversarial attacks
Identifies conceptual links in text embeddings
More ideas (add yours here!)

Each notebook guides participants of a Feminist AI LAN Party through exercises, questions and conversations. It's more fun to hack with friends, so it's encouraged to do just that by hosting your own party.

Setup and Requirements

Most of these notebooks can be run locally on your own computer with Mozilla's Llamafiles.

Some notebooks work better if you have an LLM set up for inference via the Local Area Network (LAN). Stay tuned for a full instruction setup with photos on how to do this on your own GPUs or with donated or shared computers.

To get your local computer setup:

Install at least one, but preferably several llamafiles. Follow the Readme instructions there and test that it is working properly by running the file and testing the chat browser that pops up.
Clone this repository onto your computer.
Open a new terminal and set up a miniconda or virtualenv with at least Python3.10. Activate this environment. You will now see (environment_name) in your terminal.
In that same terminal, navigate to the root folder of this repository on your computer (usually cd FULL_PATH_TO_THIS_FOLDER ) and install the requirements.txt file using pip install -r requirements.txt.
In that same terminal window, run jupyter notebook.

You should now see a browser window that opens up and the notebooks will be viewable. Click on one to get started!

In case there are folks who are new to Python/Jupyter in the room, take time to help one another getting set up. Contributions to improve these instructions very welcome!

Contributions

Contributions are greatly welcome from any and all feminist hackers interested in exposing and challenging oppressive systems in LLMs.

Feminist AI LAN Party Hosts and Attendees: Find bugs? Have a suggestion? Develop a new idea or attack? Please share!

Researchers: If you already have a research library or article that people can or should take inspiration from, please share any open-source-able work! If your code is not yet open-sourced or shareable, please feel free to also open an Issue describing an idea or sharing your research.

Hackers/Programmers: If you are good at Python or attacking systems but new to LLMs, feel free to peruse open ideas and see if you can implement an already shared idea. Collaboration is key to destroying oppression! :)

Data Scientists/ML Engineers: Have research or ideas you want to implement? Host a party or pair up with someone to add notebooks to the conversation!

Everyone: Feminism is for everybody. If you find anything you'd like to add or improve, please feel free to open an Issue, send a pull request or even just say hi.

There are a few open issues already, so go take a look in case you can help!

In general, please follow the following workflow:

Fork this repository.
Create a new branch: git checkout -b feature-name.
Make your changes. Please make sure that any new libraries get added to the requirements and try to test backwards compatibility with other requirements or make a note of any conflicts in your pull request.
Push your branch: git push origin feature-name.
Create a pull request.

Thank you for any and all improvements and contributions!

License

This repository uses a GNU General Public License.

If you need a different license in order to contribute, please open an issue to discuss.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
01 - Feminist Prompt Engineering.ipynb		01 - Feminist Prompt Engineering.ipynb
02 - Crescendo for Feminism.ipynb		02 - Crescendo for Feminism.ipynb
03 - Exploring Feminism via Embeddings.ipynb		03 - Exploring Feminism via Embeddings.ipynb
LICENSE		LICENSE
README.md		README.md
llamafile_client.py		llamafile_client.py
requirements.txt		requirements.txt
settings.py		settings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Hacking LLMs for Feminism

Setup and Requirements

Contributions

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

feminist-ai/hacking-llms-with-feminism

Folders and files

Latest commit

History

Repository files navigation

Hacking LLMs for Feminism

Setup and Requirements

Contributions

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages