Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add langsan sanitization feature #8

Merged
merged 1 commit into from
Oct 16, 2024
Merged

add langsan sanitization feature #8

merged 1 commit into from
Oct 16, 2024

Conversation

mdegans
Copy link
Owner

@mdegans mdegans commented Oct 16, 2024

Feature is on by default. Anthropic does not consider this a concern, but I do.

  • Addresses hidden unicode text.
  • Language model will see [X BYTES SANITIZED] where text was stripped. So will humans if the language model genrates anything.

This breaks some const constructors so is a minor revision. Const constructors might return in the future with const sanitization.

Feature is on by default because. Anthropic does not consider this a concern, but I do.

- Addresses hidden unicode text.
- Language model will see `[X BYTES SANITIZED]` where text was stripped. So will humans if the language model genrates anything.
@mdegans mdegans merged commit 05cd885 into main Oct 16, 2024
3 checks passed
@mdegans mdegans deleted the dev branch October 16, 2024 23:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant