Fix XSS issue in safe mode (#601) by Crozzers · Pull Request #602 · trentm/python-markdown2

Crozzers · 2024-09-22T15:14:42Z

This PR fixes #601 by making the HTML tokenisation regex more lenient.

Currently, the regex matches HTML tags and their attributes, using [\w-] for the attribute name. However, the HTML spec says:

Attribute names must consist of one or more characters other than controls, U+0020 SPACE, U+0022 ("), U+0027 ('), U+003E (>), U+002F (/), U+003D (=), and noncharacters

The current character class does not cover all of these possibilities, so I've updated it to instead exclude the banned characters: [^<>"'=/].

I also tweaked how the attribute values are matched. The current regex only allows quoted values, which lets src=# onerror=alert() slip past. I've added another clause to the regex that matches unquoted attribute values as long as they don't contain a space (because a space would mark the start of the next attribute).
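The effect of the two changes can be illustrated with a minimal sketch. The patterns below are simplified stand-ins written for this example, not the actual tokenisation regex from markdown2; the names `OLD_TAG`, `NEW_TAG`, and `is_whole_tag` are hypothetical, and the whitespace exclusion in the attribute-name class is an assumption added for the sketch.

```python
import re

# Hypothetical "before" pattern: attribute names limited to [\w-],
# and attribute values must be quoted.
OLD_TAG = re.compile(r"""</?\w+(?:\s+[\w-]+=(?:"[^"]*"|'[^']*'))*\s*/?>""")

# Hypothetical "after" pattern: attribute names are anything except the
# banned characters (plus whitespace, an assumption of this sketch), and
# values may be double-quoted, single-quoted, or unquoted without spaces.
NEW_TAG = re.compile(
    r"""</?\w+(?:\s+[^<>"'=/\s]+=(?:"[^"]*"|'[^']*'|[^\s"'>]+))*\s*/?>"""
)


def is_whole_tag(pattern: re.Pattern, text: str) -> bool:
    """Return True if the pattern recognises the entire string as one tag."""
    return pattern.fullmatch(text) is not None


# The payload shape mentioned in the description: unquoted attribute values.
PAYLOAD = "<img src=# onerror=alert()>"

# An attribute name containing a character outside [\w-].
ODD_NAME = '<img src="x" @click="y">'

if __name__ == "__main__":
    for sample in ('<a href="x">', PAYLOAD, ODD_NAME):
        print(sample, is_whole_tag(OLD_TAG, sample), is_whole_tag(NEW_TAG, sample))
```

In this sketch the stricter pattern fails to recognise the payload as a tag at all, while the more lenient one matches it in full. A tokeniser that does not recognise a tag would presumably treat it as ordinary text, which is consistent with the payload slipping past as described above.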

nicholasserra · 2024-09-23T18:38:29Z

Thank you!

Crozzers and others added 2 commits September 22, 2024 16:12

Fix XSS issue in safe mode (trentm#601)

4278f3e

Merge branch 'master' into fix-xss-601

e266576

nicholasserra merged commit cc432bf into trentm:master Sep 23, 2024

nicholasserra merged 2 commits into trentm:master from Crozzers:fix-xss-601
