Add a fast path for NameSets without wildcards #241

jparise · 2025-04-02T15:35:18Z

When the set of ignored names doesn't use shell-style wildcards, we can use the faster `frozenset.__contains__` base implementation rather than the more expensive iteration-based fnmatch'ing approach. This is true for the default ignore list, and I expect it's the more common case by far, even for those who add their own ignored names.

In a simple local benchmark, this results in a 2x speed improvement for that common (default) path, which I think justifies the small additional complexity.
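For readers without the diff open, here is a minimal sketch of the idea described above. It is not the merged code: the class name `NameSet` comes from the PR title, while the `_uses_fnmatch` attribute name and the exact method bodies are assumptions made for illustration. The sketch scans `obj` rather than the original `iterable`, so a one-shot iterable such as a generator is only consumed once.

```python
from fnmatch import fnmatchcase
from typing import Iterable


class NameSet(frozenset):
    """Illustrative sketch: a frozenset of names with optional wildcard matching."""

    def __new__(cls, iterable: Iterable[str]):
        obj = super().__new__(cls, iterable)
        # Hypothetical attribute name. Scan the stored names (not `iterable`)
        # so that generators are not iterated a second time.
        obj._uses_fnmatch = any(
            "*" in name or "?" in name or "[" in name for name in obj
        )
        return obj

    def __contains__(self, item: object) -> bool:
        if not self._uses_fnmatch:
            # Fast path: no wildcards anywhere, so a plain hash lookup suffices.
            return super().__contains__(item)
        # Slow path: compare the item against every pattern in the set.
        return any(fnmatchcase(item, pattern) for pattern in self)
```

With this shape, `"setUp" in NameSet(["setUp", "tearDown"])` is answered by the hash lookup, and only sets containing a pattern such as `"test_*"` pay for the per-pattern `fnmatchcase` loop.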

sigmavirus24 · 2025-04-02T17:24:28Z

src/pep8ext_naming.py

+
+    def __new__(cls, iterable: Iterable[str]):
+        obj = super().__new__(cls, iterable)
+        obj._fnmatch = any(c in r"*?[" for name in iterable for c in name)


Is it faster to loop over the characters in the name, or over the 3 characters in the raw string (which doesn't strictly need to be raw)?

Also, would it be faster still to turn each name into a frozenset and look for the intersection of that with the frozenset of those three characters?

It's actually fastest (by 2-3x) to check one wildcard character at a time, presumably because we hit a memchr or similar fast path internally:

any("*" in s or "?" in s or "[" in s for s in iterable)
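For anyone who wants to reproduce the comparison, the three approaches discussed in this thread can be timed with a rough script like the one below. The function names, the sample name list, and the iteration count are all assumptions chosen for illustration, not the benchmark that was actually run, and the numbers will vary by machine and Python version.

```python
import timeit

WILDCARDS = frozenset("*?[")


def has_wildcard_per_char(names):
    # Approach from the original diff: test every character of every name.
    return any(c in "*?[" for name in names for c in name)


def has_wildcard_intersection(names):
    # Reviewer's suggestion: intersect each name's characters with the wildcard set.
    return any(WILDCARDS.intersection(name) for name in names)


def has_wildcard_substring(names):
    # Approach adopted in the reply: one substring search per wildcard character.
    return any("*" in s or "?" in s or "[" in s for s in names)


if __name__ == "__main__":
    # Hypothetical sample data with no wildcards, i.e. the worst case for detection.
    sample = ["setUp", "tearDown", "setUpClass", "tearDownClass", "asyncSetUp"]
    for fn in (has_wildcard_per_char, has_wildcard_intersection, has_wildcard_substring):
        elapsed = timeit.timeit(lambda: fn(sample), number=1000)
        print(f"{fn.__name__}: {elapsed:.6f}s")
```

All three functions return the same answer for any list of strings; only their cost differs.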

jparise requested a review from sigmavirus24 April 2, 2025 15:35

jparise force-pushed the nameset-fast-path branch from a5d04e8 to 8a091f8 Compare April 2, 2025 15:37

Remove typing.override, which is Python 3.12+

53491e1

jparise force-pushed the nameset-fast-path branch from 8a091f8 to 53491e1 Compare April 2, 2025 15:37

sigmavirus24 reviewed Apr 2, 2025

Speed up wildcard character detection

9d4c6af

sigmavirus24 approved these changes Apr 2, 2025

jparise merged commit 95db1bc into PyCQA:main Apr 3, 2025
6 checks passed

jparise deleted the nameset-fast-path branch April 3, 2025 00:13
