Improve MetalLexer Performance Using Combined Regular Expression #250

pracheeeeez · 2025-01-02T10:29:02Z

The current implementation of MetalLexer iterates through a list of token patterns (TOKENS) and applies individual regex matches sequentially. While functional, this approach can lead to suboptimal performance due to multiple regex evaluations for each position in the input code.

To enhance performance, I propose combining all token patterns into a single master regex using named capturing groups. This approach reduces the overhead of repeated regex evaluations and improves the maintainability of the code.

Current Implementation

    for token_type, pattern in TOKENS:
        regex = re.compile(pattern)
        match = regex.match(self.code, pos)
        # Logic for handling the match

Proposed Improvement

Combine all token patterns into a single regex using named capturing groups. Each pattern will be associated with its corresponding token type:

    TOKENS_PATTERN = "|".join(
        f"(?P<{token_type}>{pattern})" for token_type, pattern in TOKENS
    )
    MASTER_REGEX = re.compile(TOKENS_PATTERN)

This allows the lexer to match all token types in one pass and determine the matched token using match.lastgroup.
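
To make the idea concrete, here is a minimal, self-contained sketch of a tokenizer loop built on the combined regex. The TOKENS table, the SKIPPED set, and the tokenize function below are hypothetical stand-ins for illustration only; the real MetalLexer token list and method names may differ.

```python
import re

# Hypothetical token table for illustration; not the actual MetalLexer TOKENS.
# Each name must be a valid Python identifier to be usable as a group name.
TOKENS = [
    ("COMMENT", r"//[^\n]*"),
    ("FLOAT_NUMBER", r"\d+\.\d+"),
    ("NUMBER", r"\d+"),
    ("IDENTIFIER", r"[A-Za-z_][A-Za-z0-9_]*"),
    ("EQUALS", r"="),
    ("SEMICOLON", r";"),
    ("WHITESPACE", r"\s+"),
]

# Compiled once at import time instead of once per input position.
MASTER_REGEX = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKENS)
)

# Token types that are matched but not emitted (an assumption of this sketch).
SKIPPED = {"WHITESPACE", "COMMENT"}


def tokenize(code):
    """Return a list of (token_type, text) pairs for the given source string."""
    tokens = []
    pos = 0
    while pos < len(code):
        match = MASTER_REGEX.match(code, pos)
        if match is None:
            raise SyntaxError(
                f"Illegal character {code[pos]!r} at position {pos}"
            )
        pos = match.end()
        # lastgroup is the name of the alternative that matched.
        if match.lastgroup not in SKIPPED:
            tokens.append((match.lastgroup, match.group()))
    return tokens
```

Two things to watch for with this approach. Python's alternation picks the first alternative that matches, not the longest, so the order of TOKENS still matters (here FLOAT_NUMBER must come before NUMBER). Also, no pattern should be able to match the empty string, otherwise the loop would stop advancing.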

@NripeshN Can I work on this?

Manya123-max · 2025-01-05T05:34:20Z

I want to work on this

ManeeshDevarasetty · 2025-01-06T05:47:14Z

Can you assign me this work?

raghulchandramouli · 2025-01-06T10:24:32Z

@CrossGL-issue-bot assign me

CrossGL-issue-bot · 2025-01-06T10:24:44Z

The issue you are trying to assign to yourself is already assigned.

huntermarchi · 2025-01-09T20:33:06Z

@CrossGL-issue-bot assign me this issue i will fix this easily

CrossGL-issue-bot · 2025-01-09T20:33:20Z

The issue you are trying to assign to yourself is already assigned.

KeerthiKeswaran · 2025-01-11T06:46:27Z

I'll work on this, assign this to me.

NripeshN assigned pracheeeeez Jan 2, 2025

github-actions bot added the stale label Jan 27, 2025
