refactor(message): rewrite parse logic by olivermrose · Pull Request #84 · usehyperion/hyperion

olivermrose · 2025-08-15T05:20:52Z

Closes #77

Just need to reimplement automod boundary marking

Copilot

Pull Request Overview

This PR refactors the message parsing logic by moving the complex text processing from the UserMessage.toFragments() method to a new dedicated parse.ts module. The main purpose is to improve code organization and maintainability by separating parsing concerns from the message class.

Key changes:

Replaces the toFragments() method with a nodes getter that uses the new parser
Introduces a new parsing system with Node types instead of Fragment types
Updates component interfaces to use the new node structure
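As a rough illustration of the shape described above, here is a minimal sketch of a node-based parser behind a getter. Everything in it is an assumption made for illustration: `MessageNode`, `parseSketch`, the two node kinds, and the caching in the getter are hypothetical stand-ins, not the definitions in `src/lib/message/parse.ts`.

```typescript
// Hypothetical node shapes; the PR's real Node types may carry different fields.
type TextNode = { type: "text"; start: number; end: number; value: string };
type MentionNode = { type: "mention"; start: number; end: number; value: string; login: string };
type MessageNode = TextNode | MentionNode;

// Simplified stand-in for the parser: one left-to-right scan that tags
// @mentions and merges everything else into adjacent text nodes.
function parseSketch(text: string): MessageNode[] {
	const nodes: MessageNode[] = [];
	const tokenPattern = /\S+|\s+/g; // alternating runs of non-whitespace and whitespace
	let match: RegExpExecArray | null;

	while ((match = tokenPattern.exec(text)) !== null) {
		const value = match[0];
		const start = match.index;
		const end = start + value.length;

		if (/^@\w+$/.test(value)) {
			nodes.push({ type: "mention", start, end, value, login: value.slice(1).toLowerCase() });
			continue;
		}

		const last = nodes[nodes.length - 1];
		if (last !== undefined && last.type === "text") {
			// Extend the previous text node instead of emitting a new one.
			last.value += value;
			last.end = end;
		} else {
			nodes.push({ type: "text", start, end, value });
		}
	}

	return nodes;
}

// The message class only exposes the result; caching here is an assumption.
class UserMessage {
	private cached: MessageNode[] | undefined;

	constructor(public readonly text: string) {}

	get nodes(): MessageNode[] {
		if (this.cached === undefined) this.cached = parseSketch(this.text);
		return this.cached;
	}
}

const sketchMessage = new UserMessage("hi @Bob ok");
console.log(sketchMessage.nodes.map((n) => n.type)); // [ 'text', 'mention', 'text' ]
```

Keeping the scan in a standalone function means it can be tested without constructing a message, which is the separation the overview describes.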

Reviewed Changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 4 comments.

File	Description
`src/lib/message/user-message.ts`	Removes complex parsing logic and Fragment types, adds nodes getter using new parser
`src/lib/message/parse.ts`	New parsing module with Node types and parse function for message text processing
`src/lib/message/index.ts`	Exports the new parse module
`src/lib/handlers/eventsub/*`	Updates method calls from `addAutoModMetadata` to `setAutoMod`
`src/lib/components/message/Message.svelte`	Updates to use Node types instead of Fragment types
`src/lib/components/Emote.svelte`	Renames `overlays` prop to `layers`
`package.json`	Adds tldts dependency for URL parsing

Files not reviewed (1)

pnpm-lock.yaml: Language not supported

Copilot · 2025-08-20T12:36:07Z

+		};
+
+		const url = URL.parse(`https://${part.replace(/^https?:\/\/|\.$/i, "")}`);
+		const result = url ? parseTld(url.hostname) : null;


The URL parsing logic is flawed. The regex /^https?:\/\/|\.$/i will remove trailing dots from all text, not just URLs. For example, 'Hello.' would become 'Hello' and incorrectly be parsed as a URL. Consider checking if the part starts with 'http' before applying URL parsing logic.

Suggested change

-		const result = url ? parseTld(url.hostname) : null;
+		let url: URL | null = null;
+		let result: ReturnType<typeof parseTld> | null = null;
+		// Only try to parse as URL if part looks like a URL or domain
+		if (
+			/^https?:\/\//i.test(part) ||
+			// domain-like: contains a dot, no spaces, not just a number
+			(/\./.test(part) && !/\s/.test(part) && !/^\d+$/.test(part))
+		) {
+			const cleaned = part.replace(/^https?:\/\/|\.$/i, "");
+			try {
+				url = URL.parse(`https://${cleaned}`);
+				result = url ? parseTld(url.hostname) : null;
+			} catch {
+				url = null;
+				result = null;
+			}
+		}
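To make the regex behavior under discussion concrete, the sketch below runs the quoted pattern against a few inputs and extracts the suggested guard into a function. The names `stripOnce`, `stripAll`, and `looksLikeUrl` are hypothetical, and the global-flag variant is an assumption added for comparison; it is not part of the PR or the suggestion.

```typescript
// Pattern quoted in the review: a leading protocol OR a trailing dot.
const stripOnce = /^https?:\/\/|\.$/i;
// Assumed variant with the global flag, for comparison only.
const stripAll = /^https?:\/\/|\.$/gi;

// The guard from the suggested change, extracted so it can be tested alone.
function looksLikeUrl(part: string): boolean {
	return (
		/^https?:\/\//i.test(part) ||
		(/\./.test(part) && !/\s/.test(part) && !/^\d+$/.test(part))
	);
}

// Without the `g` flag, String.prototype.replace only replaces the first match.
console.log("Hello.".replace(stripOnce, "")); // "Hello"
console.log("https://example.com.".replace(stripOnce, "")); // "example.com."
console.log("https://example.com.".replace(stripAll, "")); // "example.com"

// The guard treats any token containing a dot as domain-like.
console.log(looksLikeUrl("example.com")); // true
console.log(looksLikeUrl("Hello.")); // true
console.log(looksLikeUrl("Hello")); // false
console.log(looksLikeUrl("1234")); // false
```

Two things fall out of this. A token with both a protocol and a trailing dot keeps its dot unless the pattern is global, and the guard as written still admits `Hello.` because it contains a dot, so a stricter check (for example on the parsed TLD result) would still be needed to reject it.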

Copilot · 2025-08-20T12:36:07Z

+			return hasPrefix && hasBits;
+		});
+
+		const ircEmote = ircEmotes.find((e) => e.code === part);


IRC emotes are being matched by token text rather than by their position ranges. This could cause incorrect matching when the same emote code appears multiple times in a message, as the first occurrence found will be used regardless of position.

Suggested change

-		const ircEmote = ircEmotes.find((e) => e.code === part);
+		const ircEmote = ircEmotes.find((e) => e.start === start && e.end === end);
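The difference between the two lookups can be seen in a small sketch with a repeated emote code. The `IrcEmote` shape, the half-open `[start, end)` ranges, and the helper names are assumptions for illustration; the PR's actual emote type may differ.

```typescript
// Hypothetical emote shape with half-open [start, end) offsets into the message.
interface IrcEmote {
	code: string;
	start: number;
	end: number;
}

// Lookup by token text: always returns the first entry with that code.
function findByCode(emotes: IrcEmote[], part: string): IrcEmote | undefined {
	return emotes.find((e) => e.code === part);
}

// Lookup by position: returns the entry covering exactly this token.
function findByRange(emotes: IrcEmote[], start: number, end: number): IrcEmote | undefined {
	return emotes.find((e) => e.start === start && e.end === end);
}

// "Kappa hi Kappa": the same code appears at [0, 5) and [9, 14).
const sampleEmotes: IrcEmote[] = [
	{ code: "Kappa", start: 0, end: 5 },
	{ code: "Kappa", start: 9, end: 14 },
];

// While scanning the second "Kappa" token (offsets 9 to 14):
console.log(findByCode(sampleEmotes, "Kappa")?.start); // 0
console.log(findByRange(sampleEmotes, 9, 14)?.start); // 9
```

If every occurrence of a code renders identically, the text-based lookup may be harmless in practice; the position-based one matters once the node carries its own range.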

Copilot · 2025-08-20T12:36:07Z

+					});
+				}
+			}
+		} else if (ircEmote) {


The IRC emote handling creates a node with ircEmote.range.start and ircEmote.range.end positions, but these positions may not align with the current token's position. This could result in overlapping or misaligned nodes when the emote spans multiple tokens or partial tokens.
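One way such misalignment can arise is when emote offsets are counted in Unicode code points while JavaScript string indices count UTF-16 code units, which is what the commit titled "workaround improper irc offsets with emojis" appears to address. The sketch below assumes that offset convention; `codePointToUtf16` and `alignsWithToken` are hypothetical helpers, not code from the PR.

```typescript
// Convert an offset counted in code points to a UTF-16 code unit index.
function codePointToUtf16(text: string, codePointOffset: number): number {
	let index = 0;
	for (let seen = 0; seen < codePointOffset && index < text.length; seen++) {
		const codePoint = text.codePointAt(index);
		// Characters above U+FFFF occupy two UTF-16 code units.
		index += codePoint !== undefined && codePoint > 0xffff ? 2 : 1;
	}
	return index;
}

// True when an emote range covers exactly the token being scanned.
function alignsWithToken(
	range: { start: number; end: number },
	tokenStart: number,
	tokenEnd: number,
): boolean {
	return range.start === tokenStart && range.end === tokenEnd;
}

// The emoji is one code point but two UTF-16 code units.
const emojiText = "😀 Kappa";
const reported = { start: 2, end: 7 }; // assumed code point offsets, end exclusive
const converted = {
	start: codePointToUtf16(emojiText, reported.start),
	end: codePointToUtf16(emojiText, reported.end),
};

console.log(converted); // { start: 3, end: 8 }
console.log(emojiText.slice(converted.start, converted.end)); // "Kappa"
console.log(alignsWithToken(reported, 3, 8)); // false
console.log(alignsWithToken(converted, 3, 8)); // true
```

Checking alignment before emitting the node gives a cheap guard: when the range and the token disagree, the parser can fall back to the token's own offsets rather than produce overlapping nodes.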

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

rewrite message parsing

100649e

olivermrose added this to the Beta milestone Aug 15, 2025

olivermrose added 5 commits August 15, 2025 01:24

lint

c261b87

simplify text merging

ec2454f

single pass scan

e74747e

workaround improper irc offsets with emojis

1fe9aa8

auto mod boundaries

4656ede

olivermrose marked this pull request as ready for review August 20, 2025 12:34

olivermrose requested a review from Copilot August 20, 2025 12:34

Copilot AI reviewed Aug 20, 2025

fix double at-sign fallback

c361cc0

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

olivermrose merged commit f4b9c2d into main Aug 20, 2025
1 check passed

olivermrose deleted the refactor/msg-parsing branch August 20, 2025 12:48

olivermrose merged 7 commits into main from refactor/msg-parsing

2 participants
