Optimize egraph execute methods with pre-filtering and equivalence class grouping #38

Copilot · 2025-12-23T06:24:48Z

Optimization Plan for egraph.py

Understanding

The issue requests optimization of the execute method in the Search class to improve efficiency by:

- Pre-filtering candidate terms that can match before checking egraph equivalence
- Grouping by equivalence classes to reduce redundant checks
- Splitting the optimization into two situations: equality queries (A == B) and fact queries (A)

Changes Required

Summary of Changes

- Added fact equivalence cache: rebuild() now pre-computes and caches the equivalent terms for each fact
- Added helper methods:
  - _collect_matching_candidates(): pre-filters terms that can potentially match a pattern using unification (the @ operator); the unification operation is now fully documented
  - _group_by_equivalence_class(): groups terms by their equivalence classes using the egraph's find operation
- Optimized _execute_expr():
  - Collects candidate pools for the LHS and RHS patterns through pre-filtering
  - Groups by equivalence classes
  - Only checks matches within groups that are equivalent
  - Maintains original behavior: continues to check variable unification even after finding strict equality
- Optimized _execute_fact():
  - Collects the candidate pool for the idea pattern through pre-filtering
  - Uses cached fact equivalences from rebuild()
  - Groups by equivalence classes
  - Only checks matches within groups that are equivalent
  - Maintains original behavior: yields for all matching facts
  - Uses the original variable naming convention (target, query) for consistency
- Fixed formatting: removed trailing whitespace to pass the ruff-format pre-commit hook
- Removed dead code: removed the self.pairs attribute, which was no longer used after the optimization
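The two helpers and the grouping step can be illustrated with a minimal, self-contained sketch. Everything here is an assumption made for illustration: the tuple term encoding, the `matches` one-way matcher (a stand-in for the project's unification via `@`), the `UnionFind` class (a stand-in for the egraph's find), and the un-prefixed function names. It is not the code in egraph.py.

```python
from collections import defaultdict

# Hypothetical encoding: a term is a nested tuple such as ("f", "a");
# a string starting with "?" is a pattern variable.


def matches(pattern, term, bindings=None):
    """One-way match of a pattern against a ground term.

    Returns a dict of variable bindings, or None if the match fails.
    """
    bindings = dict(bindings or {})
    if isinstance(pattern, str) and pattern.startswith("?"):
        if bindings.get(pattern, term) != term:
            return None  # variable already bound to a different term
        bindings[pattern] = term
        return bindings
    if isinstance(pattern, tuple) and isinstance(term, tuple) and len(pattern) == len(term):
        for sub_pattern, sub_term in zip(pattern, term):
            bindings = matches(sub_pattern, sub_term, bindings)
            if bindings is None:
                return None
        return bindings
    return bindings if pattern == term else None


class UnionFind:
    """Toy stand-in for the egraph's equivalence tracking."""

    def __init__(self):
        self.parent = {}

    def find(self, x):
        self.parent.setdefault(x, x)
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a, b):
        self.parent[self.find(a)] = self.find(b)


def collect_matching_candidates(pattern, terms):
    """Keep only the terms that the pattern can match at all."""
    return [term for term in terms if matches(pattern, term) is not None]


def group_by_equivalence_class(terms, uf):
    """Bucket terms by the representative returned by find."""
    groups = defaultdict(list)
    for term in terms:
        groups[uf.find(term)].append(term)
    return groups


terms = [("f", "a"), ("g", "b"), ("f", "c"), ("h", "a")]
uf = UnionFind()
uf.union(("f", "a"), ("g", "b"))  # record f(a) == g(b)

lhs_groups = group_by_equivalence_class(collect_matching_candidates(("f", "?x"), terms), uf)
rhs_groups = group_by_equivalence_class(collect_matching_candidates(("g", "?y"), terms), uf)

# Only classes that appear on both sides need the final pairwise check.
shared = [(lhs_groups[root], rhs_groups[root]) for root in lhs_groups if root in rhs_groups]
print(shared)
```

In this toy run, f(c) is pre-filtered in as an LHS candidate but never paired with anything, because no RHS candidate shares its class.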

Performance Impact

The optimization reduces redundant checks by:

- Pre-filtering candidates that cannot match (via a unification check) before checking equivalence
- Grouping by equivalence classes to organize checks efficiently
- Only checking matches within groups that are proven equivalent
- Caching fact equivalences during rebuild() to avoid repeated lookups at query time
- Eliminating unnecessary pair building (removed self.pairs)

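This changes the algorithm from checking all pairs (O(n²)) to checking only pairs inside matching equivalence classes, which significantly reduces computational overhead when there are many terms. The worst case remains quadratic if every term matches the pattern and all terms share one class.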
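To make the saving concrete, the number of pairwise checks can be counted for both strategies. This is a rough counting sketch, not a benchmark of egraph.py: the function names and the integer class ids are hypothetical, and it assumes the cost is dominated by the final pairwise match check.

```python
from collections import Counter


def all_pairs_checks(n_terms):
    """Checks performed when every term is paired with every term."""
    return n_terms * n_terms


def grouped_checks(lhs_classes, rhs_classes):
    """Checks performed when candidates are only paired within a shared class.

    Each argument lists the equivalence class id of every pre-filtered
    candidate on that side of the query.
    """
    lhs = Counter(lhs_classes)
    rhs = Counter(rhs_classes)
    return sum(count * rhs[cls] for cls, count in lhs.items())


# 8 terms in total; 4 survive pre-filtering on each side.
print(all_pairs_checks(8))                         # 64
print(grouped_checks([0, 0, 1, 2], [0, 1, 1, 3]))  # 2*1 + 1*2 + 1*0 = 4
```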

Correctness

All 11 existing tests pass, which indicates that the optimization preserves the behavior of the original implementation while improving efficiency.

The implementation follows the optimization strategy described in the issue:

- Situation 1 (A == B queries): Pre-filters and groups LHS/RHS candidates, then checks matches only within equivalent groups
- Situation 2 (A queries): Pre-filters idea candidates, uses cached fact equivalences, groups both, and checks matches only within equivalent groups
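Situation 2 can be sketched as follows, under simplifying assumptions that are not taken from egraph.py: terms are flat tuples of strings, facts are ground (so no final re-unification against the fact is needed), and `find` is any callable that returns a class representative. The class name `FactSearch` and its methods are hypothetical.

```python
from collections import defaultdict


def match_flat(pattern, term):
    """Match a flat pattern such as ("p", "?x") against a flat ground term."""
    if len(pattern) != len(term):
        return None
    bindings = {}
    for p, t in zip(pattern, term):
        if p.startswith("?"):
            if bindings.setdefault(p, t) != t:
                return None  # same variable bound to two different values
        elif p != t:
            return None
    return bindings


class FactSearch:
    def __init__(self, find):
        self.find = find  # stand-in for the egraph's find
        self.terms = []
        self.facts_by_class = defaultdict(list)

    def rebuild(self, terms, facts):
        """Cache, once per rebuild, which facts live in which class."""
        self.terms = list(terms)
        self.facts_by_class = defaultdict(list)
        for fact in facts:
            self.facts_by_class[self.find(fact)].append(fact)

    def execute_fact(self, pattern):
        """Yield (term, fact, bindings) for terms matching the query whose class holds a fact."""
        for term in self.terms:
            bindings = match_flat(pattern, term)
            if bindings is None:
                continue  # pre-filter: the query cannot match this term
            for fact in self.facts_by_class.get(self.find(term), ()):
                yield term, fact, bindings


# p(b) is equivalent to q(a); q(a) is a known fact.
roots = {("p", "b"): ("q", "a")}
search = FactSearch(lambda term: roots.get(term, term))
search.rebuild(terms=[("p", "a"), ("p", "b"), ("q", "a")], facts=[("q", "a")])
results = list(search.execute_fact(("p", "?x")))
print(results)
```

Here p(a) passes the pre-filter but yields nothing because its class contains no fact, while p(b) is reported together with the fact q(a) that supports it.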

Original prompt

This section details the original issue you should resolve

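<issue_title>egraph.py refactoring and optimization</issue_title>
<issue_description>In execute, there are two possible kinds of match. The current approach is as follows:

Case 1:

Q := A == B
for x from term
    for y from term
        check x == y
        check that A == B and x == y can match

Case 2:

Q := A
for F from fact
    for x from term
        for y from term
            check x == y
            check that A == F and x == y can match

This is inefficient, however. We should first pre-check whether A == B and x == y, or A == F and x == y, can match.

The optimization should work as follows:

Case 1:

Q := A == B
for x from term
    check that A and x can match -> collect into A_pool
for y from term
    check that B and y can match -> collect into B_pool
Compute the egraph's find for everything in A_pool and B_pool, and group the results into [(A_minipool_1, B_minipool_1), (A_minipool_2, B_minipool_2), ...], where for each i all terms in the union of A_minipool_i and B_minipool_i are equivalent according to the egraph, and the A minipools together form a disjoint union of A_pool
for i
    for x from A_minipool_i
        for y from B_minipool_i
            check that A == B and x == y can match (A ~ x and B ~ y do not imply A == B ~ x == y, so this must be checked again)

Case 2:

Q := A
for F from fact
    for y from term
        check that F and y can match -> collect into F_pool; (y, F) must be recorded together so that we know which fact each entry came from  # this part is done in rebuild
for x from term
    check that A and x can match -> collect into A_pool
Then, similarly, group to obtain [(A_minipool_1, F_minipool_1, base_fact_i), ...]
Then for i
    for x from A_minipool_i
        for y, F_i from F_minipool_i
            check that A == F_i and x == y can match</issue_description>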
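The final re-check in the issue is needed because a variable shared between the two sides can be bound differently by each side. A tiny sketch of that situation, assuming unary terms encoded as (head, argument) tuples and a hypothetical `merge_bindings` helper (neither comes from egraph.py):

```python
def match_unary(pattern, term):
    """Match a unary pattern such as ("f", "?v") against a ground unary term."""
    (p_head, p_arg), (t_head, t_arg) = pattern, term
    if p_head != t_head:
        return None
    if p_arg.startswith("?"):
        return {p_arg: t_arg}
    return {} if p_arg == t_arg else None


def merge_bindings(left, right):
    """Combine two binding sets; None if a shared variable disagrees."""
    if left is None or right is None:
        return None
    merged = dict(left)
    for var, value in right.items():
        if merged.setdefault(var, value) != value:
            return None
    return merged


A, B = ("f", "?v"), ("g", "?v")  # query f(?v) == g(?v)
x, y = ("f", "a"), ("g", "b")    # suppose the egraph knows f(a) == g(b)

print(match_unary(A, x))  # {'?v': 'a'}  -> A ~ x holds
print(match_unary(B, y))  # {'?v': 'b'}  -> B ~ y holds
print(merge_bindings(match_unary(A, x), match_unary(B, y)))  # None: no joint match
```

Both sides pass the pre-filter individually, yet the pair is rejected once the bindings for ?v are combined, which is why the pools alone cannot replace the final check.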

Comments on the Issue (you are @copilot in this section)

Fixes egraph.py refactoring and optimization #37

Co-authored-by: hzhangxyz <[email protected]>

hzhangxyz · 2025-12-23T06:57:34Z

fix pre-commit hook please @copilot

Co-authored-by: hzhangxyz <[email protected]>

Copilot · 2025-12-23T07:01:17Z

> fix pre-commit hook please @copilot

Fixed in commit 4e14b9f. The ruff-format pre-commit hook was failing due to trailing whitespace on blank lines. All formatting issues have been resolved and pre-commit hooks now pass.

hzhangxyz

self.pairs is no longer used and can be removed.

Co-authored-by: hzhangxyz <[email protected]>

Copilot · 2025-12-23T07:26:16Z

> self.pairs is no longer used and can be removed.

Removed in commit 56c8ec1. The self.pairs attribute was no longer used after the optimization: it was still built in rebuild() but never accessed by the optimized _execute_expr() and _execute_fact() methods.

Co-authored-by: Copilot <[email protected]>

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.

Initial plan

0dc2669

Copilot AI assigned Copilot and hzhangxyz Dec 23, 2025

Copilot started work on behalf of hzhangxyz December 23, 2025 06:25

Copilot AI and others added 4 commits December 23, 2025 06:29

Optimize egraph.py execute methods with pre-filtering and grouping

fb7b472

Co-authored-by: hzhangxyz <[email protected]>

Fix early returns and improve documentation in optimized execute methods

fd083e5

Co-authored-by: hzhangxyz <[email protected]>

Remove incorrect break statement in _execute_fact

921478a

Co-authored-by: hzhangxyz <[email protected]>

Fix variable naming in _execute_fact to match original convention

5453675

Co-authored-by: hzhangxyz <[email protected]>

Copilot AI changed the title ~~[WIP] Refactor and optimize execute in egraph.py~~ Optimize egraph execute methods with pre-filtering and equivalence class grouping Dec 23, 2025

Copilot AI requested a review from hzhangxyz December 23, 2025 06:37

Copilot finished work on behalf of hzhangxyz December 23, 2025 06:37

Copilot started work on behalf of hzhangxyz December 23, 2025 06:58

Fix formatting to pass pre-commit ruff-format hook

4e14b9f

Co-authored-by: hzhangxyz <[email protected]>

Copilot finished work on behalf of hzhangxyz December 23, 2025 07:02

hzhangxyz requested changes Dec 23, 2025

Copilot started work on behalf of hzhangxyz December 23, 2025 07:23

Remove unused self.pairs attribute from Search class

56c8ec1

Co-authored-by: hzhangxyz <[email protected]>

Copilot AI requested a review from hzhangxyz December 23, 2025 07:27

Copilot finished work on behalf of hzhangxyz December 23, 2025 07:27

Update egraph.py.

d469b10

hzhangxyz force-pushed the copilot/optimize-egraph-execute branch from 7fe384b to d469b10 December 23, 2025 07:48

hzhangxyz requested a review from Copilot December 23, 2025 07:49

hzhangxyz approved these changes Dec 23, 2025

hzhangxyz marked this pull request as ready for review December 23, 2025 07:49

Copilot started reviewing on behalf of hzhangxyz December 23, 2025 07:49

This comment was marked as outdated.

Update ddss/egraph.py

9a26b1f

Co-authored-by: Copilot <[email protected]>

Update egraph.py.

ab06234

hzhangxyz requested a review from Copilot December 23, 2025 08:00

Copilot started reviewing on behalf of hzhangxyz December 23, 2025 08:00

Copilot AI reviewed Dec 23, 2025

hzhangxyz added 2 commits December 23, 2025 16:05

Update egraph.py.

6ec8d5c

Fix a typing issue.

3dc85a6

hzhangxyz merged commit dd07b17 into main Dec 23, 2025
14 checks passed
