Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sparse query pattern performance #2973

Open
linas opened this issue Jul 22, 2022 · 0 comments
Open

sparse query pattern performance #2973

linas opened this issue Jul 22, 2022 · 0 comments
Labels

Comments

@linas
Copy link
Member

linas commented Jul 22, 2022

The sparse queries are used for the cheminformatics git repo. They consist of an unordered link holding the query pattern, and a glob to soak up everything else. As currently implemented in pull reqs #2968 #2970 #2971 the algo does an exhaustive search ... which is not required. For truly sparse data, the performance can be tremendously improved by compiling all of the connected components in the pattern, and then traversing only those.

This could be done by taking the current connected-component code, and refactoring it so it can also be employed here.

This work could become urgent, if the cheminformatics project takes off. The current code could be a major performance bottleneck.

@linas linas added pattern-matcher performance Performance issues labels Jul 22, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant