Interest and Contribute in GSoC 2026 Project 4 : Lazy Trajectory Loading and Indexing #5271

1825Vaishnavi · 2026-03-04T00:24:56Z

1825Vaishnavi
Mar 4, 2026

Hi MDAnalysis Community!

Hi I'm Vaishnavi Gajarla currently pursuing Masters in Data Analytics Engineering at Northeastern University ,Boston (GPA:3.7)
I am working as a Research Assistant under professor Nik Bear Brown where my role is analyzing 500k+ record datasets using python and SQL to generate forecasts and satistically validate performance metrics for research driven decision-making and developed ETL workflows . I have industry experiences optimizing database indexing-improving query speed by 50% during my internship
I'm really interested in Project 4: lazy Trajectory Loading and Indexing for GSoC 2026 and after reading the information you provided i saw the issues #3793 and the XDRBaseReader documentation, I understand the core problem : XTC/TRR readers currenlty build complete offset indices on first file open by scanning the entire file , which can take hours for large trajectories . The proposed solution is lazy indexing skipping index building during simple forward iteration and buiding it progressively as frames are read and the XTC that uses a magic number 1993 in headers with fixed size headers enabling seek-based traversal,while TRR frames have variables sizes depending on what data like coordinates,velocity,forces is stored making lazy indexing slightly different after each format
My Questions:

Should lazy indexing be implemented seperately for XTC and TRR given their structural differences or through a shared base class approach in XDRBaseReader?
For User-Configurable Kwarg (index_trajectory="always"|"never"|"lazy") is this planned at universe level or reader level?
What would be a good first contribution to get familiar with the XDRBaseReader codebase before tackling lazy indexing ?

I have already installed the MDAnalysis 2.10.0 and I 'm ready to contribute!

Vaishnavi Gajarla
Northeastern University
https://github.com/1825Vaishnavi

1825Vaishnavi · 2026-03-04T00:29:10Z

1825Vaishnavi
Mar 4, 2026
Author

@orbeckst @yuxuanzhuang @talagayev -would love your guidance on this project!

0 replies

orbeckst · 2026-03-04T00:41:29Z

orbeckst
Mar 4, 2026
Maintainer

My quick thoughts:

Should lazy indexing be implemented seperately for XTC and TRR given their structural differences or through a shared base class approach in XDRBaseReader?

Try to repeat as little code as possible, so I'd start with trying to put as much as possible common code into XDRBaseReader.

For User-Configurable Kwarg (index_trajectory="always"|"never"|"lazy") is this planned at universe level or reader level?

ALways at Reader level. Universe will just pass kwargs through.

What would be a good first contribution to get familiar with the XDRBaseReader codebase before tackling lazy indexing ?

Search issues with label format-Gromacs https://github.com/MDAnalysis/mdanalysis/issues?q=is%3Aissue%20state%3Aopen%20label%3AFormat-Gromacs — none of these may be "good" = "easy" but we do not require that you have a PR merged; we want to interact with you productively so just getting deep into a PR is a good start. (Of course, getting it merged is even better but it's not required and as you may see, often our PRs have 50+ comments until they get merged as we take code correctness and quality seriously for a scientific code like MDA.)

0 replies

1825Vaishnavi · 2026-03-04T01:24:01Z

1825Vaishnavi
Mar 4, 2026
Author

Thank you so much @orbeckst for the clear response!
I will look through the format-Gromacs issues and pick one to start working on. I'll aim to get deep into a PR to show my engagement with the codebase.
I'll share my progress here as I work through it!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interest and Contribute in GSoC 2026 Project 4 : Lazy Trajectory Loading and Indexing #5271

Uh oh!

{{title}}

Uh oh!

Replies: 3 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Interest and Contribute in GSoC 2026 Project 4 : Lazy Trajectory Loading and Indexing #5271

Uh oh!

1825Vaishnavi Mar 4, 2026

Replies: 3 comments

Uh oh!

1825Vaishnavi Mar 4, 2026 Author

Uh oh!

orbeckst Mar 4, 2026 Maintainer

Uh oh!

1825Vaishnavi Mar 4, 2026 Author

1825Vaishnavi
Mar 4, 2026

1825Vaishnavi
Mar 4, 2026
Author

orbeckst
Mar 4, 2026
Maintainer

1825Vaishnavi
Mar 4, 2026
Author