Partly Stale Worker Cache Can Cause Cycles in TreeMap/TreeSet Iteration #24

tmagrino · 2018-09-20T00:40:22Z

We noticed this before moving the repository to GitHub, and I've run into the issue again, so I'm documenting the problem here.

If I remember correctly, we previously determined that this occurs when one node in the red-black tree backing TreeMap is stale in the worker's cache and still points to another node that has since been updated to become its parent (this happens during the rotations that rebalance the tree). The parent node, unlike the child, is up to date in the worker cache and points to the stale child, so the two nodes point at each other and iteration loops forever.
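To make the failure mode concrete, here is a minimal plain-Java model of it. This is a sketch, not Fabric's code: `Node`, `successor`, `mixedView`, and the other names are hypothetical, and it assumes the iterator computes the in-order successor in the usual way for red-black trees with parent pointers (right subtree's leftmost node, else climb). The "mixed" view pairs a pre-rotation copy of the child with a post-rotation copy of the parent.

```java
import java.util.HashSet;
import java.util.Set;

public class StaleRotationDemo {
    /** Hypothetical stand-in for a tree node; colors are omitted. */
    public static final class Node {
        final int key;
        Node left, right, parent;
        Node(int key) { this.key = key; }
    }

    /** Leftmost node under root, i.e. where in-order iteration starts. */
    public static Node first(Node root) {
        Node n = root;
        while (n != null && n.left != null) n = n.left;
        return n;
    }

    /** Standard in-order successor using child and parent pointers. */
    public static Node successor(Node n) {
        if (n.right != null) {
            Node s = n.right;
            while (s.left != null) s = s.left;
            return s;
        }
        Node child = n;
        Node p = n.parent;
        while (p != null && child == p.right) {
            child = p;
            p = p.parent;
        }
        return p;
    }

    /** True if an in-order walk revisits a node within maxSteps steps. */
    public static boolean revisitsNode(Node root, int maxSteps) {
        Set<Node> seen = new HashSet<>(); // identity-based: Node has no equals()
        Node n = first(root);
        for (int i = 0; n != null && i < maxSteps; i++) {
            if (!seen.add(n)) return true;
            n = successor(n);
        }
        return false;
    }

    /** Stale x (pre-rotation) combined with fresh y (post-rotation). */
    public static Node mixedView() {
        Node x = new Node(1);
        Node y = new Node(2);
        x.right = y; // stale: x still believes y is its right child
        y.left = x;  // fresh: after the left rotation, x is y's left child
        return y;    // y is the root in the fresh view
    }

    /** Both nodes as they look after the rotation. */
    public static Node consistentView() {
        Node x = new Node(1);
        Node y = new Node(2);
        y.left = x;
        x.parent = y;
        return y;
    }
}
```

In the mixed view, the successor of `x` is computed as "leftmost node under `x.right`", which is `x` itself, so the walk never advances. In the consistent view the same walk visits 1, then 2, and stops.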

There should be a way to keep this from causing an infinite loop. In the FabIL code I'm working on, which triggered this issue, I've come up with a bit of a hack to detect the problem and retry the transaction (and update the cache). I set up a local Java set to track the items the iterator has already returned and throw an exception if there's a duplicate. Here's a snippet of FabIL code demonstrating this "hack":

fabric.util.TreeSet s = new fabric.util.TreeSet().fabric$util$TreeSet$();
// ... code here adding things to the set s.
atomic {
  java.util.HashSet itemsSeen = new java.util.HashSet();
  for (fabric.util.Iterator iter = s.iterator(); iter.hasNext();) {
    Object item = iter.next();
    if (itemsSeen.contains(item)) throw new IllegalStateException("This shouldn't happen");
    itemsSeen.add(item);
    // Stuff we actually wanted to do in the loop goes here.
  }
}

Throwing the exception aborts the transaction, which allows the transaction manager to check for stale objects on the store when going through the abort/retry loop.

Ideally, we could either ensure this can't happen to begin with or somehow build the invariant check I'm doing in the above snippet into the implementation of TreeMap.
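One way such a built-in check might look, sketched in plain Java rather than FabIL: because a sorted tree's iterator must return strictly increasing elements, it is enough to compare each element with the previous one instead of remembering every element seen. `OrderCheckedIterator` is a hypothetical name, and the sketch uses `java.util` interfaces as a stand-in for `fabric.util`; how the exception would be wired into Fabric's abort/retry handling is not shown.

```java
import java.util.Comparator;
import java.util.Iterator;

/**
 * Wraps an iterator that is supposed to yield strictly increasing elements
 * and fails fast if that invariant is violated. Any cycle in the underlying
 * structure must eventually yield an element that is not greater than its
 * predecessor, so a cycle cannot go unnoticed.
 */
public final class OrderCheckedIterator<T> implements Iterator<T> {
    private final Iterator<T> inner;
    private final Comparator<? super T> order;
    private T previous;
    private boolean hasPrevious = false;

    public OrderCheckedIterator(Iterator<T> inner, Comparator<? super T> order) {
        this.inner = inner;
        this.order = order;
    }

    @Override
    public boolean hasNext() {
        return inner.hasNext();
    }

    @Override
    public T next() {
        T item = inner.next();
        // Strictly increasing order is required; equal or smaller means the
        // walk went sideways or backwards.
        if (hasPrevious && order.compare(previous, item) >= 0) {
            throw new IllegalStateException(
                "iteration order violated; tree nodes may be stale");
        }
        previous = item;
        hasPrevious = true;
        return item;
    }
}
```

Compared with the HashSet approach in the snippet above, this uses constant extra memory and one comparison per element, but it only applies to ordered collections such as TreeMap and TreeSet.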


tmagrino · 2018-09-20T00:41:35Z

Adding an image depicting the sort of update that can cause this issue, found by @liujed, from past discussions of this problem.

tmagrino · 2018-09-20T01:08:57Z

Might it be worth rewriting TreeMap to use a balanced tree structure that doesn't depend on rotations, such as a B-tree? I'm not sure this would be free of the issue, but it doesn't seem to invite it the way a red-black tree does.

tmagrino · 2018-09-20T01:16:13Z

@liujed notes that we saw something like this with buckets in HashMap when resizing the map.

andrewcmyers · 2018-09-20T01:16:38Z

A red-black tree can be implemented in a way that doesn't require imperative update to do tree rotations. See the CS 3110 notes.
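For illustration, here is a sketch of red-black insertion without in-place updates, written in Java in the usual purely functional style (four rebalancing cases that each rebuild a small subtree). It is not taken from the cited notes or from Fabric; `PersistentRbTree` and its members are hypothetical names, and keys are plain ints to keep it short. Since nodes are immutable, a cached node can never disagree with a newer version of itself; only a reference to the root can be out of date, and it then denotes an older but internally consistent tree.

```java
import java.util.ArrayList;
import java.util.List;

public final class PersistentRbTree {
    public enum Color { RED, BLACK }

    /** Immutable node: every field is final, so rebalancing allocates new nodes. */
    public static final class Node {
        final Color color;
        final Node left;
        final int key;
        final Node right;
        Node(Color color, Node left, int key, Node right) {
            this.color = color; this.left = left; this.key = key; this.right = right;
        }
    }

    /** Returns a new root containing key; the tree under root is left untouched. */
    public static Node insert(Node root, int key) {
        Node n = ins(root, key);
        return new Node(Color.BLACK, n.left, n.key, n.right); // root is always black
    }

    private static Node ins(Node t, int key) {
        if (t == null) return new Node(Color.RED, null, key, null);
        if (key < t.key) return balance(t.color, ins(t.left, key), t.key, t.right);
        if (key > t.key) return balance(t.color, t.left, t.key, ins(t.right, key));
        return t; // key already present
    }

    private static boolean isRed(Node n) {
        return n != null && n.color == Color.RED;
    }

    /** Red node y with black children (a x b) and (c z d). */
    private static Node build(Node a, int x, Node b, int y, Node c, int z, Node d) {
        return new Node(Color.RED,
            new Node(Color.BLACK, a, x, b), y, new Node(Color.BLACK, c, z, d));
    }

    /** Repairs a black node that has a red child with a red child. */
    private static Node balance(Color c, Node l, int k, Node r) {
        if (c == Color.BLACK) {
            if (isRed(l) && isRed(l.left))
                return build(l.left.left, l.left.key, l.left.right, l.key, l.right, k, r);
            if (isRed(l) && isRed(l.right))
                return build(l.left, l.key, l.right.left, l.right.key, l.right.right, k, r);
            if (isRed(r) && isRed(r.left))
                return build(l, k, r.left.left, r.left.key, r.left.right, r.key, r.right);
            if (isRed(r) && isRed(r.right))
                return build(l, k, r.left, r.key, r.right.left, r.right.key, r.right.right);
        }
        return new Node(c, l, k, r);
    }

    /** In-order contents, for inspecting a version of the tree. */
    public static List<Integer> toList(Node t) {
        List<Integer> out = new ArrayList<>();
        collect(t, out);
        return out;
    }

    private static void collect(Node t, List<Integer> out) {
        if (t == null) return;
        collect(t.left, out);
        out.add(t.key);
        collect(t.right, out);
    }
}
```

Each insert allocates new nodes along the path from the root and leaves the replaced ones unreachable from the new root, so on a persistent store the old nodes would accumulate unless something reclaims them.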

tmagrino · 2018-09-20T01:17:26Z

So the concern with doing a more "functional" version is that, in Fabric, we don't have a way to delete persistent objects after they're put on the store.

andrewcmyers · 2018-09-20T01:18:24Z

We could probably add that...

tmagrino · 2018-09-20T01:20:27Z

I think this might be a big enough idea to merit a separate issue for discussion? I think that if we could find a general way to allow for temporary-yet-persistent objects to not lead to storage leaks on the store in Fabric, a lot of things would get better.

tmagrino · 2018-09-20T01:24:53Z

Made an issue regarding the persistent garbage concern here: #25.

tmagrino added the bug label Sep 20, 2018
