
Let the DataPartitionTable be automatically cleanable #14737

Open · wants to merge 3 commits into base: master
Conversation

CRZbulabula (Contributor):

A data partition does not need to exist once all of its data have expired under the pre-configured TTL. Hence, this PR adds a thread that automatically cleans such expired data partitions, keeping the cache size of the DataPartitionTable within an acceptable bound. Specifically, the main updates include:

  1. In TTLManager, getDatabaseMaxTTL() is implemented to retrieve the maximum TTL of a specific database.
  2. In TimePartitionUtils, getCurrentTimePartitionSlot() is implemented to locate the time partition slot of the current timestamp.
  3. In PartitionManager, a periodic asynchronous thread is set up, which combines the aforementioned interfaces to construct an "AutoCleanPartitionTablePlan". As a result, for each database, any data partition whose data have all expired is deleted automatically by this independent thread.

Incidentally, the corresponding IT is available at "IoTDBPartitionTableAutoCleanTest".
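A simplified, self-contained sketch of the cleaning rule these pieces combine into (stand-in types and names only, not the actual implementation, which lives in TTLManager#getDatabaseMaxTTL, TimePartitionUtils#getCurrentTimePartitionSlot and SeriesPartitionTable#autoCleanPartitionTable):

import java.util.HashMap;
import java.util.Map;

public class PartitionCleanSketch {

  // Drop every time partition slot whose data are all older than the database's maximum TTL.
  static void autoClean(Map<Long, String> slotStartToRegionGroup, long maxTTL, long currentSlotStart) {
    slotStartToRegionGroup.keySet().removeIf(slotStart -> slotStart + maxTTL < currentSlotStart);
  }

  public static void main(String[] args) {
    long hour = 3_600_000L;
    Map<Long, String> slots = new HashMap<>();
    slots.put(0L, "regionGroup-1");         // slot [0h, 1h): all data expired under a 24h TTL
    slots.put(30 * hour, "regionGroup-2");  // slot [30h, 31h): still within TTL
    autoClean(slots, 24 * hour, 26 * hour); // maxTTL = 24h, current time partition slot starts at 26h
    System.out.println(slots);              // {108000000=regionGroup-2}
  }
}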


codecov bot commented Jan 20, 2025

Codecov Report

Attention: Patch coverage is 41.02564% with 69 lines in your changes missing coverage. Please review.

Project coverage is 39.21%. Comparing base (bc5fdae) to head (8711a10).
Report is 22 commits behind head on master.

Files with missing lines Patch % Lines
...onfignode/procedure/PartitionTableAutoCleaner.java 20.00% 20 Missing ⚠️
.../org/apache/iotdb/commons/schema/ttl/TTLCache.java 0.00% 14 Missing ⚠️
...onfignode/persistence/partition/PartitionInfo.java 0.00% 8 Missing ⚠️
...t/write/partition/AutoCleanPartitionTablePlan.java 84.37% 5 Missing ⚠️
...apache/iotdb/commons/utils/TimePartitionUtils.java 16.66% 5 Missing ⚠️
.../iotdb/commons/partition/SeriesPartitionTable.java 20.00% 4 Missing ⚠️
...g/apache/iotdb/confignode/persistence/TTLInfo.java 0.00% 3 Missing ⚠️
...he/iotdb/commons/partition/DataPartitionTable.java 0.00% 3 Missing ⚠️
...che/iotdb/confignode/manager/ProcedureManager.java 33.33% 2 Missing ⚠️
.../persistence/partition/DatabasePartitionTable.java 0.00% 2 Missing ⚠️
... and 3 more
Additional details and impacted files
@@             Coverage Diff              @@
##             master   #14737      +/-   ##
============================================
+ Coverage     39.12%   39.21%   +0.09%     
  Complexity      193      193              
============================================
  Files          4417     4445      +28     
  Lines        281267   282533    +1266     
  Branches      34783    34861      +78     
============================================
+ Hits         110044   110800     +756     
- Misses       171223   171733     +510     


@liyuheng55555 (Collaborator) left a comment:
I haven't finished reading all the code yet, but I have some thoughts for discussion:

If the default execution interval is every 2 hours, then perhaps adding a periodically executed Procedure would be more lightweight than adding a new thread pool?

Comment on lines 130 to 156
for (int retry = 0; retry < 120; retry++) {
  boolean partitionTableAutoCleaned = true;
  TDataPartitionTableResp resp = client.getDataPartitionTable(req);
  if (TSStatusCode.SUCCESS_STATUS.getStatusCode() == resp.getStatus().getCode()) {
    Map<String, Map<TSeriesPartitionSlot, Map<TTimePartitionSlot, List<TConsensusGroupId>>>>
        dataPartitionTable = resp.getDataPartitionTable();
    for (Map.Entry<
            String,
            Map<TSeriesPartitionSlot, Map<TTimePartitionSlot, List<TConsensusGroupId>>>>
        e1 : dataPartitionTable.entrySet()) {
      for (Map.Entry<TSeriesPartitionSlot, Map<TTimePartitionSlot, List<TConsensusGroupId>>>
          e2 : e1.getValue().entrySet()) {
        if (e2.getValue().size() != 1) {
          // The PartitionTable of each database should only contain 1 time partition slot
          partitionTableAutoCleaned = false;
          break;
        }
      }
      if (!partitionTableAutoCleaned) {
        break;
      }
    }
  }
  if (partitionTableAutoCleaned) {
    return;
  }
  TimeUnit.SECONDS.sleep(1);
Collaborator:

  1. Consider using Awaitility to replace the outer for-loop and sleep (see the sketch after this comment).
  2. (As ChatGPT suggested :) please check whether these two inner loops can be simplified, for example:
partitionTableAutoCleaned = resp.getDataPartitionTable().entrySet().stream()
    .flatMap(e1 -> e1.getValue().entrySet().stream())
    .allMatch(e2 -> e2.getValue().size() == 1);
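
For point 1, a minimal sketch of what the Awaitility-based check might look like (assuming the same client/req variables from the test and the standard org.awaitility API; an illustration, not the code that was merged):

// import static org.awaitility.Awaitility.await;  // plus java.util.concurrent.TimeUnit
await()
    .atMost(120, TimeUnit.SECONDS)
    .pollInterval(1, TimeUnit.SECONDS)
    .until(
        () -> {
          TDataPartitionTableResp resp = client.getDataPartitionTable(req);
          // Succeed once every series slot of every database holds exactly one time partition slot
          return TSStatusCode.SUCCESS_STATUS.getStatusCode() == resp.getStatus().getCode()
              && resp.getDataPartitionTable().values().stream()
                  .flatMap(seriesSlotMap -> seriesSlotMap.values().stream())
                  .allMatch(timeSlotMap -> timeSlotMap.size() == 1);
        });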

Contributor Author:

An awesome suggestion! The corresponding test code is simplified significantly!

for (String database : databases) {
  long subTreeMaxTTL = getTTLManager().getDatabaseMaxTTL(database);
  databaseTTLMap.put(
      database, Math.max(subTreeMaxTTL, databaseTTLMap.getOrDefault(database, -1L)));
Contributor:

Could we add a check like "isDatabaseExisted(database) && 0 < ttl && ttl < Long.MAX_VALUE" here to reduce overhead?
BTW, if none of the databases has a TTL, this check alone tells us that nothing needs to be cleaned up, so there is no need to perform a consensus write.
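
For illustration, a hedged sketch of how that guard could wrap the loop quoted above (hypothetical surrounding code; the actual check landed in PartitionTableAutoCleaner):

Map<String, Long> databaseTTLMap = new HashMap<>();
for (String database : databases) {
  long ttl = getTTLManager().getDatabaseMaxTTL(database);
  // Skip databases that no longer exist or carry no finite, positive TTL.
  if (isDatabaseExisted(database) && 0 < ttl && ttl < Long.MAX_VALUE) {
    databaseTTLMap.merge(database, ttl, Math::max);
  }
}
// If no database has a usable TTL, nothing can expire, so skip the consensus write entirely.
if (!databaseTTLMap.isEmpty()) {
  // build and submit the AutoCleanPartitionTablePlan here
}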

Contributor Author:

Thanks for pointing out this logic enhancement. The check is now available in PartitionTableAutoCleaner.

public PartitionManager(IManager configManager, PartitionInfo partitionInfo) {
  this.configManager = configManager;
  this.partitionInfo = partitionInfo;
  this.regionMaintainer =
      IoTDBThreadPoolFactory.newSingleThreadScheduledExecutor(
          ThreadName.CONFIG_NODE_REGION_MAINTAINER.getName());
  this.partitionCleaner =
Contributor:

Maybe try to reuse the procedure periodic tasks?

Contributor Author:

Sure. I now employ the PartitionTableAutoCleaner rather than creating an extra thread pool.

@liyuheng55555 (Collaborator) left a comment:

Have you considered concurrency issues, such as setTTL and Cleaner running simultaneously?

(I think one solution could be to clean up partitions only after they’ve exceeded the TTL by one hour, but you might have solved this in another way :)

@CRZbulabula (Contributor Author):

> Have you considered concurrency issues, such as setTTL and Cleaner running simultaneously?
>
> (I think one solution could be to clean up partitions only after they’ve exceeded the TTL by one hour, but you might have solved this in another way :)

Thanks for raising this critical issue. To address your concern, please refer to the autoCleanPartitionTable in the SeriesPartitionTable, as outlined below:

public void autoCleanPartitionTable(long TTL, TTimePartitionSlot currentTimeSlot) {
  seriesPartitionMap
      .entrySet()
      .removeIf(entry -> entry.getKey().getStartTime() + TTL < currentTimeSlot.getStartTime());
}

Here, the removal condition is < rather than <=. As a result, a data partition is removed from the cache only after it has exceeded the TTL by a whole time-partitioning interval; the PartitionTable therefore always retains a row of "empty" data partitions. I believe this approach avoids the concurrency problem.
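
A worked example with assumed numbers (time partition interval = 1 hour, TTL = 24 hours; not taken from the PR) shows the extra slack the strict < gives:

long HOUR = 3_600_000L;
long ttl = 24 * HOUR;   // assumed database TTL
long slotStart = 0L;    // the slot under test covers [0h, 1h)

// Current slot starts at 24h: 0 + 24h < 24h is false, so the slot is kept,
// even though its oldest data (written at 0h) has just reached the TTL.
assert !(slotStart + ttl < 24 * HOUR);

// Current slot starts at 25h: 0 + 24h < 25h is true, so the slot is removed;
// by then even its newest possible data (written just before 1h) has been expired for about an hour.
assert slotStart + ttl < 25 * HOUR;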
