Skip to content

Commit c84b03f

Browse files
Add info about fixing the issues (#2243)
1 parent d20127b commit c84b03f

File tree

1 file changed

+22
-10
lines changed

1 file changed

+22
-10
lines changed

rst/cartridge_admin.rst

Lines changed: 22 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1435,13 +1435,15 @@ Cartridge displays cluster and instances issues in WebUI:
14351435
* Failover:
14361436

14371437
* **warning**: "Can't obtain failover coordinator (...)";
1438-
* **warning**: "There is no active failover coordinator";
1438+
* **warning**: "There is no active failover coordinator"
1439+
-- all of the cooridinator instances are probably dead or unavailable;
14391440
* **warning**: "Failover is stuck on ...: Error fetching appointments (...)";
14401441
* **warning**: "Failover is stuck on ...: Failover fiber is dead" -- this is likely a bug;
14411442

14421443
* Switchover:
14431444

1444-
* **warning**: "Consistency on ... isn't reached yet";
1445+
* **warning**: "Consistency on ... isn't reached yet" -- wait for the switchover to finish
1446+
or try to restart an instance;
14451447

14461448
* Clock:
14471449

@@ -1451,15 +1453,18 @@ Cartridge displays cluster and instances issues in WebUI:
14511453

14521454
* **critical**: "Running out of memory on ..." -- when all 3 metrics
14531455
``items_used_ratio``, ``arena_used_ratio``, ``quota_used_ratio`` from
1454-
``box.slab.info()`` exceed ``limits.fragmentation_threshold_critical``;
1456+
``box.slab.info()`` exceed ``limits.fragmentation_threshold_critical``.
1457+
You need to add more memory to the instance;
14551458

14561459
* **warning**: "Memory is highly fragmented on ..." - when
14571460
``items_used_ratio > limits.fragmentation_threshold_warning`` and
1458-
both ``arena_used_ratio``, ``quota_used_ratio`` exceed critical limit;
1461+
both ``arena_used_ratio``, ``quota_used_ratio`` exceed critical limit.
1462+
You can restart the instance to defragment memory or change ``box.cfg.min_tuple_size`` or
1463+
``box.cfg.slab_alloc_factor`` to reduce fragmentation;
14591464

14601465
* Configuration:
14611466

1462-
* **warning**: "Configuration checksum mismatch on ...";
1467+
* **warning**: "Configuration checksum mismatch on ..." -- config is probably corrupted somehow;
14631468
* **warning**: "Configuration is prepared and locked on ...";
14641469
* **warning**: "Advertise URI (...) differs from clusterwide config (...)";
14651470
* **warning**: "Configuring roles is stuck on ... and hangs for ... so far";
@@ -1483,14 +1488,19 @@ Cartridge displays cluster and instances issues in WebUI:
14831488

14841489
* various vshard alerts (see vshard docs for details);
14851490

1486-
* **warning**: warning: "Group "..." wasn't bootstrapped: ...";
1491+
* **warning**: "Group "..." wasn't bootstrapped: ...".
1492+
You need to bootstrap the group again;
14871493

1488-
* **warning**: Vshard storages in replicaset %s marked as "all writable".
1494+
* **warning**: "Vshard storages in replicaset ... marked as "all writable".
1495+
You can fix it by setting ``all_rw = false`` in the replicaset configuration;
14891496

14901497
* Alien members:
14911498

14921499
* **warning**: "Instance ... with alien uuid is in the membership" --
1493-
when two separate clusters share the same cluster cookie;
1500+
when two separate clusters share the same cluster cookie or
1501+
when two clustes have the same first 32 symbols of cluster cookie.
1502+
You need to change the cluster cookie on one of the clusters or set
1503+
``set_cookie_hash_membership`` in ``cartridge.cfg`` to ``true``;
14941504

14951505
.. image:: images/cartridge-issues-alien-uuid.png
14961506
:align: left
@@ -1501,11 +1511,13 @@ Cartridge displays cluster and instances issues in WebUI:
15011511
* Expelled instances:
15021512

15031513
* **warning**: "Replicaset ... has expelled instance ... in box.space._cluster" -
1504-
when instance was expelled from replicaset, but still remains in box.space._cluster;
1514+
when instance was expelled from replicaset, but still remains in box.space._cluster.
1515+
You need to remove it manually;
15051516

15061517
* Deprecated space format:
15071518

1508-
* **warning**: "Instance ... has spaces with deprecated format: space1, ..."
1519+
* **warning**: "Instance ... has spaces with deprecated format: space1, ...". You need to
1520+
convert spaces to the new format;
15091521

15101522
* Raft issues:
15111523

0 commit comments

Comments
 (0)