- Author(s): Yisaer (Song Gao)
- Last updated: 2021-09-22
- Tracking Issue: #21094
- Related Document: https://docs.google.com/document/d/1dSbXbudTK-hpz0vIBsmpfuzeQhhTFJ-zDenygt8fmCk/edit?usp=sharing
This proposal aims to support Stale Read and describe what Stale Read looks like and how to keep compatibility with other queries.
Currently, TiDB supports Follower Read to read data from follower replicas in order to increase the query throughput. TiDB also supports Snapshot Read to read old data with a given snapshot timestamp.
However, Follower Read still requires an RPC calling between follower replica and leader replica during querying which will be the key bottleneck to scale up the read throughput when there is a hotspot reading in this range. Moreover, in cross region scenario, an extra rpc calling will also increased the latency.
For Snapshot Read, it only reads data from leader replica which might be a bottleneck, especially in hot spot cases.
Stale Read is more like the combination of Follower Read and Snapshot Read. It supports TiDB to read data from both leader replicas and follower replicas with no extra RPC calling during followers and leaders.
It also supports read data with a given snapshot timestamp like Snapshot Read.
Normally, TiDB will send data query requests to the storage with a given timestamp to the leader replicas.
In Stale Read, each replica could handle the data query request because every replica maintains the resolved timestamp from the leader replica.
This makes each store could know the smallest resolved timestamp they could handle. Thus, TiDB could send data requests with a staleness timestamp to get the data they want.
Here we will introduce how to use Stale Read in TiDB in 4 methods.
- Use Stale Read by
SELECT ... FROM ... AS OF TIMESTAMP
format
SELECT * FROM t AS OF TIMESTAMP '2021-09-22 15:04:05' WHERE id = 1;
SELECT * FROM t AS OF TIMESTAMP NOW() - INTERVAL 20 SECOND WHERE id = 1;
You can get detailed grammar ebnf diagram
for SELECT ... FROM ... AS OF TIMESTAMP
in this document
- Use Stale Read by
START TRANSACTION READ ONLY AS OF TIMESTAMP
format
START TRANSACTION READ ONLY AS OF TIMESTAMP '2021-09-22 15:04:05';
SELECT * FROM t WHERE id = 1;
COMMIT;
You can get detailed grammar ebnf diagram
for START TRANSACTION READ ONLY AS OF TIMESTAMP
in this document
- Use Stale Read by
SET TRANSACTION READ ONLY AS OF TIMESTAMP
format.
SET TRANSACTION READ ONLY AS OF TIMESTAMP '2021-09-22 15:04:05';
BEGIN;
SELECT * FROM t WHERE id = 1;
COMMIT;
// it also affects next query statement
SET TRANSACTION READ ONLY AS OF TIMESTAMP '2021-09-22 15:04:05';
SELECT * FROM t WHERE id = 1;
You can get detailed grammar ebnf diagram
for SET TRANSACTION READ ONLY AS OF TIMESTAMP
in this document
- Enable Stale Read with given max tolerant staleness in session:
// enable stale read with max tolerant 5 seconds ago in session
SET @@tidb_read_staleness='-5';
select * from t where id = 1;
Note that Stale Read will only affect the following query statement SELECT
, it won't affect the interactive transaction, or updating statement like INSERT
,DELETE
and UPDATE
AS OF TIMESTAMP
provide several ways to define the staleness of STALE READ
, you can find detail usage in this document
This section we will talk about the compatibility between each Stale Read situations.
Stale Read by SET TRANSACTION Statement will make next interactive transaction or query with staleness timestamp.
Thus it is not allowed that querying stale read statement after stale read by SET TRANSACTION statement like following:
mysql> set transaction read only as of timestamp now(1);
Query OK, 0 rows affected (0.00 sec)
mysql> select * from t as of timestamp now(2);
ERROR 8135 (HY000): invalid as of timestamp: can't use select as of while already set transaction as of
After enabling tidb_read_staleness
, the following querying will use given staleness timestamp to get data from replicas.
And you can still use Stale Read with other forms with different timestamp:
SET @@tidb_read_staleness='-5';
// query data with max tolerant 5 seconds ago
select * from t where id = 1;
// query data with exact 10 seconds ago timestamp for this statement
select * from t as of timestamp now(10) where id = 1;
// query data with exact 10 seconds ago timestamp for next statement
set transaction read only as of timestamp now(10);
select * from t where id = 1;