MIME-Version: 1.0
From: WU Yan <4wuyan@gmail.com>
Date: Sat, 11 May 2024 13:27:40 +1000
Message-ID: <CAAdwFAxBjyrYUkH7u+EceTaztd1QxBtBY1Teux8K=vcGKe==-A@mail.gmail.com>
Subject: Unnecessary buffer usage with multicolumn index, row comparison, and
 equality constraint
To: pgsql-general@lists.postgresql.org
Content-Type: multipart/alternative; boundary="000000000000a29ed60618253c8a"
Archived-At: <https://www.postgresql.org/message-id/CAAdwFAxBjyrYUkH7u%2BEceTaztd1QxBtBY1Teux8K%3DvcGKe%3D%3D-A%40mail.gmail.com>
Precedence: bulk

--000000000000a29ed60618253c8a
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hi everyone, first time here. Please kindly let me know if this is not the
right place to ask.

I noticed that a simple query can read a large number of buffer blocks
unnecessarily when
1. there is an index scan on a multicolumn index
2. there is row constructor comparison in the Index Cond
3. there is also an equality constraint on the leftmost column of the
multicolumn index


## How to reproduce

I initially noticed it on AWS Aurora RDS, but it can also be reproduced in
a Docker container.
```bash
docker run --name test-postgres -e POSTGRES_PASSWORD=mysecretpassword -d -p 5432:5432 postgres:16.3
```

Create a table with a multicolumn index and populate it with about 12 million
rows of random integers.
```sql
CREATE TABLE t(a int, b int);
CREATE INDEX my_idx ON t USING BTREE (a, b);

INSERT INTO t(a, b)
SELECT
    (random() * 123456)::int AS a,
    (random() * 123456)::int AS b
FROM
    generate_series(1, 12345678);

ANALYZE t;
```

Simple query that uses the multicolumn index.
```
postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = 0 order by a, b;
                                                    QUERY PLAN
-------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..8.46 rows=1 width=8) (actual time=284.312..284.314 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = 0))
   Heap Fetches: 0
   Buffers: shared hit=3777 read=37304 written=11713
 Planning:
   Buffers: shared hit=22 read=4
 Planning Time: 0.270 ms
 Execution Time: 284.341 ms
(8 rows)
```

## Expected output

The number of buffer blocks read is much higher than expected. I would expect
it to be no more than when only one of the two constraints is present.

```
postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) order by a, b;
                                                     QUERY PLAN
--------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..23.67 rows=642 width=8) (actual time=0.030..0.158 rows=542 loops=1)
   Index Cond: (ROW(a, b) > ROW(123450, 123450))
   Heap Fetches: 0
   Buffers: shared hit=254 read=3
 Planning:
   Buffers: shared read=4
 Planning Time: 0.232 ms
 Execution Time: 0.206 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where a = 0 order by a, b;
                                                    QUERY PLAN
------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..6.20 rows=101 width=8) (actual time=0.099..0.113 rows=57 loops=1)
   Index Cond: (a = 0)
   Heap Fetches: 0
   Buffers: shared hit=27 read=2
 Planning Time: 0.081 ms
 Execution Time: 0.131 ms
(6 rows)
```

## Postgres version

16.3

## Platform information

I can reproduce it on the latest postgres docker image, which is based on
Debian Linux. Originally found the issue on AWS Aurora.


The following are my own observations and thoughts. Please disregard them if
they are a distraction.

For a general form of
```sql
select * from t where (a, b) > (x, y) and a = z order by a, b;
```

1. The number of buffer blocks read is proportional to the gap between x and z.
More precisely, it is proportional to max(0, min(x, max(a)) - max(z, min(a))).

```
postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = -30000 order by a, b;
                                                    QUERY PLAN
-------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=243.173..243.175 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = '-30000'::integer))
   Heap Fetches: 0
   Buffers: shared hit=1 read=41080
 Planning:
   Buffers: shared hit=2 read=2
 Planning Time: 0.174 ms
 Execution Time: 243.199 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = 0 order by a, b;
                                                    QUERY PLAN
-------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=230.425..230.426 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = 0))
   Heap Fetches: 0
   Buffers: shared hit=1 read=41080
 Planning:
   Buffers: shared read=4
 Planning Time: 0.296 ms
 Execution Time: 230.460 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = 30000 order by a, b;
                                                    QUERY PLAN
-------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=171.787..171.788 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = 30000))
   Heap Fetches: 0
   Buffers: shared hit=1 read=31126
 Planning:
   Buffers: shared read=4
 Planning Time: 0.191 ms
 Execution Time: 171.812 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = 60000 order by a, b;
                                                    QUERY PLAN
-------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=137.516..137.518 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = 60000))
   Heap Fetches: 0
   Buffers: shared hit=1 read=21139
 Planning:
   Buffers: shared read=4
 Planning Time: 0.212 ms
 Execution Time: 137.543 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = 90000 order by a, b;
                                                   QUERY PLAN
-----------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=57.868..57.870 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = 90000))
   Heap Fetches: 0
   Buffers: shared hit=11187 read=1
 Planning:
   Buffers: shared hit=1 read=3
 Planning Time: 0.240 ms
 Execution Time: 57.896 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a = 120000 order by a, b;
                                                  QUERY PLAN
---------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=6.018..6.019 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a = 120000))
   Heap Fetches: 0
   Buffers: shared hit=1173 read=1
 Planning:
   Buffers: shared hit=4
 Planning Time: 0.122 ms
 Execution Time: 6.052 ms
(8 rows)
```
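
As a rough check of the proportionality claim in point 1, the buffer counts from the six runs above can be divided by the gap formula. This is only a sketch: it assumes min(a) = 0 and max(a) = 123456 (the range produced by `(random() * 123456)::int`), and the helper name `gap` is made up for illustration.

```python
# x is the first element of the row bound used in every run above.
X = 123450
# Assumed range of column a, taken from (random() * 123456)::int.
A_MIN, A_MAX = 0, 123456

# Total "Buffers: shared hit + read" per run, keyed by the z in "a = z".
observed_buffers = {
    -30000: 1 + 41080,
    0: 1 + 41080,
    30000: 1 + 31126,
    60000: 1 + 21139,
    90000: 11187 + 1,
    120000: 1173 + 1,
}


def gap(x: int, z: int, a_min: int = A_MIN, a_max: int = A_MAX) -> int:
    """max(0, min(x, max(a)) - max(z, min(a))) from point 1."""
    return max(0, min(x, a_max) - max(z, a_min))


# Buffers read per unit of gap; a constant ratio means proportionality.
ratios = {z: buffers / gap(X, z) for z, buffers in observed_buffers.items()}

for z, ratio in ratios.items():
    print(f"z={z:>7}  gap={gap(X, z):>6}  "
          f"buffers={observed_buffers[z]:>5}  buffers/gap={ratio:.4f}")
```

For these six runs the ratio stays between roughly 0.33 and 0.34 buffers per unit of gap, which is consistent with the scan walking through the whole range between z and x.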

2. It's not an issue when `a = z` becomes `a < z` or `a > z`.

```
postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a < 100 order by a, b;
                                                  QUERY PLAN
---------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..4.45 rows=1 width=8) (actual time=0.006..0.006 rows=0 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a < 100))
   Heap Fetches: 0
   Buffers: shared hit=3
 Planning:
   Buffers: shared hit=8
 Planning Time: 0.119 ms
 Execution Time: 0.020 ms
(8 rows)

postgres=# explain (analyze, buffers) select * from t where row(a, b) > row(123450, 123450) and a > 100 order by a, b;
                                                     QUERY PLAN
--------------------------------------------------------------------------------------------------------------------
 Index Only Scan using my_idx on t  (cost=0.43..25.25 rows=641 width=8) (actual time=0.040..0.339 rows=542 loops=1)
   Index Cond: ((ROW(a, b) > ROW(123450, 123450)) AND (a > 100))
   Heap Fetches: 0
   Buffers: shared hit=257
 Planning:
   Buffers: shared hit=8
 Planning Time: 0.233 ms
 Execution Time: 0.443 ms
(8 rows)
```

3. It's not an issue when `a = z` becomes `b = z`.

4. The example query is trivial and for demonstration purposes only. Obviously
there is no need to supply `a = 0` when there is `(a, b) > (123450, 123450)`.
However, in practice it can be a problem when the table is joined to other
tables, resulting in a nested loop over a list of `a` values that we have no
control over, while `(a, b) > (x, y)` is used for paging.

5. My current workaround is to add `AND a >= x` to `(a, b) > (x, y)`. However,
this makes the planner underestimate the number of rows because the
selectivities of the two conditions are multiplied.
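
The underestimate in point 5 can be illustrated with a little arithmetic. This is a simplified model, not the planner's actual code. It assumes the two conditions are treated as independent so that their selectivities are multiplied, and it assumes `a >= 123450` is estimated at about the same selectivity as the row comparison (642 estimated rows in the plan above). The helper `estimated_rows` is hypothetical.

```python
TOTAL_ROWS = 12_345_678        # rows inserted by generate_series above
EST_ROWS_ROW_CMP = 642         # planner estimate for the row comparison alone
ACTUAL_ROWS = 542              # rows actually returned by that query

# Selectivity implied by the planner's estimate for the row comparison.
sel_row_cmp = EST_ROWS_ROW_CMP / TOTAL_ROWS
# Assumption: "a >= 123450" gets roughly the same selectivity, because both
# conditions are dominated by the leading column.
sel_a_ge_x = sel_row_cmp


def estimated_rows(total_rows: int, *selectivities: float) -> int:
    """Multiply selectivities as if independent; never estimate below 1 row."""
    combined = 1.0
    for sel in selectivities:
        combined *= sel
    return max(1, round(total_rows * combined))


alone = estimated_rows(TOTAL_ROWS, sel_row_cmp)
with_workaround = estimated_rows(TOTAL_ROWS, sel_row_cmp, sel_a_ge_x)

print(f"row comparison only:      {alone} rows estimated")
print(f"with extra 'a >= x' term: {with_workaround} rows estimated")
print(f"actual rows:              {ACTUAL_ROWS}")
```

Under these assumptions the combined estimate collapses to a single row even though the added condition is redundant and the real result still has 542 rows.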

Best regards,
Yan


--000000000000a29ed60618253c8a--