public inbox for [email protected]
help / color / mirror / Atom feedFrom: lakshmi <[email protected]>
To: Tomas Vondra <[email protected]>
Cc: Chengpeng Yan <[email protected]>
Cc: Pavel Stehule <[email protected]>
Cc: John Naylor <[email protected]>
Cc: [email protected] <[email protected]>
Subject: Re: Add a greedy join search algorithm to handle large join problems
Date: Mon, 16 Feb 2026 16:14:16 +0530
Message-ID: <CAEvyyTigh2eB9hGirHzAC9j3SrMW1otMNxm7yHYOU3xs8x+FLA@mail.gmail.com> (raw)
In-Reply-To: <[email protected]>
References: <[email protected]>
<[email protected]>
<[email protected]>
<[email protected]>
<[email protected]>
<[email protected]>
<[email protected]>
<CANWCAZY529EPHyo1kLnEzjFBq-UaDPc3KErK=ApqDZZ1Oc-XHg@mail.gmail.com>
<CAFj8pRCO5ocbr-wFWx5QsKdfkW-=XuQ6zkW5FES7ERQZQHtpwQ@mail.gmail.com>
<[email protected]>
<CAFj8pRASJuRQKHOoBTnR5aRUeRKpNAmrYQcBrQb=yqeZ_8me9Q@mail.gmail.com>
<CAEvyyTi1M6JhHb6sR+xK-kp2bezMoADSC+RY2A+DbdEn+_BLxA@mail.gmail.com>
<[email protected]>
<CAEvyyTizQ9ki++g0P8-2Ae2OundUb-2=cS2-PQHe-LYPzhSS1A@mail.gmail.com>
<CAEvyyTjqcn9RwFb-S_Kx-+3b_Zg9YCUyhqoDsgvUcrR=pkMB0A@mail.gmail.com>
<[email protected]>
Hi Tomas,
Thank you for the question.
The 15-table and 20-table results I shared were obtained using a synthetic
join workload designed to stress join-order planning and measure
planning-time scaling, rather than a JOB or TPC-H query.
Each query is essentially a left-deep chain of equality joins over simple
tables. For reference, the structure is equivalent to:
15-table join
SELECT count(*)
FROM t1
JOIN t2 ON t1.id = t2.id
JOIN t3 ON t2.id = t3.id
...
JOIN t15 ON t14.id = t15.id;
20-table join
SELECT count(*)
FROM t1
JOIN t2 ON t1.id = t2.id
JOIN t3 ON t2.id = t3.id
...
JOIN t20 ON t19.id = t20.id;
Regarding planner settings:
-geqo_threshold was set to:
a high value (e.g., 100) to force DP
a low value (e.g., 2) to allow GEQO/GOO
-enable_goo_join_search was toggled on/off depending on the comparison
being measured.
-Other planner parameters, including join_collapse_limit, were left at
their default values.
So these experiments mainly evaluate planning-time scaling and basic plan
sanity on a controlled join graph, rather than realistic workload plan
quality.
I’m currently preparing additional tests using selected JOB queries to
provide more meaningful plan-quality comparisons and will share those
results once available.
Regards
Lakshmi
reply
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Reply to all the recipients using the --to and --cc options:
reply via email
To: [email protected]
Cc: [email protected], [email protected], [email protected], [email protected], [email protected], [email protected]
Subject: Re: Add a greedy join search algorithm to handle large join problems
In-Reply-To: <CAEvyyTigh2eB9hGirHzAC9j3SrMW1otMNxm7yHYOU3xs8x+FLA@mail.gmail.com>
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox