Re: Add a greedy join search algorithm to handle large join problems

public inbox for [email protected]  
help / color / mirror / Atom feed

From: lakshmi <[email protected]>
To: Tomas Vondra <[email protected]>
Cc: Chengpeng Yan <[email protected]>
Cc: Pavel Stehule <[email protected]>
Cc: John Naylor <[email protected]>
Cc: [email protected] <[email protected]>
Subject: Re: Add a greedy join search algorithm to handle large join problems
Date: Mon, 16 Feb 2026 16:14:16 +0530
Message-ID: <CAEvyyTigh2eB9hGirHzAC9j3SrMW1otMNxm7yHYOU3xs8x+FLA@mail.gmail.com> (raw)
In-Reply-To: <[email protected]>
References: <[email protected]>
	<[email protected]>
	<[email protected]>
	<[email protected]>
	<[email protected]>
	<[email protected]>
	<[email protected]>
	<CANWCAZY529EPHyo1kLnEzjFBq-UaDPc3KErK=ApqDZZ1Oc-XHg@mail.gmail.com>
	<CAFj8pRCO5ocbr-wFWx5QsKdfkW-=XuQ6zkW5FES7ERQZQHtpwQ@mail.gmail.com>
	<[email protected]>
	<CAFj8pRASJuRQKHOoBTnR5aRUeRKpNAmrYQcBrQb=yqeZ_8me9Q@mail.gmail.com>
	<CAEvyyTi1M6JhHb6sR+xK-kp2bezMoADSC+RY2A+DbdEn+_BLxA@mail.gmail.com>
	<[email protected]>
	<CAEvyyTizQ9ki++g0P8-2Ae2OundUb-2=cS2-PQHe-LYPzhSS1A@mail.gmail.com>
	<CAEvyyTjqcn9RwFb-S_Kx-+3b_Zg9YCUyhqoDsgvUcrR=pkMB0A@mail.gmail.com>
	<[email protected]>

Hi Tomas,

Thank you for the question.
The 15-table and 20-table results I shared were obtained using a synthetic
join workload designed to stress join-order planning and measure
planning-time scaling, rather than a JOB or TPC-H query.
Each query is essentially a left-deep chain of equality joins over simple
tables. For reference, the structure is equivalent to:


15-table join

SELECT count(*)

FROM t1

JOIN t2 ON t1.id = t2.id

JOIN t3 ON t2.id = t3.id

 ...

JOIN t15 ON t14.id = t15.id;



20-table join


SELECT count(*)

FROM t1

JOIN t2 ON t1.id = t2.id

JOIN t3 ON t2.id = t3.id

...

JOIN t20 ON t19.id = t20.id;

Regarding planner settings:

-geqo_threshold was set to:

                 a high value (e.g., 100) to force DP

                 a low value (e.g., 2) to allow GEQO/GOO


-enable_goo_join_search was toggled on/off depending on the comparison
being     measured.


-Other planner parameters, including join_collapse_limit, were left at
their default values.

So these experiments mainly evaluate planning-time scaling and basic plan
sanity on a controlled join graph, rather than realistic workload plan
quality.

I’m currently preparing additional tests using selected JOB queries to
provide more meaningful plan-quality comparisons and will share those
results once available.

Regards
Lakshmi

reply

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Reply to all the recipients using the --to and --cc options:
  reply via email

  To: [email protected]
  Cc: [email protected], [email protected], [email protected], [email protected], [email protected], [email protected]
  Subject: Re: Add a greedy join search algorithm to handle large join problems
  In-Reply-To: <CAEvyyTigh2eB9hGirHzAC9j3SrMW1otMNxm7yHYOU3xs8x+FLA@mail.gmail.com>

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox