From: Chris Cogdon <chris@cogdon.org>
Content-Type: multipart/alternative;
 boundary="Apple-Mail=_B0CA19EA-A7C4-4BC0-AD40-74834E62FAA0"
Subject: Adding a ROLLUP switches to GroupAggregate unexpectedly
Message-Id: <09FB46A1-B2CA-4C65-9959-D4D03C81EBB1@cogdon.org>
Date: Thu, 31 Mar 2016 10:03:27 -0700
To: pgsql-performance@postgresql.org
Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2104\))
Precedence: bulk
Sender: pgsql-performance-owner@postgresql.org


--Apple-Mail=_B0CA19EA-A7C4-4BC0-AD40-74834E62FAA0
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8

Hi folks! I=E2=80=99ve a query where adding a rollup to the group by =
switches to GroupAggregate unexpectedly, where the standard GROUP BY =
uses HashAggregate. Since the rollup should only add one additional =
bucket, the switch to having to sort (and thus a to-disk temporary file) =
is very puzzling. This reads like a query optimiser bug to me. This is =
the first I=E2=80=99ve posted to the list, please forgive me if I=E2=80=99=
ve omitted any =E2=80=9Cbefore bugging the list=E2=80=9D homework.


Description: Adding a summary row by changing =E2=80=9CGROUP BY x=E2=80=9D=
 into =E2=80=9CGROUP BY ROLLUP (x)=E2=80=9D should not cause a switch =
from HashAggregate to GroupAggregate


Here=E2=80=99s the =E2=80=9Cexplain=E2=80=9D from the simple GROUP BY:

projectdb=3D> explain analyze verbose SELECT error_code, count ( * ) =
FROM api_activities GROUP BY error_code;
                                                                 QUERY =
PLAN                                                                 =20
=
--------------------------------------------------------------------------=
-------------------------------------------------------------------
 HashAggregate  (cost=3D3456930.11..3456930.16 rows=3D5 width=3D2) =
(actual time=3D26016.222..26016.223 rows=3D5 loops=3D1)
   Output: error_code, count(*)
   Group Key: api_activities.error_code
   ->  Seq Scan on public.api_activities  (cost=3D0.00..3317425.74 =
rows=3D27900874 width=3D2) (actual time=3D0.018..16232.608 rows=3D36224844=
 loops=3D1)
         Output: id, client_id, date_added, kind, activity, error_code
 Planning time: 0.098 ms
 Execution time: 26016.337 ms
(7 rows)

Changing this to a GROUP BY ROLLUP switches to GroupAggregate (with the =
corresponding to-disk temporary table being created):

projectdb=3D> explain analyze verbose SELECT error_code, count ( * ) =
FROM api_activities GROUP BY rollup (error_code);
                                                                    =
QUERY PLAN                                                               =
     =20
=
--------------------------------------------------------------------------=
-------------------------------------------------------------------------
 GroupAggregate  (cost=3D7149357.90..7358614.52 rows=3D6 width=3D2) =
(actual time=3D54271.725..82354.144 rows=3D6 loops=3D1)
   Output: error_code, count(*)
   Group Key: api_activities.error_code
   Group Key: ()
   ->  Sort  (cost=3D7149357.90..7219110.09 rows=3D27900874 width=3D2) =
(actual time=3D54270.636..76651.121 rows=3D36222428 loops=3D1)
         Output: error_code
         Sort Key: api_activities.error_code
         Sort Method: external merge  Disk: 424864kB
         ->  Seq Scan on public.api_activities  (cost=3D0.00..3317425.74 =
rows=3D27900874 width=3D2) (actual time=3D0.053..34282.239 rows=3D36222428=
 loops=3D1)
               Output: error_code
 Planning time: 2.611 ms
 Execution time: 82437.416 ms
(12 rows)


I=E2=80=99ve given the output of =E2=80=9CEXPLAIN ANAYLZE VERBOSE=E2=80=9D=
 rather than non-analyze, but there was no difference in the plan.

Running VACUUM FULL ANALYZE on this table makes no difference. Switching =
to Count(error_code) makes no difference. Using GROUP BY GROUPING SETS =
((), error_code) makes no difference.

I understand that a HashAggregate is possible only if it can fit all the =
aggregates into work_mem. There are 5 different error codes, and the =
statistics (from pg_stats) are showing that PG knows this. Adding just =
one more bucket for the =E2=80=9C()=E2=80=9D case should not cause a =
fallback to GroupAggregate.


PostgreSQL version: 9.5.2 (just upgraded today, Thank you! <3 )

(Was exhibiting same problem under 9.5.0)


How installed: apt-get package from apt.postgresql.org =
<http://apt.postgresql.org/>


Settings differences:

 application_name: psql
 client_encoding: UTF8
 DateStyle: ISO, MDY
 default_text_search_config: pg_catalog.english
 dynamic_shared_memory_type: posix
 lc_messages: en_US.UTF-8
 lc_monetary: en_US.UTF-8
 lc_numeric: en_US.UTF-8
 lc_time: en_US.UTF-8
 listen_addresses: *
 log_line_prefix: %t [%p-%c-%l][%a][%i][%e][%s][%x-%v] %q%u@%d=20
 log_timezone: UTC
 logging_collector: on
 max_connections: 100
 max_stack_depth: 2MB
 port: 5432
 shared_buffers: 1GB
 ssl: on
 ssl_cert_file: /etc/ssl/certs/ssl-cert-snakeoil.pem
 ssl_key_file: /etc/ssl/private/ssl-cert-snakeoil.key
 TimeZone: UTC
 work_mem: 128MB


OS and Version: Ubuntu Trusty: Linux 3.13.0-66-generic #108-Ubuntu SMP =
Wed Oct 7 15:20:27 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux


Program used to connect: psql


Nothing unusual in the logs, apart from the query indicating that it =
took a while to run.


I know that there=E2=80=99s several workarounds I can use for this =
simple case, such as using a CTE, then doing a rollup on that, but I=E2=80=
=99m simply reporting what I think is a bug in the query optimizer.


Thank you for your attention! Please let me know if there=E2=80=99s any =
additional information you need, or additional tests you=E2=80=99d like =
to run.


=E2=80=94 Chris Cogdon <chris@cogdon.org <mailto:chris@cogdon.org>>
=E2=80=94 Using PostgreSQL since 6.2!=20


--Apple-Mail=_B0CA19EA-A7C4-4BC0-AD40-74834E62FAA0
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=utf-8

<html><head><meta http-equiv=3D"Content-Type" content=3D"text/html =
charset=3Dutf-8"></head><body style=3D"word-wrap: break-word; =
-webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" =
class=3D"">Hi folks! I=E2=80=99ve a query where adding a rollup to the =
group by switches to GroupAggregate unexpectedly, where the standard =
GROUP BY uses HashAggregate. Since the rollup should only add one =
additional bucket, the switch to having to sort (and thus a to-disk =
temporary file) is very puzzling. This reads like a query optimiser bug =
to me. This is the first I=E2=80=99ve posted to the list, please forgive =
me if I=E2=80=99ve omitted any =E2=80=9Cbefore bugging the list=E2=80=9D =
homework.<div class=3D""><br class=3D""><div class=3D""><br =
class=3D""></div><div class=3D"">Description: Adding a summary row by =
changing =E2=80=9CGROUP BY x=E2=80=9D into =E2=80=9CGROUP BY ROLLUP =
(x)=E2=80=9D should not cause a switch from HashAggregate to =
GroupAggregate</div><div class=3D""><br class=3D""></div><div =
class=3D""><br class=3D""></div><div class=3D"">Here=E2=80=99s the =
=E2=80=9Cexplain=E2=80=9D from the simple GROUP BY:</div><div =
class=3D""><br class=3D""></div><div class=3D""><div style=3D"margin: =
0px; font-size: 11px; font-family: Menlo;" class=3D""><div =
style=3D"margin: 0px;" class=3D""><div style=3D"margin: 0px;" =
class=3D"">projectdb=3D&gt; explain analyze verbose SELECT error_code, =
count ( * ) FROM api_activities GROUP BY error_code;</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; QUERY PLAN =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp;</div><div style=3D"margin: 0px;" =
class=3D"">---------------------------------------------------------------=
--------------------------------------------------------------------------=
----</div><div style=3D"margin: 0px;" class=3D"">&nbsp;HashAggregate&nbsp;=
 (cost=3D3456930.11..3456930.16 rows=3D5 width=3D2) (actual =
time=3D26016.222..26016.223 rows=3D5 loops=3D1)</div><div style=3D"margin:=
 0px;" class=3D"">&nbsp;&nbsp; Output: error_code, count(*)</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; Group Key: =
api_activities.error_code</div><div style=3D"margin: 0px;" =
class=3D"">&nbsp;&nbsp; -&gt;&nbsp; Seq Scan on =
public.api_activities&nbsp; (cost=3D0.00..3317425.74 rows=3D27900874 =
width=3D2) (actual time=3D0.018..16232.608 rows=3D36224844 =
loops=3D1)</div><div style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; =
&nbsp; &nbsp; &nbsp; Output: id, client_id, date_added, kind, activity, =
error_code</div><div style=3D"margin: 0px;" class=3D"">&nbsp;Planning =
time: 0.098 ms</div><div style=3D"margin: 0px;" class=3D"">&nbsp;Execution=
 time: 26016.337 ms</div><div style=3D"margin: 0px;" class=3D"">(7 =
rows)</div><div class=3D""><br class=3D""></div></div></div></div><div =
class=3D"">Changing this to a GROUP BY ROLLUP switches to GroupAggregate =
(with the corresponding to-disk temporary table being =
created):</div><div class=3D""><br class=3D""></div><div class=3D""><div =
style=3D"margin: 0px;" class=3D""><div style=3D"font-family: Menlo; =
font-size: 11px; margin: 0px;" class=3D""><div style=3D"margin: 0px;" =
class=3D"">projectdb=3D&gt; explain analyze verbose SELECT error_code, =
count ( * ) FROM api_activities GROUP BY rollup (error_code);</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; QUERY =
PLAN&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;</div><div style=3D"margin: =
0px;" =
class=3D"">---------------------------------------------------------------=
--------------------------------------------------------------------------=
----------</div><div style=3D"margin: 0px;" =
class=3D"">&nbsp;GroupAggregate&nbsp; (cost=3D7149357.90..7358614.52 =
rows=3D6 width=3D2) (actual time=3D54271.725..82354.144 rows=3D6 =
loops=3D1)</div><div style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; =
Output: error_code, count(*)</div><div style=3D"margin: 0px;" =
class=3D"">&nbsp;&nbsp; Group Key: api_activities.error_code</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; Group Key: ()</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; -&gt;&nbsp; Sort&nbsp; =
(cost=3D7149357.90..7219110.09 rows=3D27900874 width=3D2) (actual =
time=3D54270.636..76651.121 rows=3D36222428 loops=3D1)</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; =
Output: error_code</div><div style=3D"margin: 0px;" =
class=3D"">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; Sort Key: =
api_activities.error_code</div><div style=3D"margin: 0px;" =
class=3D"">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; Sort Method: external =
merge&nbsp; Disk: 424864kB</div><div style=3D"margin: 0px;" =
class=3D"">&nbsp;&nbsp; &nbsp; &nbsp; &nbsp; -&gt;&nbsp; Seq Scan on =
public.api_activities&nbsp; (cost=3D0.00..3317425.74 rows=3D27900874 =
width=3D2) (actual time=3D0.053..34282.239 rows=3D36222428 =
loops=3D1)</div><div style=3D"margin: 0px;" class=3D"">&nbsp;&nbsp; =
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; Output: error_code</div><div =
style=3D"margin: 0px;" class=3D"">&nbsp;Planning time: 2.611 =
ms</div><div style=3D"margin: 0px;" class=3D"">&nbsp;Execution time: =
82437.416 ms</div><div style=3D"margin: 0px;" class=3D"">(12 =
rows)</div><div class=3D""><br class=3D""></div></div><div =
style=3D"font-family: Menlo; font-size: 11px;" class=3D""><br =
class=3D""></div><div class=3D"">I=E2=80=99ve given the output of =
=E2=80=9CEXPLAIN ANAYLZE VERBOSE=E2=80=9D rather than non-analyze, but =
there was no difference in the plan.</div><div class=3D""><br =
class=3D""></div><div class=3D"">Running VACUUM FULL ANALYZE on this =
table makes no difference. Switching to Count(error_code) makes no =
difference. Using GROUP BY GROUPING SETS ((), error_code) makes no =
difference.</div><div style=3D"margin: 0px;" class=3D""><br =
class=3D""></div>I understand that a HashAggregate is possible only if =
it can fit all the aggregates into work_mem. There are 5 different error =
codes, and the statistics (from pg_stats) are showing that PG knows =
this. Adding just one more&nbsp;bucket for the =E2=80=9C()=E2=80=9D case =
should not cause a fallback to GroupAggregate.<br =
class=3D""></div></div><span class=3D""><br class=3D""></span><div =
class=3D""><br class=3D""></div><div class=3D"">PostgreSQL version: =
9.5.2 (just upgraded today, Thank you! &lt;3 )</div><div class=3D""><br =
class=3D""></div><div class=3D"">(Was exhibiting same problem under =
9.5.0)</div><div class=3D""><br class=3D""></div><div class=3D""><br =
class=3D""></div><div class=3D"">How installed: apt-get package =
from&nbsp;<a href=3D"http://apt.postgresql.org" =
class=3D"">apt.postgresql.org</a></div><div class=3D""><br =
class=3D""></div><div class=3D""><br class=3D""></div><div =
class=3D"">Settings differences:</div><div class=3D""><br =
class=3D""></div><div class=3D""><div style=3D"margin: 0px; font-size: =
11px; font-family: Menlo;" class=3D"">&nbsp;application_name: =
psql</div><div style=3D"margin: 0px; font-size: 11px; font-family: =
Menlo;" class=3D"">&nbsp;client_encoding: UTF8</div><div style=3D"margin: =
0px; font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;DateStyle: =
ISO, MDY</div><div style=3D"margin: 0px; font-size: 11px; font-family: =
Menlo;" class=3D"">&nbsp;default_text_search_config: =
pg_catalog.english</div><div style=3D"margin: 0px; font-size: 11px; =
font-family: Menlo;" class=3D"">&nbsp;dynamic_shared_memory_type: =
posix</div><div style=3D"margin: 0px; font-size: 11px; font-family: =
Menlo;" class=3D"">&nbsp;lc_messages: en_US.UTF-8</div><div =
style=3D"margin: 0px; font-size: 11px; font-family: Menlo;" =
class=3D"">&nbsp;lc_monetary: en_US.UTF-8</div><div style=3D"margin: =
0px; font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;lc_numeric: =
en_US.UTF-8</div><div style=3D"margin: 0px; font-size: 11px; =
font-family: Menlo;" class=3D"">&nbsp;lc_time: en_US.UTF-8</div><div =
style=3D"margin: 0px; font-size: 11px; font-family: Menlo;" =
class=3D"">&nbsp;listen_addresses: *</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;log_line_prefix: =
%t [%p-%c-%l][%a][%i][%e][%s][%x-%v] %q%u@%d&nbsp;</div><div =
style=3D"margin: 0px; font-size: 11px; font-family: Menlo;" =
class=3D"">&nbsp;log_timezone: UTC</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;logging_collector: =
on</div><div style=3D"margin: 0px; font-size: 11px; font-family: Menlo;" =
class=3D"">&nbsp;max_connections: 100</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;max_stack_depth: =
2MB</div><div style=3D"margin: 0px; font-size: 11px; font-family: =
Menlo;" class=3D"">&nbsp;port: 5432</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;shared_buffers: =
1GB</div><div style=3D"margin: 0px; font-size: 11px; font-family: =
Menlo;" class=3D"">&nbsp;ssl: on</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;ssl_cert_file: =
/etc/ssl/certs/ssl-cert-snakeoil.pem</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;ssl_key_file: =
/etc/ssl/private/ssl-cert-snakeoil.key</div><div style=3D"margin: 0px; =
font-size: 11px; font-family: Menlo;" class=3D"">&nbsp;TimeZone: =
UTC</div><div style=3D"margin: 0px; font-size: 11px; font-family: =
Menlo;" class=3D"">&nbsp;work_mem: 128MB</div></div><div class=3D""><br =
class=3D""></div><div class=3D""><br class=3D""></div><div class=3D"">OS =
and Version: Ubuntu Trusty:&nbsp;Linux 3.13.0-66-generic #108-Ubuntu SMP =
Wed Oct 7 15:20:27 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux</div><div =
class=3D""><br class=3D""></div><div class=3D""><br class=3D""></div><div =
class=3D"">Program used to connect: psql</div><div class=3D""><br =
class=3D""></div><div class=3D""><br class=3D""></div><div =
class=3D"">Nothing unusual in the logs, apart from the query indicating =
that it took a while to run.</div><div class=3D""><br =
class=3D""></div><div class=3D""><br class=3D""></div><div class=3D"">I =
know that there=E2=80=99s several workarounds I can use for this simple =
case, such as using a CTE, then doing a rollup on that, but I=E2=80=99m =
simply reporting what I think is a bug in the query optimizer.</div><div =
class=3D""><br class=3D""></div><div class=3D""><br class=3D""></div><div =
class=3D"">Thank you for your attention! Please let me know if there=E2=80=
=99s any additional information you need, or additional tests you=E2=80=99=
d like to run.</div><div class=3D""><br class=3D""></div><div =
class=3D""><br class=3D""></div><div class=3D"">=E2=80=94 Chris Cogdon =
&lt;<a href=3D"mailto:chris@cogdon.org" =
class=3D"">chris@cogdon.org</a>&gt;</div><div class=3D"">=E2=80=94 Using =
PostgreSQL since 6.2!&nbsp;</div><div class=3D""><br class=3D""></div><div=
 class=3D""><br class=3D""><div class=3D""><pre style=3D"padding: 1em; =
border: 1px dashed rgb(47, 111, 171); line-height: 1.1em;" =
class=3D""><font face=3D"Helvetica" class=3D""><br class=3D""></font><span=
 style=3D"background-color: rgb(249, 249, 249);" class=3D""><br =
class=3D""></span></pre></div></div></div></body></html>=

--Apple-Mail=_B0CA19EA-A7C4-4BC0-AD40-74834E62FAA0--