MIME-Version: 1.0
In-Reply-To: 
 <CAHyXU0wDi9VjfGC8aQeLsBq4ncLVOKJ=1QR6iRq71U2HXQso4Q@mail.gmail.com>
References: 
 <CAOGex3nXTRPZTD-KeoSwD=bj62hQrMK+6h30u09srV71sePqUA@mail.gmail.com>
 <CAHyXU0wDi9VjfGC8aQeLsBq4ncLVOKJ=1QR6iRq71U2HXQso4Q@mail.gmail.com>
From: =?UTF-8?Q?Fl=C3=A1vio_Henrique?= <yoshimit@gmail.com>
Date: Thu, 5 Jan 2017 14:51:41 -0200
Message-ID: 
 <CAOGex3=0DB-R9V558CkeoSOuU4KG_RyNh5etzo85o43xGcuVvQ@mail.gmail.com>
Subject: Re: Slow query after 9.3 to 9.6 migration
To: Merlin Moncure <mmoncure@gmail.com>
Cc: postgres performance list <pgsql-performance@postgresql.org>
Content-Type: multipart/alternative; boundary=001a114093a69cdef205455bb98b
Precedence: bulk
Sender: pgsql-performance-owner@postgresql.org

--001a114093a69cdef205455bb98b
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Hi all!
Sorry for the delay (holidays).

Well, the most expensive sequential scan is solved.
I asked the db team to drop the index and recreate it, and guess what: now
PostgreSQL is using it and the time dropped.
(thank you, @Gerardo Herzig!)

I think there's still room for improvement, but the problem is no longer so
critical.
I'll look into every suggestion mentioned here. Thank you all.

@Daniel Blanch
I'll run some tests with a materialized view. Thank you.

> On systems side: ask them if they have not changed anything in
> effective_cache_size and shared_buffers parameters, I presume they haven’t
> change anything related to costs.

Replying to your comment, I think they tuned the server:
effective_cache_size = 196GB
shared_buffers = 24GB (shouldn't this be higher?)
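
As a rough sanity check on those two values, here is a small Python sketch that expresses each setting as a share of the server's 250GB of RAM (the figure from my original message). The 25% starting point and the 40% ceiling are the general guidance in the PostgreSQL documentation for dedicated servers, not something measured on this system, and the function and constant names are made up for illustration.

```python
# Values taken from this thread; all sizes in GB.
TOTAL_RAM_GB = 250            # "Our server has 250GB of memory available"
SHARED_BUFFERS_GB = 24
EFFECTIVE_CACHE_SIZE_GB = 196

# General guidance from the PostgreSQL docs for a dedicated server
# (an assumption here, not a measurement on this machine).
SUGGESTED_START = 0.25        # reasonable starting value for shared_buffers
RARELY_USEFUL_ABOVE = 0.40    # larger allocations are unlikely to help


def share_of_ram(setting_gb, total_gb=TOTAL_RAM_GB):
    """Return a setting's size as a fraction of total RAM."""
    return setting_gb / total_gb


if __name__ == "__main__":
    sb = share_of_ram(SHARED_BUFFERS_GB)
    ecs = share_of_ram(EFFECTIVE_CACHE_SIZE_GB)
    print(f"shared_buffers:       {sb:.1%} of RAM")
    print(f"effective_cache_size: {ecs:.1%} of RAM")
    print(f"docs' starting point: {SUGGESTED_START * TOTAL_RAM_GB:.1f}GB, "
          f"rarely useful above {RARELY_USEFUL_ABOVE * TOTAL_RAM_GB:.1f}GB")
```

Note that effective_cache_size is only an estimate the planner uses for costing; it does not allocate any memory, so a large value there is not a problem in itself.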

@Kevin Grittner
Sorry, I'm not sure how to tell whether autovacuum is aggressive enough, but
here are my related settings:
autovacuum                          |on
autovacuum_analyze_scale_factor     |0.05
autovacuum_analyze_threshold        |10
autovacuum_freeze_max_age           |200000000
autovacuum_max_workers              |3
autovacuum_multixact_freeze_max_age |400000000
autovacuum_naptime                  |15s
autovacuum_vacuum_cost_delay        |10ms
autovacuum_vacuum_cost_limit        |-1
autovacuum_vacuum_scale_factor      |0.1
autovacuum_vacuum_threshold         |10
autovacuum_work_mem                 |-1
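
To judge how aggressive these settings are, it may help to turn them into concrete numbers. According to the PostgreSQL documentation, autovacuum vacuums a table once its dead tuples exceed autovacuum_vacuum_threshold + autovacuum_vacuum_scale_factor * (number of tuples), and it analyzes a table using the analogous formula with the analyze settings. A minimal Python sketch with the values above; the table sizes are hypothetical, chosen only for illustration:

```python
# Settings copied from the list above.
VACUUM_THRESHOLD = 10
VACUUM_SCALE_FACTOR = 0.1
ANALYZE_THRESHOLD = 10
ANALYZE_SCALE_FACTOR = 0.05


def autovacuum_trigger(reltuples, base_threshold, scale_factor):
    """Changed-tuple count above which autovacuum acts on a table.

    Implements: base_threshold + scale_factor * reltuples
    """
    return base_threshold + scale_factor * reltuples


if __name__ == "__main__":
    # Hypothetical table sizes (rows); not taken from the real schema.
    for reltuples in (10_000, 1_000_000, 100_000_000):
        vac = autovacuum_trigger(reltuples, VACUUM_THRESHOLD, VACUUM_SCALE_FACTOR)
        ana = autovacuum_trigger(reltuples, ANALYZE_THRESHOLD, ANALYZE_SCALE_FACTOR)
        print(f"{reltuples:>11,} rows: vacuum after ~{vac:,.0f} dead tuples, "
              f"analyze after ~{ana:,.0f} changed tuples")
```

With a scale factor, the trigger grows with the table, so a very large table can accumulate many dead tuples before autovacuum touches it; per-table storage parameters are the usual way to tighten that if it turns out to matter.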

@Merlin Moncure

> Big gains (if any) are likely due to indexing strategy.
> I do see some suspicious casting, for example:
> Join Filter: ((four_charlie.delta_tango)::integer =
> (six_quebec.golf_bravo)::integer)
> Are you casting in the query or joining through dissimilar data types?

No casts in the query. The joins are on the same data types.

Thank you all for the answers. Happy 2017!

Flávio Henrique
--------------------------------------------------------
"There are only 10 types of people in the world: Those who understand
binary, and those who don't"
--------------------------------------------------------

On Thu, Jan 5, 2017 at 12:40 PM, Merlin Moncure <mmoncure@gmail.com> wrote:

> On Tue, Dec 27, 2016 at 5:50 PM, Flávio Henrique <yoshimit@gmail.com>
> wrote:
> > Hi there, fellow experts!
> >
> > I need advice on a query that became slower after the 9.3 to 9.6
> > migration.
> >
> > First of all, I'm from the dev team.
> >
> > Before the migration, we (programmers) made some modifications to the
> > query, bringing its average time from 8s to 2-3s.
> >
> > As this query is the most executed on our system (it builds the user
> > panel to work), every bit that we can squeeze from it will be nice.
> >
> > Now, after server migration to 9.6 we're experiencing bad times with
> > this query again.
> >
> > Unfortunately, I don't have the old query plan (9.3 version) to show
> > you, but in the current version (9.6) I can see some buffers written,
> > which tells me that something is wrong.
> >
> > Our server has 250GB of memory available, but the database team says
> > that they can't do anything to make this query better. I'm not sure, as
> > some buffers are written to disk.
> >
> > Any tip/help will be much appreciated (even from the query side).
> >
> > Thank you!
> >
> > The query plan: https://explain.depesz.com/s/5KMn
> >
> > Note: I already tried adding an index on the kilo_victor table, but
> > PostgreSQL still thinks it is better to do a seq scan.
>
> Hard to provide more without the query or the 'old' plan.   Here are
> some things you can try:
> *) Set effective_io_concurrency high.    You have some heap scanning
> going on and this can sometimes help (but it should be marginal).
> *) See if you can get any juice out of parallel query
> *) try playing with enable_nestloop and enable_seqscan.   these are
> hail mary passes but worth a shot.
>
> Run the query back to back with same arguments in the same database
> session. Does performance improve?
>
> Big gains (if any) are likely due to indexing strategy.
> I do see some suspicious casting, for example:
>
> Join Filter: ((four_charlie.delta_tango)::integer =
> (six_quebec.golf_bravo)::integer)
>
> Are you casting in the query or joining through dissimilar data types?
>  I suspect your database team might be incorrect.
>
> merlin
>

--001a114093a69cdef205455bb98b--