MIME-Version: 1.0
In-Reply-To: 
 <CAB9893g-1fpvh=0snbe7qFJKfXEsn2YxR3ZWZ6-JxrMCyaZg3Q@mail.gmail.com>
References: 
 <CAB9893izHQaPTk1bGEDs8UTQUTKtpj1sk6PLyWrvXU-j0JBFaQ@mail.gmail.com>
 <CAKJS1f-AkrKfLEsrb7ymZve_b3e9cTKUcEdeeeJkVWnOTVdPnA@mail.gmail.com>
 <CAB9893hmTC-TMeFN8S91NWS_++3w2t5D0X7O-ogsZZ8zEyxv6w@mail.gmail.com>
 <3138.1505508143@sss.pgh.pa.us>
 <CAB9893g-1fpvh=0snbe7qFJKfXEsn2YxR3ZWZ6-JxrMCyaZg3Q@mail.gmail.com>
From: Mike Broers <mbroers@gmail.com>
Date: Wed, 20 Sep 2017 11:15:53 -0500
Message-ID: 
 <CAB9893iWzGVDh1GRKPdDwUx=fbe4uKCx2GxQOy3jDQ9Xe+uD8Q@mail.gmail.com>
Subject: Re: query of partitioned object doesnt use index in qa
To: Tom Lane <tgl@sss.pgh.pa.us>
Cc: David Rowley <david.rowley@2ndquadrant.com>,
	postgres performance list <pgsql-performance@postgresql.org>
Content-Type: multipart/alternative; boundary="001a1147d64e49bba50559a14a25"
Precedence: bulk
Sender: pgsql-performance-owner@postgresql.org

--001a1147d64e49bba50559a14a25
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

I was able to add the suggested indexes
like stage.event__00075000((body->>'SID'::text)); and indeed these helped
the QA environment use those indexes instead of sequence scanning.

I'm still perplexed by my original question, why production uses the
partition index and qa does not?

Index Scan using ix_event__00014695_landing_id on event__00014695 e_3
(cost=3D0.56..39137.89
rows=3D37697 width=3D564)               =E2=94=82

=E2=94=82                                 Index Cond: (landing_id =3D
t_sap.landing_id)


Ultimately I think this is just highlighting the need in my environment to
set random_page_cost lower (we are on an SSD SAN anyway..), but I dont
think I have a satisfactory reason by the row estimates are so bad in the
QA planner and why it doesnt use that partition index there.


On Fri, Sep 15, 2017 at 3:59 PM, Mike Broers <mbroers@gmail.com> wrote:

> That makes a lot of sense, thanks for taking a look.  An index like you
> suggest would probably further improve the query.   Is that suggestion
> sidestepping the original problem that production is evaluating the
> landing_id bit with the partition index and qa is sequence scanning inste=
ad?
>
> AND exists (select 1 from t_sap where e.landing_id =3D t_sap.landing_id))=
 as
> rankings;
>
> Based on the difference in row estimate I am attempting an analyze with a
> higher default_statistic_target (currently 100) to see if that helps.
>
>
>
>
> On Fri, Sep 15, 2017 at 3:42 PM, Tom Lane <tgl@sss.pgh.pa.us> wrote:
>
>> Mike Broers <mbroers@gmail.com> writes:
>> > If Im reading this correctly postgres thinks the partition will return
>> 6.5
>> > million matching rows but actually comes back with 162k.  Is this a ca=
se
>> > where something is wrong with the analyze job?
>>
>> You've got a lot of scans there that're using conditions like
>>
>> > =E2=94=82                           ->  Seq Scan on event__99999999 e_=
1
>> (cost=3D0.00..2527828.05 rows=3D11383021 width=3D778) (actual
>> time=3D25522.389..747238.885 rows=3D42 loops=3D1)
>> > =E2=94=82                                 Filter: (((body ->> 'SID'::t=
ext) IS
>> NOT NULL) AND (validation_status_code =3D 'P'::bpchar))
>> > =E2=94=82                                 Rows Removed by Filter: 1217=
2186
>>
>> While I'd expect the planner to be pretty solid on estimating the
>> validation_status_code condition, it's not going to have any idea about
>> that JSON field test.  That's apparently very selective, but you're just
>> getting a default estimate, which is not going to think that a NOT NULL
>> test will exclude lots of rows.
>>
>> One thing you could consider doing about this is creating an index
>> on (body ->> 'SID'::text), which would prompt ANALYZE to gather statisti=
cs
>> about that expression.  Even if the index weren't actually used in the
>> plan, this might improve the estimates and the resulting planning choice=
s
>> enough to make it worth maintaining such an index.
>>
>> Or you could think about pulling that field out and storing it on its ow=
n.
>> JSON columns are great for storing random unstructured data, but they ar=
e
>> less great when you want to do relational-ish things on subfields.
>>
>>                         regards, tom lane
>>
>
>

--001a1147d64e49bba50559a14a25
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">I was able to add the suggested indexes like=C2=A0stage.ev=
ent__00075000((body-&gt;&gt;&#39;SID&#39;::text)); and indeed these helped =
the QA environment use those indexes instead of sequence scanning.=C2=A0<di=
v><br></div><div>I&#39;m still perplexed by my original question, why produ=
ction uses the partition index and qa does not?</div><div><p class=3D"gmail=
-m_5399471546080563555gmail-p1" style=3D"font-size:12.8px"><span class=3D"g=
mail-m_5399471546080563555gmail-s1">Index Scan using ix_event__00014695_lan=
ding_id on event__00014695 e_3<span class=3D"gmail-m_5399471546080563555gma=
il-Apple-converted-space">=C2=A0=C2=A0</span>(cost=3D0.56..39137.89 rows=3D=
37697 width=3D564)=C2=A0<span class=3D"gmail-m_5399471546080563555gmail-App=
le-converted-space">=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0=C2=A0<=
/span>=E2=94=82</span></p><p class=3D"gmail-m_5399471546080563555gmail-p1" =
style=3D"font-size:12.8px"><span class=3D"gmail-m_5399471546080563555gmail-=
s1">=E2=94=82=C2=A0<span class=3D"gmail-m_5399471546080563555gmail-Apple-co=
nverted-space">=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0=C2=A0</span>Index Cond=
: (landing_id =3D t_sap.landing_id)=C2=A0<span class=3D"gmail-m_53994715460=
80563555gmail-Apple-converted-space">=C2=A0 =C2=A0=C2=A0</span></span></p><=
p class=3D"gmail-m_5399471546080563555gmail-p1" style=3D"font-size:12.8px">=
<br></p><p class=3D"gmail-m_5399471546080563555gmail-p1" style=3D"font-size=
:12.8px">Ultimately I think this is just highlighting the need in my enviro=
nment to set random_page_cost lower (we are on an SSD SAN anyway..), but I =
dont think I have a satisfactory reason by the row estimates are so bad in =
the QA planner and why it doesnt use that partition index there.</p><p clas=
s=3D"gmail-m_5399471546080563555gmail-p1" style=3D"font-size:12.8px"><br></=
p><p class=3D"gmail-m_5399471546080563555gmail-p1" style=3D"font-size:12.8p=
x"><span class=3D"gmail-m_5399471546080563555gmail-s1"><span class=3D"gmail=
-m_5399471546080563555gmail-Apple-converted-space"><br></span></span></p></=
div></div><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Fri,=
 Sep 15, 2017 at 3:59 PM, Mike Broers <span dir=3D"ltr">&lt;<a href=3D"mail=
to:mbroers@gmail.com" target=3D"_blank">mbroers@gmail.com</a>&gt;</span> wr=
ote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border=
-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr">That makes a lot of=
 sense, thanks for taking a look.=C2=A0 An index like you suggest would pro=
bably further improve the query. =C2=A0 Is that suggestion sidestepping the=
 original problem that production is evaluating the landing_id bit with the=
 partition index and qa is sequence scanning instead?<span class=3D""><div>=
<br></div><div>AND exists (select 1 from t_sap where e.landing_id =3D t_sap=
.landing_id)) as rankings;<span class=3D"m_6725776135303111923gmail-Apple-c=
onverted-space">=C2=A0</span><br></div><div><span class=3D"m_67257761353031=
11923gmail-Apple-converted-space"><br></span></div></span><div><span class=
=3D"m_6725776135303111923gmail-Apple-converted-space">Based on the differen=
ce in row estimate I am attempting an analyze with a higher default_statist=
ic_target (currently 100) to see if that helps.</span></div><div><br></div>=
<div><span class=3D"m_6725776135303111923gmail-Apple-converted-space"><br><=
/span></div><div><span class=3D"m_6725776135303111923gmail-Apple-converted-=
space"><br></span></div></div><div class=3D"HOEnZb"><div class=3D"h5"><div =
class=3D"gmail_extra"><br><div class=3D"gmail_quote">On Fri, Sep 15, 2017 a=
t 3:42 PM, Tom Lane <span dir=3D"ltr">&lt;<a href=3D"mailto:tgl@sss.pgh.pa.=
us" target=3D"_blank">tgl@sss.pgh.pa.us</a>&gt;</span> wrote:<br><blockquot=
e class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc sol=
id;padding-left:1ex"><span>Mike Broers &lt;<a href=3D"mailto:mbroers@gmail.=
com" target=3D"_blank">mbroers@gmail.com</a>&gt; writes:<br>
&gt; If Im reading this correctly postgres thinks the partition will return=
 6.5<br>
&gt; million matching rows but actually comes back with 162k.=C2=A0 Is this=
 a case<br>
&gt; where something is wrong with the analyze job?<br>
<br>
</span>You&#39;ve got a lot of scans there that&#39;re using conditions lik=
e<br>
<span><br>
&gt; =E2=94=82=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0-&gt;=C2=A0 Seq Scan on event__999999=
99 e_1 (cost=3D0.00..2527828.05 rows=3D11383021 width=3D778) (actual time=
=3D25522.389..747238.885 rows=3D42 loops=3D1)<br>
&gt; =E2=94=82=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0Filter: (((body =
-&gt;&gt; &#39;SID&#39;::text) IS NOT NULL) AND (validation_status_code =3D=
 &#39;P&#39;::bpchar))<br>
&gt; =E2=94=82=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0Rows Removed by =
Filter: 12172186<br>
<br>
</span>While I&#39;d expect the planner to be pretty solid on estimating th=
e<br>
validation_status_code condition, it&#39;s not going to have any idea about=
<br>
that JSON field test.=C2=A0 That&#39;s apparently very selective, but you&#=
39;re just<br>
getting a default estimate, which is not going to think that a NOT NULL<br>
test will exclude lots of rows.<br>
<br>
One thing you could consider doing about this is creating an index<br>
on (body -&gt;&gt; &#39;SID&#39;::text), which would prompt ANALYZE to gath=
er statistics<br>
about that expression.=C2=A0 Even if the index weren&#39;t actually used in=
 the<br>
plan, this might improve the estimates and the resulting planning choices<b=
r>
enough to make it worth maintaining such an index.<br>
<br>
Or you could think about pulling that field out and storing it on its own.<=
br>
JSON columns are great for storing random unstructured data, but they are<b=
r>
less great when you want to do relational-ish things on subfields.<br>
<br>
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 regards, tom lane<br>
</blockquote></div><br></div>
</div></div></blockquote></div><br></div>

--001a1147d64e49bba50559a14a25--