MIME-Version: 1.0
In-Reply-To: <3f3b6180-4c67-7b17-601e-1fb0ad16fb17@postgrespro.ru>
References: 
 <CAEhK25pHh7nrGUyQYwqCinrJ=6PXRgHo5DuzGXxgO-DvY8_Yqg@mail.gmail.com>
 <3f3b6180-4c67-7b17-601e-1fb0ad16fb17@postgrespro.ru>
From: Rowan Seymour <rowanseymour@gmail.com>
Date: Thu, 16 Jun 2016 10:29:49 +0200
Message-ID: 
 <CAEhK25qaCKdiNQp0i1twjGtj6qN0zSmOD_+s9qzhZpjV1YZB4g@mail.gmail.com>
Subject: Re: Many-to-many performance problem
To: Alex Ignatov <a.ignatov@postgrespro.ru>
Cc: pgsql-performance@postgresql.org
Content-Type: multipart/alternative; boundary=001a1135dbb2c071970535610bc5
Precedence: bulk
Sender: pgsql-performance-owner@postgresql.org

--001a1135dbb2c071970535610bc5
Content-Type: text/plain; charset=UTF-8

When you create a Postgres RDS instance, it comes with a
"default.postgres9.3" parameter group which contains substitutions based on
the server size. The defaults for the memory-related settings are:

effective_cache_size = {DBInstanceClassMemory/16384}
maintenance_work_mem = GREATEST({DBInstanceClassMemory/63963136*1024},65536)
shared_buffers = {DBInstanceClassMemory/32768}
temp_buffers = <not set>
work_mem = <not set>

According to
http://www.davidmkerr.com/2013/11/tune-your-postgres-rds-instance-via.html,
the units for effective_cache_size on AWS RDS are 8kB blocks (am not sure
why this is...), so DBInstanceClassMemory/16384 = DBInstanceClassMemory/(2
* 8kB) = 50% of system memory.
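
To sanity-check that arithmetic, here is a rough Python sketch. It assumes
DBInstanceClassMemory is a byte count and that both settings are counted in
8kB blocks, as that post suggests. The function name and the 16 GiB instance
size are made up for illustration and are not taken from our server.

```python
# Assumption: values without explicit units are counted in 8kB blocks.
BLOCK_SIZE = 8 * 1024


def rds_memory_defaults(instance_memory_bytes):
    """Translate the parameter-group formulas into bytes (hypothetical helper)."""
    # effective_cache_size = {DBInstanceClassMemory/16384}, in blocks
    effective_cache_blocks = instance_memory_bytes // 16384
    # shared_buffers = {DBInstanceClassMemory/32768}, in blocks
    shared_buffers_blocks = instance_memory_bytes // 32768
    return {
        "effective_cache_size_bytes": effective_cache_blocks * BLOCK_SIZE,
        "shared_buffers_bytes": shared_buffers_blocks * BLOCK_SIZE,
    }


if __name__ == "__main__":
    memory = 16 * 1024**3  # illustrative 16 GiB instance, not our real size
    for name, value in rds_memory_defaults(memory).items():
        print(f"{name}: {value / memory:.0%} of instance memory")
```

Under those assumptions effective_cache_size works out to half of the
instance memory and shared_buffers to a quarter, whatever the instance size.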

We upgraded the server over the weekend, which doubled the system memory and
increased the available IOPS. That appears to have greatly improved the
situation, but there have still been a few timeouts. I'm now wondering whether
activity on the other database in this instance occasionally pushes
our indexes out of memory.

Thanks, Rowan

On 10 June 2016 at 18:11, Alex Ignatov <a.ignatov@postgrespro.ru> wrote:

>
> On 10.06.2016 16:04, Rowan Seymour wrote:
>
> In our Django app we have messages (currently about 7 million in table
> msgs_message) and labels (about 300), and a join table to associate
> messages with labels (about 500,000 in msgs_message_labels). Not sure
> you'll need them, but here are the relevant table schemas:
>
> CREATE TABLE msgs_message
> (
>     id INTEGER PRIMARY KEY NOT NULL,
>     type VARCHAR NOT NULL,
>     text TEXT NOT NULL,
>     is_archived BOOLEAN NOT NULL,
>     created_on TIMESTAMP WITH TIME ZONE NOT NULL,
>     contact_id INTEGER NOT NULL,
>     org_id INTEGER NOT NULL,
>     case_id INTEGER,
>     backend_id INTEGER NOT NULL,
>     is_handled BOOLEAN NOT NULL,
>     is_flagged BOOLEAN NOT NULL,
>     is_active BOOLEAN NOT NULL,
>     has_labels BOOLEAN NOT NULL,
>     CONSTRAINT msgs_message_contact_id_5c8e3f216c115643_fk_contacts_contact_id
>         FOREIGN KEY (contact_id) REFERENCES contacts_contact (id),
>     CONSTRAINT msgs_message_org_id_81a0adfcc99151d_fk_orgs_org_id
>         FOREIGN KEY (org_id) REFERENCES orgs_org (id),
>     CONSTRAINT msgs_message_case_id_51998150f9629c_fk_cases_case_id
>         FOREIGN KEY (case_id) REFERENCES cases_case (id)
> );
> CREATE UNIQUE INDEX msgs_message_backend_id_key ON msgs_message (backend_id);
> CREATE INDEX msgs_message_6d82f13d ON msgs_message (contact_id);
> CREATE INDEX msgs_message_9cf869aa ON msgs_message (org_id);
> CREATE INDEX msgs_message_7f12ca67 ON msgs_message (case_id);
>
> CREATE TABLE msgs_message_labels
> (
>     id INTEGER PRIMARY KEY NOT NULL,
>     message_id INTEGER NOT NULL,
>     label_id INTEGER NOT NULL,
>     CONSTRAINT msgs_message_lab_message_id_1dfa44628fe448dd_fk_msgs_message_id
>         FOREIGN KEY (message_id) REFERENCES msgs_message (id),
>     CONSTRAINT msgs_message_labels_label_id_77cbdebd8d255b7a_fk_msgs_label_id
>         FOREIGN KEY (label_id) REFERENCES msgs_label (id)
> );
> CREATE UNIQUE INDEX msgs_message_labels_message_id_label_id_key
>     ON msgs_message_labels (message_id, label_id);
> CREATE INDEX msgs_message_labels_4ccaa172 ON msgs_message_labels (message_id);
> CREATE INDEX msgs_message_labels_abec2aca ON msgs_message_labels (label_id);
>
> Users can search for messages, and they are returned page by page in
> reverse chronological order. There are several partial multi-column indexes
> on the message table, but the one used for the example queries below is
>
> CREATE INDEX msgs_inbox ON msgs_message(org_id, created_on DESC)
> WHERE is_active = TRUE AND is_handled = TRUE AND is_archived = FALSE AND
> has_labels = TRUE;
>
> So a typical query for the latest page of messages looks like (
> https://explain.depesz.com/s/G9ew):
>
> SELECT "msgs_message".*
> FROM "msgs_message"
> WHERE ("msgs_message"."org_id" = 7
>     AND "msgs_message"."is_active" = true
>     AND "msgs_message"."is_handled" = true
>     AND "msgs_message"."has_labels" = true
>     AND "msgs_message"."is_archived" = false
>     AND "msgs_message"."created_on" < '2016-06-10T07:11:06.381000+00:00'::timestamptz
> ) ORDER BY "msgs_message"."created_on" DESC LIMIT 50
>
> But users can also search for messages that have one or more labels,
> leading to queries that look like:
>
> SELECT DISTINCT "msgs_message".*
> FROM "msgs_message"
> INNER JOIN "msgs_message_labels" ON ( "msgs_message"."id" =
> "msgs_message_labels"."message_id" )
> WHERE ("msgs_message"."org_id" = 7
>     AND "msgs_message"."is_active" = true
>     AND "msgs_message"."is_handled" = true
>     AND "msgs_message_labels"."label_id" IN (127, 128, 135, 136, 137, 138,
> 140, 141, 143, 144)
>     AND "msgs_message"."has_labels" = true
>     AND "msgs_message"."is_archived" = false
>     AND "msgs_message"."created_on" < '2016-06-10T07:11:06.381000+00:00'::timestamptz
> ) ORDER BY "msgs_message"."created_on" DESC LIMIT 50
>
> Most of the time, this query performs like
> https://explain.depesz.com/s/ksOC (~15ms). It's no longer using the
> msgs_inbox index, but it's plenty fast. However, sometimes it performs
> like https://explain.depesz.com/s/81c (67000ms).
>
> And if you run it again, it'll be fast again. Am I correct in interpreting
> that second explain as being slow because msgs_message_pkey isn't cached?
> It looks like it read from that index 3556 times, and each time took 18.559
> (?) ms, and that adds up to 65,996ms. The database server says it has lots
> of free memory so is there something I should be doing to keep that index
> in memory?
>
> Generally speaking, is there a good strategy for optimising queries like
> these which involve two tables?
>
>    - I tried moving the label references into an int array on
>    msgs_message, and then using btree_gin to create a multi-column index
>    involving the array column, but that doesn't appear to be very useful for
>    these ordered queries because it's not an ordered index.
>    - I tried adding created_on to msgs_message_labels table but I
>    couldn't find a way of avoiding the in-memory sort.
>    - Have thought about dynamically creating partial indexes for each
>    label using an array column on msgs_message to hold label ids, and index
>    condition like WHERE label_ids && ARRAY[123] but not sure what other
>    problems I'll run into with hundreds of indexes on the same table.
>
> Server is an Amazon RDS instance with default settings and Postgres
> 9.3.10, with one other database in the instance.
>
> All advice very much appreciated, thanks
>
> --
> *Rowan Seymour* | +260 964153686
>
> Hello! What do you mean by
> "Server is an Amazon RDS instance with default settings and Postgres
> 9.3.10, with one other database in the instance."
> Is PG running with the default config or something else?
> Is it the default config as compiled in? If so, you should definitely
> do some tuning on it.
> Looking at the plan, I saw a lot of disk reads. That can be linked to the
> small amount of shared memory dedicated to PG, exactly as Tom said.
> Can you share the PG config, or raise, for example, the shared_buffers
> parameter?
>
>
> Alex Ignatov
> Postgres Professional: http://www.postgrespro.com
> The Russian Postgres Company
>
>
>
>


-- 
*Rowan Seymour* | +260 964153686 | @rowanseymour

--001a1135dbb2c071970535610bc5--