MIME-Version: 1.0
In-Reply-To: <31f372f2-f70e-782c-aa73-d211b3e333fc@pritambaral.com>
References: 
 <CABZYQRKnp=FxZ7tQeyytDjUOnHP9J90irxRBEAc+-XGbKdgf2A@mail.gmail.com>
 <ce8eaea2-3008-8cc1-fa39-129e9e82eaa2@pritambaral.com>
 <CABZYQRLx2eNn1N3bS=9p39ptcPRqv_GVVicA7pYODyqLLmrJig@mail.gmail.com>
 <31f372f2-f70e-782c-aa73-d211b3e333fc@pritambaral.com>
From: =?UTF-8?Q?Ulf_Lohbr=C3=BCgge?= <ulf.lohbruegge@gmail.com>
Date: Wed, 28 Jun 2017 16:25:15 +0200
Message-ID: 
 <CABZYQRLY8MOPnpbSFqUPn0Fm2cfJwVmRZ3WGa01kU-9LxRveRg@mail.gmail.com>
Subject: Re: Performance of information_schema with many schemata
 and tables
To: pgsql-performance@postgresql.org
Content-Type: multipart/alternative; boundary="001a11434e26eac1d7055305f3c5"
Precedence: bulk
Sender: pgsql-performance-owner@postgresql.org

--001a11434e26eac1d7055305f3c5
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

2017-06-28 10:43 GMT+02:00 Pritam Baral <pritam@pritambaral.com>:

>
>
> On Wednesday 28 June 2017 02:00 PM, Ulf Lohbr=C3=BCgge wrote:
> > Nope, I didn't try that yet. But I don't have the impression that
> reindexing the indexes in information_schema will help. The table
> information_schema.tables consists of the following indexes:
> >
> >     "pg_class_oid_index" UNIQUE, btree (oid)
> >     "pg_class_relname_nsp_index" UNIQUE, btree (relname, relnamespace)
> >     "pg_class_tblspc_relfilenode_index" btree (reltablespace,
> relfilenode)
>
> information_schema.tables is not a table, it's a view; at least on 9.5[0]=
.
> These indexes you list are actually indexes on the pg_catalog.pg_class
> table.
>

Yes, it's a view. \d+ information_schema.tables gives:

View definition:
 SELECT current_database()::information_schema.sql_identifier AS
table_catalog,
    nc.nspname::information_schema.sql_identifier AS table_schema,
    c.relname::information_schema.sql_identifier AS table_name,
        CASE
            WHEN nc.oid =3D pg_my_temp_schema() THEN 'LOCAL TEMPORARY'::tex=
t
            WHEN c.relkind =3D 'r'::"char" THEN 'BASE TABLE'::text
            WHEN c.relkind =3D 'v'::"char" THEN 'VIEW'::text
            WHEN c.relkind =3D 'f'::"char" THEN 'FOREIGN TABLE'::text
            ELSE NULL::text
        END::information_schema.character_data AS table_type,
    NULL::character varying::information_schema.sql_identifier AS
self_referencing_column_name,
    NULL::character varying::information_schema.character_data AS
reference_generation,
        CASE
            WHEN t.typname IS NOT NULL THEN current_database()
            ELSE NULL::name
        END::information_schema.sql_identifier AS user_defined_type_catalog=
,
    nt.nspname::information_schema.sql_identifier AS
user_defined_type_schema,
    t.typname::information_schema.sql_identifier AS user_defined_type_name,
        CASE
            WHEN c.relkind =3D 'r'::"char" OR (c.relkind =3D ANY
(ARRAY['v'::"char", 'f'::"char"])) AND
(pg_relation_is_updatable(c.oid::regclass, false) & 8) =3D 8 THEN 'YES'::te=
xt
            ELSE 'NO'::text
        END::information_schema.yes_or_no AS is_insertable_into,
        CASE
            WHEN t.typname IS NOT NULL THEN 'YES'::text
            ELSE 'NO'::text
        END::information_schema.yes_or_no AS is_typed,
    NULL::character varying::information_schema.character_data AS
commit_action
   FROM pg_namespace nc
     JOIN pg_class c ON nc.oid =3D c.relnamespace
     LEFT JOIN (pg_type t
     JOIN pg_namespace nt ON t.typnamespace =3D nt.oid) ON c.reloftype =3D =
t.oid
  WHERE (c.relkind =3D ANY (ARRAY['r'::"char", 'v'::"char", 'f'::"char"]))
AND NOT pg_is_other_temp_schema(nc.oid) AND (pg_has_role(c.relowner,
'USAGE'::text) OR has_table_privilege(c.oid, 'SELECT, INSERT, UPDATE,
DELETE, TRUNCATE, REFERENCES, TRIGGER'::text) OR
has_any_column_privilege(c.oid, 'SELECT, INSERT, UPDATE,
REFERENCES'::text));


>
> >
> > The costly sequence scan in question on pg_class happens with the
> following WHERE clause:
> >
> > WHERE (c.relkind =3D ANY (ARRAY['r'::"char", 'v'::"char", 'f'::"char"])=
)
> AND NOT pg_is_other_temp_schema(nc.oid) AND (pg_has_role(c.relowner,
> 'USAGE'::text) OR has_table_privilege(c.oid, 'SELECT, INSERT, UPDATE,
> DELETE, TRUNCATE, REFERENCES, TRIGGER'::text) OR has_any_column_privilege=
(c.oid,
> 'SELECT, INSERT, UPDATE, REFERENCES'::text));
>
> This is not the bottleneck WHERE clause the query plan from your first
> mail shows. That one is:
>
> ((relkind =3D ANY ('{r,v,f}'::"char"[])) AND (((relname)::information_
> schema.sql_identifier)::text =3D 'bar'::text) AND (pg_has_role(relowner,
> 'USAGE'::text) OR has_table_privilege(oid, 'SELECT, INSERT, UPDATE, DELET=
E,
> TRUNCATE, REFERENCES, TRIGGER'::text) OR has_any_column_privilege(oid,
> 'SELECT, INSERT, UPDATE, REFERENCES'::text)))
>

The part you copied is from the EXPLAIN ANALYZE output. The WHERE clause I
posted earlier (or see view definition) above does unfortunately not
contain the relname.


>
> I can say with certainty that an index on pg_catalog.pg_class.relname is
> going to speed this up. Postgres doesn't allow modifying system catalogs,
> but the `REINDEX SYSTEM <dbname>;` command should rebuild the system
> indexes and pg_catalog.pg_class.relname should be included in them (I
> tested on 9.6).
>
> Do try that once. If you still see sequential scans, check what indexes
> are present on pg_catalog.pg_class.
>

I just fired a 'REINDEX SYSTEM <dbname>;' but the output of EXPLAIN ANALYZE
is unchanged and the query duration did not change.

Best Regards,
Ulf


>
>
> >
> > Besides pg_class_oid_index none of the referenced columns is indexed. I
> tried to add an index on relowner but didn't succeed because the column i=
s
> used in the function call pg_has_role and the query is still forced to do=
 a
> sequence scan.
> >
> > Regards,
> > Ulf
> >
> > 2017-06-28 3:31 GMT+02:00 Pritam Baral <pritam@pritambaral.com <mailto:
> pritam@pritambaral.com>>:
> >
> >     On Wednesday 28 June 2017 05:27 AM, Ulf Lohbr=C3=BCgge wrote:
> >     > Hi all,
> >     >
> >     > we use schemata to separate our customers in a multi-tenant setup
> (9.5.7, Debian stable). Each tenant is managed in his own schema with all
> the tables that only he can access. All tables in all schemata are the sa=
me
> in terms of their DDL: Every tenant uses e.g. his own table 'address'. We
> currently manage around 1200 schemata (i.e. tenants) on one cluster. Ever=
y
> schema consists currently of ~200 tables - so we end up with ~240000 tabl=
es
> plus constraints, indexes, sequences et al.
> >     >
> >     > Our current approach is quite nice in terms of data privacy
> because every tenant is isolated from all other tenants. A tenant uses hi=
s
> own user that gives him only access to the corresponding schema.
> Performance is great for us - we didn't expect Postgres to scale so well!
> >     >
> >     > But performance is pretty bad when we query things in the
> information_schema:
> >     >
> >     > SELECT
> >     >   *
> >     > FROM information_schema.tables
> >     > WHERE table_schema =3D 'foo'
> >     > AND table_name =3D 'bar';``
> >     >
> >     > Above query results in a large sequence scan with a filter that
> removes 1305161 rows:
> >     >
> >     >
>
>                              QUERY PLAN
> >     > ------------------------------------------------------------
> ------------------------------------------------------------
> ------------------------------------------------------------
> ------------------------------------------------------------
> ------------------------------------------------------------
> -------------------------------------------------------
> >     >  Nested Loop Left Join  (cost=3D0.70..101170.18 rows=3D3 width=3D=
265)
> (actual time=3D383.505..383.505 rows=3D0 loops=3D1)
> >     >    ->  Nested Loop  (cost=3D0.00..101144.65 rows=3D3 width=3D141)
> (actual time=3D383.504..383.504 rows=3D0 loops=3D1)
> >     >          Join Filter: (nc.oid =3D c.relnamespace)
> >     >          ->  Seq Scan on pg_class c  (cost=3D0.00..101023.01
> rows=3D867 width=3D77) (actual time=3D383.502..383.502 rows=3D0 loops=3D1=
)
> >     >                Filter: ((relkind =3D ANY ('{r,v,f}'::"char"[])) A=
ND
> (((relname)::information_schema.sql_identifier)::text =3D 'bar'::text) AN=
D
> (pg_has_role(relowner, 'USAGE'::text) OR has_table_privilege(oid, 'SELECT=
,
> INSERT, UPDATE, DELETE, TRUNCATE, REFERENCES, TRIGGER'::text) OR
> has_any_column_privilege(oid, 'SELECT, INSERT, UPDATE, REFERENCES'::text)=
))
> >     >                Rows Removed by Filter: 1305161
> >     >          ->  Materialize  (cost=3D0.00..56.62 rows=3D5 width=3D68=
)
> (never executed)
> >     >                ->  Seq Scan on pg_namespace nc  (cost=3D0.00..56.=
60
> rows=3D5 width=3D68) (never executed)
> >     >                      Filter: ((NOT pg_is_other_temp_schema(oid))
> AND (((nspname)::information_schema.sql_identifier)::text =3D 'foo'::text=
))
> >     >    ->  Nested Loop  (cost=3D0.70..8.43 rows=3D1 width=3D132) (nev=
er
> executed)
> >     >          ->  Index Scan using pg_type_oid_index on pg_type t
> (cost=3D0.42..8.12 rows=3D1 width=3D72) (never executed)
> >     >                Index Cond: (c.reloftype =3D oid)
> >     >          ->  Index Scan using pg_namespace_oid_index on
> pg_namespace nt  (cost=3D0.28..0.30 rows=3D1 width=3D68) (never executed)
> >     >                Index Cond: (oid =3D t.typnamespace)
> >     >  Planning time: 0.624 ms
> >     >  Execution time: 383.784 ms
> >     > (16 rows)
> >     >
> >     > We noticed the degraded performance first when using the psql cli=
.
> Pressing tab after beginning a WHERE clause results in a query against th=
e
> information_schema which is pretty slow and ends in "lag" when trying to
> enter queries.
> >     >
> >     > We also use Flyway (https://flywaydb.org/) to handle our database
> migrations. Unfortunately Flyway is querying the information_schema to
> check if specific tables exist (I guess this is one of the reasons
> information_schema exists) and therefore vastly slows down the migration =
of
> our tenants. Our last migration run on all tenants (schemata) almost took
> 2h because the above query is executed multiple times per tenant. The
> migration run consisted of multiple sql files to be executed and triggere=
d
> more than 10 queries on information_schema per tenant.
> >     >
> >     > I don't think that Flyway is to blame because querying the
> information_schema should be a fast operation (and was fast for us when w=
e
> had less schemata). I tried to speedup querying pg_class by adding indexe=
s
> (after enabling allow_system_table_mods) but didn't succeed. The function
> call 'pg_has_role' is probably not easy to optimize.
> >     >
> >     > Postgres is really doing a great job to handle those many schemat=
a
> and tables but doesn't scale well when querying information_schema. I
> actually don't want to change my current multi-tenant setup (one schema p=
er
> tenant) as it is working great but the slow information_schema is killing
> our deployments.
> >     >
> >     > Are there any other options besides switching from
> one-schema-per-tenant-approach? Any help is greatly appreciated!
> >
> >     Have you tried a `REINDEX SYSTEM <dbname>`?
> >
> >     >
> >     > Regards,
> >     > Ulf
> >
> >     --
> >     #!/usr/bin/env regards
> >     Chhatoi Pritam Baral
> >
> >
> >
> >     --
> >     Sent via pgsql-performance mailing list (
> pgsql-performance@postgresql.org <mailto:pgsql-performance@postgresql.org
> >)
> >     To make changes to your subscription:
> >     http://www.postgresql.org/mailpref/pgsql-performance <
> http://www.postgresql.org/mailpref/pgsql-performance>
> >
> >
>
> [0]: https://www.postgresql.org/docs/9.5/static/infoschema-tables.html
>
> --
> #!/usr/bin/env regards
> Chhatoi Pritam Baral
>
>

--001a11434e26eac1d7055305f3c5
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><div class=3D"gmail_extra"><br><div class=3D"gmail_quo=
te">2017-06-28 10:43 GMT+02:00 Pritam Baral <span dir=3D"ltr">&lt;<a href=
=3D"mailto:pritam@pritambaral.com" target=3D"_blank">pritam@pritambaral.com=
</a>&gt;</span>:<br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0=
px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><span=
 class=3D"gmail-"><br>
<br>
On Wednesday 28 June 2017 02:00 PM, Ulf Lohbr=C3=BCgge wrote:<br>
&gt; Nope, I didn&#39;t try that yet. But I don&#39;t have the impression t=
hat reindexing the indexes in information_schema will help. The table infor=
mation_schema.tables consists of the following indexes:<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&quot;pg_class_oid_index&quot; UNIQUE, btree (oid)<=
br>
&gt;=C2=A0 =C2=A0 =C2=A0&quot;pg_class_relname_nsp_index&quot; UNIQUE, btre=
e (relname, relnamespace)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&quot;pg_class_tblspc_relfilenode_<wbr>index&quot; =
btree (reltablespace, relfilenode)<br>
<br>
</span>information_schema.tables is not a table, it&#39;s a view; at least =
on 9.5[0]. These indexes you list are actually indexes on the pg_catalog.pg=
_class table.<br></blockquote><div><br></div><div>Yes, it&#39;s a view. \d+=
 information_schema.tables gives:</div><div><br></div><div><div>View defini=
tion:</div><div>=C2=A0SELECT current_database()::information_schema.sql_ide=
ntifier AS table_catalog,</div><div>=C2=A0 =C2=A0 nc.nspname::information_s=
chema.sql_identifier AS table_schema,</div><div>=C2=A0 =C2=A0 c.relname::in=
formation_schema.sql_identifier AS table_name,</div><div>=C2=A0 =C2=A0 =C2=
=A0 =C2=A0 CASE</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 WHEN nc=
.oid =3D pg_my_temp_schema() THEN &#39;LOCAL TEMPORARY&#39;::text</div><div=
>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 WHEN c.relkind =3D &#39;r&#39;::=
&quot;char&quot; THEN &#39;BASE TABLE&#39;::text</div><div>=C2=A0 =C2=A0 =
=C2=A0 =C2=A0 =C2=A0 =C2=A0 WHEN c.relkind =3D &#39;v&#39;::&quot;char&quot=
; THEN &#39;VIEW&#39;::text</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 WHEN c.relkind =3D &#39;f&#39;::&quot;char&quot; THEN &#39;FOREIGN T=
ABLE&#39;::text</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 ELSE NU=
LL::text</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 END::information_schema.char=
acter_data AS table_type,</div><div>=C2=A0 =C2=A0 NULL::character varying::=
information_schema.sql_identifier AS self_referencing_column_name,</div><di=
v>=C2=A0 =C2=A0 NULL::character varying::information_schema.character_data =
AS reference_generation,</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 CASE</div><d=
iv>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 WHEN t.typname IS NOT NULL THE=
N current_database()</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 EL=
SE NULL::name</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 END::information_schema=
.sql_identifier AS user_defined_type_catalog,</div><div>=C2=A0 =C2=A0 nt.ns=
pname::information_schema.sql_identifier AS user_defined_type_schema,</div>=
<div>=C2=A0 =C2=A0 t.typname::information_schema.sql_identifier AS user_def=
ined_type_name,</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 CASE</div><div>=C2=A0=
 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 WHEN c.relkind =3D &#39;r&#39;::&quot;c=
har&quot; OR (c.relkind =3D ANY (ARRAY[&#39;v&#39;::&quot;char&quot;, &#39;=
f&#39;::&quot;char&quot;])) AND (pg_relation_is_updatable(c.oid::regclass, =
false) &amp; 8) =3D 8 THEN &#39;YES&#39;::text</div><div>=C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 ELSE &#39;NO&#39;::text</div><div>=C2=A0 =C2=A0 =
=C2=A0 =C2=A0 END::information_schema.yes_or_no AS is_insertable_into,</div=
><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 CASE</div><div>=C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 WHEN t.typname IS NOT NULL THEN &#39;YES&#39;::text</div>=
<div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 ELSE &#39;NO&#39;::text</div=
><div>=C2=A0 =C2=A0 =C2=A0 =C2=A0 END::information_schema.yes_or_no AS is_t=
yped,</div><div>=C2=A0 =C2=A0 NULL::character varying::information_schema.c=
haracter_data AS commit_action</div><div>=C2=A0 =C2=A0FROM pg_namespace nc<=
/div><div>=C2=A0 =C2=A0 =C2=A0JOIN pg_class c ON nc.oid =3D c.relnamespace<=
/div><div>=C2=A0 =C2=A0 =C2=A0LEFT JOIN (pg_type t</div><div>=C2=A0 =C2=A0 =
=C2=A0JOIN pg_namespace nt ON t.typnamespace =3D nt.oid) ON c.reloftype =3D=
 t.oid</div><div>=C2=A0 WHERE (c.relkind =3D ANY (ARRAY[&#39;r&#39;::&quot;=
char&quot;, &#39;v&#39;::&quot;char&quot;, &#39;f&#39;::&quot;char&quot;]))=
 AND NOT pg_is_other_temp_schema(nc.oid) AND (pg_has_role(c.relowner, &#39;=
USAGE&#39;::text) OR has_table_privilege(c.oid, &#39;SELECT, INSERT, UPDATE=
, DELETE, TRUNCATE, REFERENCES, TRIGGER&#39;::text) OR has_any_column_privi=
lege(c.oid, &#39;SELECT, INSERT, UPDATE, REFERENCES&#39;::text));</div></di=
v><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0p=
x 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<span class=3D"gmail-"><br>
&gt;<br>
&gt; The costly sequence scan in question on pg_class happens with the foll=
owing WHERE clause:<br>
&gt;<br>
&gt; WHERE (c.relkind =3D ANY (ARRAY[&#39;r&#39;::&quot;char&quot;, &#39;v&=
#39;::&quot;char&quot;, &#39;f&#39;::&quot;char&quot;])) AND NOT pg_is_othe=
r_temp_schema(nc.<wbr>oid) AND (pg_has_role(c.relowner, &#39;USAGE&#39;::te=
xt) OR has_table_privilege(c.oid, &#39;SELECT, INSERT, UPDATE, DELETE, TRUN=
CATE, REFERENCES, TRIGGER&#39;::text) OR has_any_column_privilege(c.<wbr>oi=
d, &#39;SELECT, INSERT, UPDATE, REFERENCES&#39;::text));<br>
<br>
</span>This is not the bottleneck WHERE clause the query plan from your fir=
st mail shows. That one is:<br>
<span class=3D"gmail-"><br>
((relkind =3D ANY (&#39;{r,v,f}&#39;::&quot;char&quot;[])) AND (((relname):=
:information_<br>
schema.sql_identifier)::text =3D &#39;bar&#39;::text) AND (pg_has_role(relo=
wner, &#39;USAGE&#39;::text) OR has_table_privilege(oid, &#39;SELECT, INSER=
T, UPDATE, DELETE, TRUNCATE, REFERENCES, TRIGGER&#39;::text) OR has_any_col=
umn_privilege(oid, &#39;SELECT, INSERT, UPDATE, REFERENCES&#39;::text)))<br=
></span></blockquote><div><br></div><div>The part you copied is from the EX=
PLAIN ANALYZE output. The WHERE clause I posted earlier (or see view defini=
tion) above does unfortunately not contain the relname.</div><div>=C2=A0</d=
iv><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bord=
er-left:1px solid rgb(204,204,204);padding-left:1ex"><span class=3D"gmail-"=
>
<br>
</span>I can say with certainty that an index on pg_catalog.pg_class.relnam=
e is going to speed this up. Postgres doesn&#39;t allow modifying system ca=
talogs, but the `REINDEX SYSTEM &lt;dbname&gt;;` command should rebuild the=
 system indexes and pg_catalog.pg_class.relname should be included in them =
(I tested on 9.6).<br>
<br>
Do try that once. If you still see sequential scans, check what indexes are=
 present on pg_catalog.pg_class.<br></blockquote><div><br></div><div>I just=
 fired a &#39;REINDEX SYSTEM &lt;dbname&gt;;&#39; but the output of EXPLAIN=
 ANALYZE is unchanged and the query duration did not change.</div><div><br>=
</div><div>Best Regards,</div><div>Ulf</div><div>=C2=A0</div><blockquote cl=
ass=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid=
 rgb(204,204,204);padding-left:1ex">
<span class=3D"gmail-"><br>
<br>
&gt;<br>
&gt; Besides pg_class_oid_index none of the referenced columns is indexed. =
I tried to add an index on relowner but didn&#39;t succeed because the colu=
mn is used in the function call pg_has_role and the query is still forced t=
o do a sequence scan.<br>
&gt;<br>
&gt; Regards,<br>
&gt; Ulf<br>
&gt;<br>
</span>&gt; 2017-06-28 3:31 GMT+02:00 Pritam Baral &lt;<a href=3D"mailto:pr=
itam@pritambaral.com">pritam@pritambaral.com</a> &lt;mailto:<a href=3D"mail=
to:pritam@pritambaral.com">pritam@pritambaral.com</a><wbr>&gt;&gt;:<br>
<div><div class=3D"gmail-h5">&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0On Wednesday 28 June 2017 05:27 AM, Ulf Lohbr=C3=BC=
gge wrote:<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Hi all,<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; we use schemata to separate our customers in a=
 multi-tenant setup (9.5.7, Debian stable). Each tenant is managed in his o=
wn schema with all the tables that only he can access. All tables in all sc=
hemata are the same in terms of their DDL: Every tenant uses e.g. his own t=
able &#39;address&#39;. We currently manage around 1200 schemata (i.e. tena=
nts) on one cluster. Every schema consists currently of ~200 tables - so we=
 end up with ~240000 tables plus constraints, indexes, sequences et al.<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Our current approach is quite nice in terms of=
 data privacy because every tenant is isolated from all other tenants. A te=
nant uses his own user that gives him only access to the corresponding sche=
ma. Performance is great for us - we didn&#39;t expect Postgres to scale so=
 well!<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; But performance is pretty bad when we query th=
ings in the information_schema:<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; SELECT<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0*<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; FROM information_schema.tables<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; WHERE table_schema =3D &#39;foo&#39;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; AND table_name =3D &#39;bar&#39;;``<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Above query results in a large sequence scan w=
ith a filter that removes 1305161 rows:<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =
=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0QUERY PLAN<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; ------------------------------<wbr>-----------=
-------------------<wbr>------------------------------<wbr>----------------=
--------------<wbr>------------------------------<wbr>---------------------=
---------<wbr>------------------------------<wbr>--------------------------=
----<wbr>------------------------------<wbr>------------------------------<=
wbr>------------------------------<wbr>-------------------------<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 Nested Loop Left Join=C2=A0 (cost=3D0.70=
..101170.18 rows=3D3 width=3D265) (actual time=3D383.505..383.505 rows=3D0 =
loops=3D1)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 -&gt;=C2=A0 Nested Loop=C2=A0 (co=
st=3D0.00..101144.65 rows=3D3 width=3D141) (actual time=3D383.504..383.504 =
rows=3D0 loops=3D1)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 Join Filter:=
 (nc.oid =3D c.relnamespace)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 -&gt;=C2=A0 =
Seq Scan on pg_class c=C2=A0 (cost=3D0.00..101023.01 rows=3D867 width=3D77)=
 (actual time=3D383.502..383.502 rows=3D0 loops=3D1)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 Filter: ((relkind =3D ANY (&#39;{r,v,f}&#39;::&quot;char&quot;[]=
)) AND (((relname)::information_<wbr>schema.sql_identifier)::text =3D &#39;=
bar&#39;::text) AND (pg_has_role(relowner, &#39;USAGE&#39;::text) OR has_ta=
ble_privilege(oid, &#39;SELECT, INSERT, UPDATE, DELETE, TRUNCATE, REFERENCE=
S, TRIGGER&#39;::text) OR has_any_column_privilege(oid, &#39;SELECT, INSERT=
, UPDATE, REFERENCES&#39;::text)))<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 Rows Removed by Filter: 1305161<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 -&gt;=C2=A0 =
Materialize=C2=A0 (cost=3D0.00..56.62 rows=3D5 width=3D68) (never executed)=
<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 -&gt;=C2=A0 Seq Scan on pg_namespace nc=C2=A0 (cost=3D0.00..56.6=
0 rows=3D5 width=3D68) (never executed)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 Filter: ((NOT pg_is_other_temp_schema(oid))=
 AND (((nspname)::information_<wbr>schema.sql_identifier)::text =3D &#39;fo=
o&#39;::text))<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 -&gt;=C2=A0 Nested Loop=C2=A0 (co=
st=3D0.70..8.43 rows=3D1 width=3D132) (never executed)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 -&gt;=C2=A0 =
Index Scan using pg_type_oid_index on pg_type t=C2=A0 (cost=3D0.42..8.12 ro=
ws=3D1 width=3D72) (never executed)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 Index Cond: (c.reloftype =3D oid)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 -&gt;=C2=A0 =
Index Scan using pg_namespace_oid_index on pg_namespace nt=C2=A0 (cost=3D0.=
28..0.30 rows=3D1 width=3D68) (never executed)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=
=A0 =C2=A0 Index Cond: (oid =3D t.typnamespace)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 Planning time: 0.624 ms<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;=C2=A0 Execution time: 383.784 ms<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; (16 rows)<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; We noticed the degraded performance first when=
 using the psql cli. Pressing tab after beginning a WHERE clause results in=
 a query against the information_schema which is pretty slow and ends in &q=
uot;lag&quot; when trying to enter queries.<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; We also use Flyway (<a href=3D"https://flywayd=
b.org/" rel=3D"noreferrer" target=3D"_blank">https://flywaydb.org/</a>) to =
handle our database migrations. Unfortunately Flyway is querying the inform=
ation_schema to check if specific tables exist (I guess this is one of the =
reasons information_schema exists) and therefore vastly slows down the migr=
ation of our tenants. Our last migration run on all tenants (schemata) almo=
st took 2h because the above query is executed multiple times per tenant. T=
he migration run consisted of multiple sql files to be executed and trigger=
ed more than 10 queries on information_schema per tenant.<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; I don&#39;t think that Flyway is to blame beca=
use querying the information_schema should be a fast operation (and was fas=
t for us when we had less schemata). I tried to speedup querying pg_class b=
y adding indexes (after enabling allow_system_table_mods) but didn&#39;t su=
cceed. The function call &#39;pg_has_role&#39; is probably not easy to opti=
mize.<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Postgres is really doing a great job to handle=
 those many schemata and tables but doesn&#39;t scale well when querying in=
formation_schema. I actually don&#39;t want to change my current multi-tena=
nt setup (one schema per tenant) as it is working great but the slow inform=
ation_schema is killing our deployments.<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Are there any other options besides switching =
from one-schema-per-tenant-<wbr>approach? Any help is greatly appreciated!<=
br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0Have you tried a `REINDEX SYSTEM &lt;dbname&gt;`?<b=
r>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Regards,<br>
&gt;=C2=A0 =C2=A0 =C2=A0&gt; Ulf<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0--<br>
&gt;=C2=A0 =C2=A0 =C2=A0#!/usr/bin/env regards<br>
&gt;=C2=A0 =C2=A0 =C2=A0Chhatoi Pritam Baral<br>
&gt;<br>
&gt;<br>
&gt;<br>
&gt;=C2=A0 =C2=A0 =C2=A0--<br>
</div></div>&gt;=C2=A0 =C2=A0 =C2=A0Sent via pgsql-performance mailing list=
 (<a href=3D"mailto:pgsql-performance@postgresql.org">pgsql-performance@pos=
tgresql.<wbr>org</a> &lt;mailto:<a href=3D"mailto:pgsql-performance@postgre=
sql.org">pgsql-performance@<wbr>postgresql.org</a>&gt;)<br>
<span class=3D"gmail-">&gt;=C2=A0 =C2=A0 =C2=A0To make changes to your subs=
cription:<br>
</span>&gt;=C2=A0 =C2=A0 =C2=A0<a href=3D"http://www.postgresql.org/mailpre=
f/pgsql-performance" rel=3D"noreferrer" target=3D"_blank">http://www.postgr=
esql.org/<wbr>mailpref/pgsql-performance</a> &lt;<a href=3D"http://www.post=
gresql.org/mailpref/pgsql-performance" rel=3D"noreferrer" target=3D"_blank"=
>http://www.postgresql.org/<wbr>mailpref/pgsql-performance</a>&gt;<br>
&gt;<br>
&gt;<br>
<br>
[0]: <a href=3D"https://www.postgresql.org/docs/9.5/static/infoschema-table=
s.html" rel=3D"noreferrer" target=3D"_blank">https://www.postgresql.org/<wb=
r>docs/9.5/static/infoschema-<wbr>tables.html</a><br>
<div class=3D"gmail-HOEnZb"><div class=3D"gmail-h5"><br>
--<br>
#!/usr/bin/env regards<br>
Chhatoi Pritam Baral<br>
<br>
</div></div></blockquote></div><br></div></div>

--001a11434e26eac1d7055305f3c5--