From: Arnold Morein <arnie.morein@mac.com>
Message-Id: <DA499CAE-DC7D-43CF-A885-5CD998DB8C97@mac.com>
Content-Type: multipart/alternative;
	boundary="Apple-Mail=_E9BEF892-949C-458B-8756-73FFCC98896B"
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3826.200.121\))
Subject: Re: Need help in database design
Date: Mon, 23 Dec 2024 12:34:52 -0600
In-Reply-To: <CAHesJ5Kn4bm6DmEVtEmHc1itXovdqNo8iE4kiDunH3gYK5jF4A@mail.gmail.com>
Cc: Ron Johnson <ronljohnsonjr@gmail.com>,
 pgsql-general <pgsql-general@postgresql.org>
To: Divyansh Gupta JNsThMAudy <ag1567827@gmail.com>
References: <CAHesJ5LES3aTDf=xp7NOwrADQ_HWC-Spsv7yLu9ZY+zxzZO53A@mail.gmail.com>
 <b450927c-49da-46e5-ad74-bf38ceff166b@aklaver.com>
 <CAHesJ5+ASNoSNMiC5Ms0Ts=gw7v2_UeBpUT=phujO4yE_XCbEw@mail.gmail.com>
 <CAHesJ5JbkCBZ2f_AvUr8+KWnGPAsobu4zyfnWm8bEeb7X9oqDQ@mail.gmail.com>
 <06e1f1ee-74b2-43a2-9a63-da20ae455ae2@aklaver.com>
 <CAHesJ5JLzhHiGSBSkJZ7x7rGgHeeByP=wWk1D5GG=x8cJ5YY6Q@mail.gmail.com>
 <CAKFQuwYdpzwcbSdQ8TvZ-nVjPeHVVz+5=bWofCbUK+p_o=axrQ@mail.gmail.com>
 <CAHesJ5+yTenkAxOT8H33Cfe=1b2kSyXGqxFYfYz5fgYAVVvFmw@mail.gmail.com>
 <CAHesJ5KaJ8p7QhB9UUoFEbA87cU7ke4GBMkKR3q2FJPVv9GXyw@mail.gmail.com>
 <CANzqJaB_s8eXCZJvYO9CLvgJNqrshD=G5GgECi1M9=vk-JHjdQ@mail.gmail.com>
 <CAHesJ5LgLi9-uGCk3J9TUkuyttysz3fzTaP+o57EjcBtwDYKZA@mail.gmail.com>
 <CANzqJaD-MwXzvg97q0iLvAdkf=DnUMOq0Ex2_eNU7sTxEL7bfA@mail.gmail.com>
 <CAHesJ5KtKm9fjhMdR1+cC-M5jW98Sz6sWKbt0mN6SJcfkq9eig@mail.gmail.com>
 <CAHesJ5Kne6MZakdhcQ9Zc-5KhBvhgUt+zUXuX5v6z+zwTY6gLQ@mail.gmail.com>
 <CANzqJaDb901c=fbicfuzXu1kvf1OR+6rtRZvtHOPyGaOD2E99Q@mail.gmail.com>
 <CAHesJ5LbFjqN0NndYUHKsXX_JssgwM-VPPaYoi28kZvsciFifQ@mail.gmail.com>
 <CAHesJ5Kn4bm6DmEVtEmHc1itXovdqNo8iE4kiDunH3gYK5jF4A@mail.gmail.com>
Archived-At: <https://www.postgresql.org/message-id/DA499CAE-DC7D-43CF-A885-5CD998DB8C97%40mac.com>
Precedence: bulk


--Apple-Mail=_E9BEF892-949C-458B-8756-73FFCC98896B
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain;
	charset=utf-8

I would like to make a suggestion, if I may. Granted, I do not =
understand the underlying task at hand, but:


A table with multiple columns of the same type smacks of designs that =
harken back to the days of mainframes. (STOP THAT!) The data described =
is a non-normalized array of integers that is meaningless outside of =
code. Table structures should be at least a LITTLE self-descriptive.

It is also not flexible (what if you suddenly need t51? how long would =
that table space adjustment take in production?) and space is wasted if =
not all 50 columns are populated.

Use a design that is basically a storage area for name/value pairs:

create table dbo.googledocs_tbl (
    id long identity primary key, =E2=80=94 easy way to access a single =
record
    owner_id integer/long not null, =E2=80=94 fk to owning parent record =
in other table such as user
    owner_type char(2), =E2=80=94 optional field, identifies the owing =
table, makes this table even more generic
    property_name varchar(n) not null, =E2=80=94 required unique name =
for property, not an array reference (t1, t4, t50)
    =E2=80=94 the names are controlled by the developer but should be =
human interpretable which can then be used in queries
    property_value int4 not null =E2=80=94 the important value in =
question
);

The fields owner_id, owner_type, property_name become a tertiary key =
that can never be changed, are unique and easily accessible via index =
lookup.

Add a timestamp if need be

You could then partition the record by owner_type or owner_id or =
whatever else comes to mind.

Then you just have to figure out the best way to index this monster for =
optimized queries.


> On Dec 23, 2024, at 12:31=E2=80=AFPM, Divyansh Gupta JNsThMAudy =
<ag1567827@gmail.com> wrote:
>=20
> Currently I haven't created those columns , I have created addons_json =
column which is a JSONB column yet in a discussion weather I should =
create or consider only one JSONB column.
>=20
>=20
> On Tue, 24 Dec 2024, 12:00=E2=80=AFam Divyansh Gupta JNsThMAudy, =
<ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>> Range partition can help when you applies filter for a specific range =
but in my case I need to apply filter on userid always, however I have =
date columns but there is less variation in timestamp which I have =
that's why didn't go for range partition.
>>=20
>>=20
>> On Mon, 23 Dec 2024, 11:57=E2=80=AFpm Ron Johnson, =
<ronljohnsonjr@gmail.com <mailto:ronljohnsonjr@gmail.com>> wrote:
>>>=20
>>> 1. I bet you'd get better performance using RANGE partitioning.
>>> 2. Twenty million rows per userid is a LOT.  No subdivisions (like =
date range)?
>>>=20
>>> On Mon, Dec 23, 2024 at 1:23=E2=80=AFPM Divyansh Gupta JNsThMAudy =
<ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>>>> Adrian, Please check this out;
>>>>=20
>>>> PARTITION BY HASH (userid);
>>>> CREATE TABLE dbo.googledocs_tbl_clone_part_0 PARTITION OF =
dbo.googledocs_tbl_clone  FOR VALUES WITH (modulus 84, remainder 0);
>>>> ...
>>>> CREATE TABLE dbo.googledocs_tbl_clone_part_83 PARTITION OF =
dbo.googledocs_tbl_clone  FOR VALUES WITH (modulus 84, remainder 83);
>>>>=20
>>>>=20
>>>>=20
>>>> On Mon, Dec 23, 2024 at 11:48=E2=80=AFPM Divyansh Gupta JNsThMAudy =
<ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>>>>> Adrian, the partition is on userid using hash partition with 84 =
partitions
>>>>>=20
>>>>> Ron, there could be more than 20 Million records possible for a =
single userid in that case if I create index on userid only not on other =
column the query is taking more than 30 seconds to return the results.
>>>>>=20
>>>>>=20
>>>>> On Mon, 23 Dec 2024, 11:40=E2=80=AFpm Ron Johnson, =
<ronljohnsonjr@gmail.com <mailto:ronljohnsonjr@gmail.com>> wrote:
>>>>>> If your queries all reference userid, then you only need indices =
on gdid and userid.
>>>>>>=20
>>>>>> On Mon, Dec 23, 2024 at 12:49=E2=80=AFPM Divyansh Gupta =
JNsThMAudy <ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>>>>>>> I have one confusion with this design if I opt to create 50 =
columns I need to create 50 index which will work with userid index in =
Bitmap on the other hand if I create a JSONB column I need to create a =
single index ?
>>>>>>>=20
>>>>>>>=20
>>>>>>> On Mon, 23 Dec 2024, 11:10=E2=80=AFpm Ron Johnson, =
<ronljohnsonjr@gmail.com <mailto:ronljohnsonjr@gmail.com>> wrote:
>>>>>>>> Given what you just wrote, I'd stick with 50 separate t* =
columns.  Simplifies queries, simplifies updates, and eliminates JSONB =
conversions.
>>>>>>>>=20
>>>>>>>> On Mon, Dec 23, 2024 at 12:29=E2=80=AFPM Divyansh Gupta =
JNsThMAudy <ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>>>>>>>>> Values can be updated based on customer actions
>>>>>>>>>=20
>>>>>>>>> All rows won't have all 50 key value pairs always if I make =
those keys into columns the rows might have null value on the other hand =
if it is JSONB then the key value pair will not be there
>>>>>>>>>=20
>>>>>>>>> Yes in UI customers can search for the key value pairs
>>>>>>>>>=20
>>>>>>>>> During data population the key value pair will be empty array =
in case of JSONB column or NULL in case of table columns, later when =
customer performs some actions that time the key value pairs will =
populate and update, based on what action customer performs.
>>>>>>>>>=20
>>>>>>>>>=20
>>>>>>>>> On Mon, 23 Dec 2024, 10:51=E2=80=AFpm Divyansh Gupta =
JNsThMAudy, <ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>>>>>>>>>> Let's make it more understandable, here is the table schema =
with 50 columns in it=20
>>>>>>>>>>=20
>>>>>>>>>> CREATE TABLE dbo.googledocs_tbl (
>>>>>>>>>> gdid int8 GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 =
MINVALUE 1 MAXVALUE 9223372036854775807 START 1 CACHE 1 NO CYCLE) NOT =
NULL,
>>>>>>>>>> userid int8 NOT NULL,
>>>>>>>>>> t1 int4 NULL,
>>>>>>>>>> t2 int4 NULL,
>>>>>>>>>> t3 int4 NULL,
>>>>>>>>>> t4 int4 NULL,
>>>>>>>>>> t5 int4 NULL,
>>>>>>>>>> t6 int4 NULL,
>>>>>>>>>> t7 int4 NULL,
>>>>>>>>>> t8 int4 NULL,
>>>>>>>>>> t9 int4 NULL,
>>>>>>>>>> t10 int4 NULL,
>>>>>>>>>> t11 int4 NULL,
>>>>>>>>>> t12 int4 NULL,
>>>>>>>>>> t13 int4 NULL,
>>>>>>>>>> t14 int4 NULL,
>>>>>>>>>> t15 int4 NULL,
>>>>>>>>>> t16 int4 NULL,
>>>>>>>>>> t17 int4 NULL,
>>>>>>>>>> t18 int4 NULL,
>>>>>>>>>> t19 int4 NULL,
>>>>>>>>>> t20 int4 NULL,
>>>>>>>>>> t21 int4 NULL,
>>>>>>>>>> t22 int4 NULL,
>>>>>>>>>> t23 int4 NULL,
>>>>>>>>>> t24 int4 NULL,
>>>>>>>>>> t25 int4 NULL,
>>>>>>>>>> t26 int4 NULL,
>>>>>>>>>> t27 int4 NULL,
>>>>>>>>>> t28 int4 NULL,
>>>>>>>>>> t29 int4 NULL,
>>>>>>>>>> t30 int4 NULL,
>>>>>>>>>> t31 int4 NULL,
>>>>>>>>>> t32 int4 NULL,
>>>>>>>>>> t33 int4 NULL,
>>>>>>>>>> t34 int4 NULL,
>>>>>>>>>> t35 int4 NULL,
>>>>>>>>>> t36 int4 NULL,
>>>>>>>>>> t37 int4 NULL,
>>>>>>>>>> t38 int4 NULL,
>>>>>>>>>> t39 int4 NULL,
>>>>>>>>>> t40 int4 NULL,
>>>>>>>>>> t41 int4 NULL,
>>>>>>>>>> t42 int4 NULL,
>>>>>>>>>> t43 int4 NULL,
>>>>>>>>>> t44 int4 NULL,
>>>>>>>>>> t45 int4 NULL,
>>>>>>>>>> t46 int4 NULL,
>>>>>>>>>> t47 int4 NULL,
>>>>>>>>>> t48 int4 NULL,
>>>>>>>>>> t49 int4 NULL,
>>>>>>>>>> t50 int4 NULL,
>>>>>>>>>> CONSTRAINT googledocs_tbl_pkey PRIMARY KEY (gdid),
>>>>>>>>>> );
>>>>>>>>>>=20
>>>>>>>>>> Every time when i query I will query it along with userid=20
>>>>>>>>>> Ex : where userid =3D 12345678 and t1 in (1,2,3) and t2 in =
(0,1,2)
>>>>>>>>>> more key filters if customer applies=20
>>>>>>>>>>=20
>>>>>>>>>> On the other hand if I create a single jsonb column the =
schema will look like :
>>>>>>>>>>=20
>>>>>>>>>> CREATE TABLE dbo.googledocs_tbl (
>>>>>>>>>> gdid int8 GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 =
MINVALUE 1 MAXVALUE 9223372036854775807 START 1 CACHE 1 NO CYCLE) NOT =
NULL,
>>>>>>>>>> userid int8 NOT NULL,
>>>>>>>>>> addons_json jsonb default '{}'::jsonb
>>>>>>>>>> CONSTRAINT googledocs_tbl_pkey PRIMARY KEY (gdid),
>>>>>>>>>> );
>>>>>>>>>>=20
>>>>>>>>>> and the query would be like=20
>>>>>>>>>> where userid =3D 12345678 and ((addons_json @> {t1:1}) or  =
(addons_json @> {t1:2}) or=C2=A0 <> (addons_json @> {t1:3})
>>>>>>>>>> more key filters if customer applies=C2=A0
>>>>>>>>>>=20
>>>>>>>>>>=20
>>>>>>>>>>  <>
>>>>>>>>>> On Mon, Dec 23, 2024 at 10:38=E2=80=AFPM David G. Johnston =
<david.g.johnston@gmail.com <mailto:david.g.johnston@gmail.com>> wrote:
>>>>>>>>>>>=20
>>>>>>>>>>>=20
>>>>>>>>>>> On Mon, Dec 23, 2024, 10:01 Divyansh Gupta JNsThMAudy =
<ag1567827@gmail.com <mailto:ag1567827@gmail.com>> wrote:
>>>>>>>>>>>>=20
>>>>>>>>>>>> So here my question is considering one JSONB column is =
perfect or considering 50 columns will be more optimised.
>>>>>>>>>>>>=20
>>>>>>>>>>>=20
>>>>>>>>>>> The relational database engine is designed around the =
column-based approach.  Especially if the columns are generally =
unchanging, combined with using fixed-width data types.
>>>>>>>>>>>=20
>>>>>>>>>>> David J.
>>>>>>>>>>>=20
>>>>>>>>=20
>>>>>>>>=20
>>>>>>>>=20
>>>>>>>> --
>>>>>>>> Death to <Redacted>, and butter sauce.
>>>>>>>> Don't boil me, I'm still alive.
>>>>>>>> <Redacted> lobster!
>>>>>>=20
>>>>>>=20
>>>>>>=20
>>>>>> --
>>>>>> Death to <Redacted>, and butter sauce.
>>>>>> Don't boil me, I'm still alive.
>>>>>> <Redacted> lobster!
>>>=20
>>>=20
>>>=20
>>> --
>>> Death to <Redacted>, and butter sauce.
>>> Don't boil me, I'm still alive.
>>> <Redacted> lobster!


--Apple-Mail=_E9BEF892-949C-458B-8756-73FFCC98896B
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html;
	charset=utf-8

<html><head><meta http-equiv=3D"content-type" content=3D"text/html; =
charset=3Dutf-8"></head><body style=3D"overflow-wrap: break-word; =
-webkit-nbsp-mode: space; line-break: after-white-space;"><span =
style=3D"caret-color: rgb(0, 0, 0);"><font face=3D"Arial" =
color=3D"#000000">I would like to make a suggestion, if I may. Granted, =
I do not understand the underlying task at hand, but:</font></span><div =
style=3D"caret-color: rgb(0, 0, 0);"><font face=3D"Arial" =
color=3D"#000000"><br style=3D"caret-color: rgb(255, 255, =
255);"></font><div style=3D"caret-color: rgb(255, 255, 255);"><font =
face=3D"Arial" color=3D"#000000"><br></font></div><div =
style=3D"caret-color: rgb(255, 255, 255);"><font face=3D"Arial" =
color=3D"#000000">A table with multiple columns of the same type smacks =
of designs that harken back to the days of mainframes. (STOP THAT!) The =
data described is a non-normalized array of integers that is meaningless =
outside of code. Table structures should be at least a LITTLE =
self-descriptive.</font></div><div style=3D"caret-color: rgb(255, 255, =
255);"><font face=3D"Arial" color=3D"#000000"><br></font></div><div =
style=3D"caret-color: rgb(255, 255, 255);"><font face=3D"Arial" =
color=3D"#000000">It is also not flexible (what if you suddenly need =
t51? how long would that table space adjustment take in production?) and =
space is wasted if not all 50 columns are populated.</font></div><div =
style=3D"caret-color: rgb(255, 255, 255);"><font face=3D"Arial" =
color=3D"#000000"><br></font></div><div style=3D"caret-color: rgb(255, =
255, 255);"><font face=3D"Arial" color=3D"#000000">Use a design that is =
basically a storage area for name/value pairs:<br></font><div><font =
face=3D"Arial" color=3D"#000000"><br></font></div><div><font =
color=3D"#000000" face=3D"Courier New">create table dbo.googledocs_tbl =
(</font></div><div><font color=3D"#000000" face=3D"Courier New">&nbsp; =
&nbsp; id long identity primary key, =E2=80=94 easy way to access a =
single record</font></div><div><font color=3D"#000000" face=3D"Courier =
New"><span style=3D"caret-color: rgb(0, 0, 0);">&nbsp; =
&nbsp;</span><span style=3D"caret-color: rgb(0, 0, =
0);">&nbsp;</span>owner_id integer/long not null, =E2=80=94 fk to owning =
parent record in other table such as user</font></div><div><font =
color=3D"#000000" face=3D"Courier New"><span style=3D"caret-color: =
rgb(0, 0, 0);">&nbsp; &nbsp;</span><span style=3D"caret-color: rgb(0, 0, =
0);">&nbsp;</span>owner_type char(2), =E2=80=94 optional field, =
identifies the owing table, makes this table even more =
generic</font></div><div><font color=3D"#000000" face=3D"Courier =
New"><span style=3D"caret-color: rgb(0, 0, 0);">&nbsp; =
&nbsp;</span><span style=3D"caret-color: rgb(0, 0, =
0);">&nbsp;</span>property_name varchar(n) not null, =E2=80=94 required =
unique name for property, not an array reference (t1, t4, =
t50)</font></div><div><font color=3D"#000000" face=3D"Courier =
New">&nbsp; &nbsp; =E2=80=94 the names are controlled by the developer =
but should be human interpretable which can then be used in =
queries</font></div><div><font color=3D"#000000" face=3D"Courier =
New"><span style=3D"caret-color: rgb(0, 0, 0);">&nbsp; =
&nbsp;</span><span style=3D"caret-color: rgb(0, 0, =
0);">&nbsp;</span>property_value int4 not null =E2=80=94 the important =
value in question</font></div><div><font color=3D"#000000" face=3D"Courier=
 New">);</font></div><div><font face=3D"Arial" =
color=3D"#000000"><br></font></div><div><font face=3D"Arial" =
color=3D"#000000">The fields owner_id, owner_type, property_name become =
a tertiary key that can never be changed, are unique and easily =
accessible via index lookup.</font></div><div><font face=3D"Arial" =
color=3D"#000000"><br></font></div><div><font face=3D"Arial" =
color=3D"#000000">Add a timestamp if need be</font></div><div><font =
face=3D"Arial" color=3D"#000000"><br></font></div><div =
style=3D"caret-color: rgb(0, 0, 0);"><font face=3D"Arial" =
color=3D"#000000">You could then partition the record by owner_type or =
owner_id or whatever else comes to mind.</font></div><div><font =
face=3D"Arial" color=3D"#000000"><br></font></div><div><font =
face=3D"Arial" color=3D"#000000">Then you just have to figure out the =
best way to index this monster for optimized =
queries.</font></div><div><font face=3D"Arial" =
color=3D"#000000"><br></font></div></div></div><br =
class=3D"Apple-interchange-newline" style=3D"caret-color: rgb(0, 0, 0); =
color: rgb(0, 0, 0);"><div><br><blockquote type=3D"cite"><div>On Dec 23, =
2024, at 12:31=E2=80=AFPM, Divyansh Gupta JNsThMAudy =
&lt;ag1567827@gmail.com&gt; wrote:</div><br =
class=3D"Apple-interchange-newline"><div><p dir=3D"ltr">Currently I =
haven't created those columns , I have created addons_json column which =
is a JSONB column yet in a discussion weather I should create or =
consider only one JSONB column.</p>
<br><div class=3D"gmail_quote gmail_quote_container"><div dir=3D"ltr" =
class=3D"gmail_attr">On Tue, 24 Dec 2024, 12:00=E2=80=AFam Divyansh =
Gupta JNsThMAudy, &lt;<a =
href=3D"mailto:ag1567827@gmail.com">ag1567827@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex"><p dir=3D"ltr">Range =
partition can help when you applies filter for a specific range but in =
my case I need to apply filter on userid always, however I have date =
columns but there is less variation in timestamp which I have that's why =
didn't go for range partition.</p>
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On =
Mon, 23 Dec 2024, 11:57=E2=80=AFpm Ron Johnson, &lt;<a =
href=3D"mailto:ronljohnsonjr@gmail.com" target=3D"_blank" =
rel=3D"noreferrer">ronljohnsonjr@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 =
.8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div =
dir=3D"ltr"><br></div><div>1. I bet you'd get better performance using =
RANGE partitioning.</div><div>2. Twenty million rows per userid&nbsp;is =
a <b>LOT</b>.&nbsp; No subdivisions (like date range)?</div><br><div =
class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Mon, Dec =
23, 2024 at 1:23=E2=80=AFPM Divyansh Gupta JNsThMAudy &lt;<a =
href=3D"mailto:ag1567827@gmail.com" rel=3D"noreferrer noreferrer" =
target=3D"_blank">ag1567827@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr">Adrian, Please check =
this out;<br><br><span style=3D"background-color:rgb(47,47,47);padding:0px=
 0px 0px 2px"><span =
style=3D"color:rgb(204,204,204);font-family:Consolas;font-size:10pt;white-=
space:pre-wrap"><span =
style=3D"color:rgb(115,158,202);font-weight:bold">PARTITION</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">BY</span> <span =
style=3D"color:rgb(158,158,158)">HASH</span> (<span =
style=3D"color:rgb(158,158,158)">userid</span>)<span =
style=3D"color:rgb(238,204,100)">;
</span></span><span style=3D"padding:0px 0px 0px 2px"><span =
style=3D"color:rgb(204,204,204);font-family:Consolas;font-size:10pt;white-=
space:pre-wrap"><span =
style=3D"color:rgb(115,158,202);font-weight:bold">CREATE</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">TABLE</span> <span =
style=3D"color:rgb(204,155,117)">dbo</span>.<span =
style=3D"color:rgb(183,136,211)">googledocs_tbl_clone_part_0</span> =
<span style=3D"color:rgb(115,158,202);font-weight:bold">PARTITION</span> =
<span style=3D"color:rgb(115,158,202);font-weight:bold">OF</span> <span =
style=3D"color:rgb(158,158,158)">dbo</span>.<span =
style=3D"color:rgb(158,158,158)">googledocs_tbl_clone</span>  <span =
style=3D"color:rgb(115,158,202);font-weight:bold">FOR</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">VALUES</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">WITH</span> (<span =
style=3D"color:rgb(158,158,158)">modulus</span> <span =
style=3D"color:rgb(192,192,192)">84</span>, <span =
style=3D"color:rgb(158,158,158)">remainder</span> <span =
style=3D"color:rgb(192,192,192)">0</span>)<span =
style=3D"color:rgb(238,204,100)">;
...
</span></span><span style=3D"padding:0px 0px 0px 2px"><span =
style=3D"color:rgb(204,204,204);font-family:Consolas;font-size:10pt;white-=
space:pre-wrap"><span =
style=3D"color:rgb(115,158,202);font-weight:bold">CREATE</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">TABLE</span> <span =
style=3D"color:rgb(204,155,117)">dbo</span>.<span =
style=3D"color:rgb(183,136,211)">googledocs_tbl_clone_part_83</span> =
<span style=3D"color:rgb(115,158,202);font-weight:bold">PARTITION</span> =
<span style=3D"color:rgb(115,158,202);font-weight:bold">OF</span> <span =
style=3D"color:rgb(158,158,158)">dbo</span>.<span =
style=3D"color:rgb(158,158,158)">googledocs_tbl_clone</span>  <span =
style=3D"color:rgb(115,158,202);font-weight:bold">FOR</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">VALUES</span> <span =
style=3D"color:rgb(115,158,202);font-weight:bold">WITH</span> (<span =
style=3D"color:rgb(158,158,158)">modulus</span> <span =
style=3D"color:rgb(192,192,192)">84</span>, <span =
style=3D"color:rgb(158,158,158)">remainder</span> <span =
style=3D"color:rgb(192,192,192)">83</span>)<span =
style=3D"color:rgb(238,204,100)">;


</span></span></span><span =
style=3D"color:rgb(204,204,204);font-family:Consolas;font-size:10pt;white-=
space:pre-wrap"><span =
style=3D"color:rgb(238,204,100)"></span></span></span><span =
style=3D"color:rgb(204,204,204);font-family:Consolas;font-size:10pt;white-=
space:pre-wrap"><span =
style=3D"color:rgb(238,204,100)"></span></span></span></div><br><div =
class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Mon, Dec =
23, 2024 at 11:48=E2=80=AFPM Divyansh Gupta JNsThMAudy &lt;<a =
href=3D"mailto:ag1567827@gmail.com" rel=3D"noreferrer noreferrer" =
target=3D"_blank">ag1567827@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><p dir=3D"ltr">Adrian, the partition =
is on userid using hash partition with 84 partitions</p><p =
dir=3D"ltr">Ron, there could be more than 20 Million records possible =
for a single userid in that case if I create index on userid only not on =
other column the query is taking more than 30 seconds to return the =
results.</p>
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On =
Mon, 23 Dec 2024, 11:40=E2=80=AFpm Ron Johnson, &lt;<a =
href=3D"mailto:ronljohnsonjr@gmail.com" rel=3D"noreferrer noreferrer" =
target=3D"_blank">ronljohnsonjr@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div>If your queries =
all reference userid, then you only need indices on gdid and =
userid.</div><br><div class=3D"gmail_quote"><div dir=3D"ltr" =
class=3D"gmail_attr">On Mon, Dec 23, 2024 at 12:49=E2=80=AFPM Divyansh =
Gupta JNsThMAudy &lt;<a href=3D"mailto:ag1567827@gmail.com" =
rel=3D"noreferrer noreferrer noreferrer" =
target=3D"_blank">ag1567827@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><p dir=3D"ltr">I have one confusion =
with this design if I opt to create 50 columns I need to create 50 index =
which will work with userid index in Bitmap on the other hand if I =
create a JSONB column I need to create a single index ?</p>
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On =
Mon, 23 Dec 2024, 11:10=E2=80=AFpm Ron Johnson, &lt;<a =
href=3D"mailto:ronljohnsonjr@gmail.com" rel=3D"noreferrer noreferrer =
noreferrer" target=3D"_blank">ronljohnsonjr@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr"><div>Given what you =
just wrote, I'd stick with 50 separate t* columns.&nbsp; Simplifies =
queries, simplifies updates, and eliminates JSONB =
conversions.</div><br><div class=3D"gmail_quote"><div dir=3D"ltr" =
class=3D"gmail_attr">On Mon, Dec 23, 2024 at 12:29=E2=80=AFPM Divyansh =
Gupta JNsThMAudy &lt;<a href=3D"mailto:ag1567827@gmail.com" =
rel=3D"noreferrer noreferrer noreferrer noreferrer" =
target=3D"_blank">ag1567827@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><p dir=3D"ltr">Values can be updated =
based on customer actions</p><p dir=3D"ltr">All rows won't have all 50 =
key value pairs always if I make those keys into columns the rows might =
have null value on the other hand if it is JSONB then the key value pair =
will not be there</p><p dir=3D"ltr">Yes in UI customers can search for =
the key value pairs</p><p dir=3D"ltr">During data population the key =
value pair will be empty array in case of JSONB column or NULL in case =
of table columns, later when customer performs some actions that time =
the key value pairs will populate and update, based on what action =
customer performs.<br>
</p>
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On =
Mon, 23 Dec 2024, 10:51=E2=80=AFpm Divyansh Gupta JNsThMAudy, &lt;<a =
href=3D"mailto:ag1567827@gmail.com" rel=3D"noreferrer noreferrer =
noreferrer noreferrer" target=3D"_blank">ag1567827@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr">Let's make it more =
understandable, here is the table schema with 50 columns in =
it&nbsp;<br><br>CREATE TABLE dbo.googledocs_tbl (<br>	gdid int8 =
GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 MINVALUE 1 MAXVALUE =
9223372036854775807 START 1 CACHE 1 NO CYCLE) NOT NULL,<br>	userid =
int8 NOT NULL,<br>	t1 int4 NULL,<br>	t2 int4 NULL,<br>	=
t3 int4 NULL,<br>	t4 int4 NULL,<br>	t5 int4 NULL,<br>	=
t6 int4 NULL,<br>	t7 int4 NULL,<br>	t8 int4 NULL,<br>	=
t9 int4 NULL,<br>	t10 int4 NULL,<br>	t11 int4 NULL,<br>	=
t12 int4 NULL,<br>	t13 int4 NULL,<br>	t14 int4 NULL,<br>	=
t15 int4 NULL,<br>	t16 int4 NULL,<br>	t17 int4 NULL,<br>	=
t18 int4 NULL,<br>	t19 int4 NULL,<br>	t20 int4 NULL,<br>	=
t21 int4 NULL,<br>	t22 int4 NULL,<br>	t23 int4 NULL,<br>	=
t24 int4 NULL,<br>	t25 int4 NULL,<br>	t26 int4 NULL,<br>	=
t27 int4 NULL,<br>	t28 int4 NULL,<br>	t29 int4 NULL,<br>	=
t30 int4 NULL,<br>	t31 int4 NULL,<br>	t32 int4 NULL,<br>	=
t33 int4 NULL,<br>	t34 int4 NULL,<br>	t35 int4 NULL,<br>	=
t36 int4 NULL,<br>	t37 int4 NULL,<br>	t38 int4 NULL,<br>	=
t39 int4 NULL,<br>	t40 int4 NULL,<br>	t41 int4 NULL,<br>	=
t42 int4 NULL,<br>	t43 int4 NULL,<br>	t44 int4 NULL,<br>	=
t45 int4 NULL,<br>	t46 int4 NULL,<br>	t47 int4 NULL,<br>	=
t48 int4 NULL,<br>	t49 int4 NULL,<br>	t50 int4 NULL,<br>	=
CONSTRAINT googledocs_tbl_pkey PRIMARY KEY (gdid),<br>);<br><br>Every =
time when i query I will query it along with userid&nbsp;<br>Ex : where =
userid =3D 12345678 and t1 in (1,2,3) and t2 in (0,1,2)<br>more key =
filters if customer applies&nbsp;<br><br>On the other hand if I create a =
single jsonb column the schema will look like :<br><br>CREATE TABLE =
dbo.googledocs_tbl (<br>	gdid int8 GENERATED BY DEFAULT AS =
IDENTITY( INCREMENT BY 1 MINVALUE 1 MAXVALUE 9223372036854775807 START 1 =
CACHE 1 NO CYCLE) NOT NULL,<br>	userid int8 NOT NULL,<br>	=
addons_json jsonb default '{}'::jsonb<br>	CONSTRAINT =
googledocs_tbl_pkey PRIMARY KEY (gdid),<br>);<br><br>and the query would =
be like&nbsp;<br>where userid =3D 12345678 and ((addons_json&nbsp;@&gt; =
{t1:1}) or&nbsp;

(addons_json&nbsp;<a class=3D"gmail_plusreply" =
id=3D"m_2851015775117840618m_825930680353329927m_-8977558299889630118m_197=
9415867627385579m_226170299586707328m_2525891115250520179m_570473913477545=
3558m_4822255652052756050m_-1215567552791878704gmail-plusReplyChip-0" =
rel=3D"noreferrer noreferrer noreferrer noreferrer noreferrer">@&gt; =
{t1:2}) or&nbsp;</a>

(addons_json&nbsp;<a class=3D"gmail_plusreply" =
id=3D"m_2851015775117840618m_825930680353329927m_-8977558299889630118m_197=
9415867627385579m_226170299586707328m_2525891115250520179m_570473913477545=
3558m_4822255652052756050m_-1215567552791878704gmail-plusReplyChip-0" =
rel=3D"noreferrer noreferrer noreferrer noreferrer noreferrer">@&gt; =
{t1:3})<br>more key filters if customer =
applies&nbsp;<br><br><br></a></div><br><div class=3D"gmail_quote"><div =
dir=3D"ltr" class=3D"gmail_attr">On Mon, Dec 23, 2024 at 10:38=E2=80=AFPM =
David G. Johnston &lt;<a href=3D"mailto:david.g.johnston@gmail.com" =
rel=3D"noreferrer noreferrer noreferrer noreferrer noreferrer" =
target=3D"_blank">david.g.johnston@gmail.com</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex"><div dir=3D"auto"><br><br><div =
class=3D"gmail_quote" dir=3D"auto"><div dir=3D"ltr" =
class=3D"gmail_attr">On Mon, Dec 23, 2024, 10:01 Divyansh Gupta =
JNsThMAudy &lt;<a href=3D"mailto:ag1567827@gmail.com" rel=3D"noreferrer =
noreferrer noreferrer noreferrer noreferrer" =
target=3D"_blank">ag1567827@gmail.com</a>&gt; wrote:</div><blockquote =
class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px =
solid rgb(204,204,204);padding-left:1ex"><div dir=3D"auto"><p =
dir=3D"ltr"><br></p><p dir=3D"ltr">So here my question is considering =
one JSONB column is perfect or considering 50 columns will be more =
optimised.</p></div></blockquote></div><div dir=3D"auto">The relational =
database engine is designed around the column-based approach.&nbsp; =
Especially if the columns are generally unchanging, combined with using =
fixed-width data types.</div><div dir=3D"auto"><br></div><div =
dir=3D"auto">David J.</div><div dir=3D"auto"><br></div><div =
class=3D"gmail_quote" dir=3D"auto"><blockquote class=3D"gmail_quote" =
style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid =
rgb(204,204,204);padding-left:1ex">
</blockquote></div></div>
</blockquote></div>
</blockquote></div>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span =
class=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" =
class=3D"gmail_signature"><div dir=3D"ltr">Death to &lt;Redacted&gt;, =
and butter sauce.<div>Don't boil me, I'm still =
alive.<br><div><div>&lt;Redacted&gt; =
lobster!</div></div></div></div></div></div>
</blockquote></div>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span =
class=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" =
class=3D"gmail_signature"><div dir=3D"ltr">Death to &lt;Redacted&gt;, =
and butter sauce.<div>Don't boil me, I'm still =
alive.<br><div><div>&lt;Redacted&gt; =
lobster!</div></div></div></div></div></div>
</blockquote></div>
</blockquote></div>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span =
class=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" =
class=3D"gmail_signature"><div dir=3D"ltr">Death to &lt;Redacted&gt;, =
and butter sauce.<div>Don't boil me, I'm still =
alive.<br><div><div>&lt;Redacted&gt; =
lobster!</div></div></div></div></div></div>
</blockquote></div>
</blockquote></div>
</div></blockquote></div><br></body></html>=

--Apple-Mail=_E9BEF892-949C-458B-8756-73FFCC98896B--