MIME-Version: 1.0
References: <CAHesJ5LES3aTDf=xp7NOwrADQ_HWC-Spsv7yLu9ZY+zxzZO53A@mail.gmail.com>
 <b450927c-49da-46e5-ad74-bf38ceff166b@aklaver.com> <CAHesJ5+ASNoSNMiC5Ms0Ts=gw7v2_UeBpUT=phujO4yE_XCbEw@mail.gmail.com>
 <CAHesJ5JbkCBZ2f_AvUr8+KWnGPAsobu4zyfnWm8bEeb7X9oqDQ@mail.gmail.com>
 <06e1f1ee-74b2-43a2-9a63-da20ae455ae2@aklaver.com> <CAHesJ5JLzhHiGSBSkJZ7x7rGgHeeByP=wWk1D5GG=x8cJ5YY6Q@mail.gmail.com>
 <CAKFQuwYdpzwcbSdQ8TvZ-nVjPeHVVz+5=bWofCbUK+p_o=axrQ@mail.gmail.com>
 <CAHesJ5+yTenkAxOT8H33Cfe=1b2kSyXGqxFYfYz5fgYAVVvFmw@mail.gmail.com>
 <CAHesJ5KaJ8p7QhB9UUoFEbA87cU7ke4GBMkKR3q2FJPVv9GXyw@mail.gmail.com>
 <CANzqJaB_s8eXCZJvYO9CLvgJNqrshD=G5GgECi1M9=vk-JHjdQ@mail.gmail.com> <CAHesJ5LgLi9-uGCk3J9TUkuyttysz3fzTaP+o57EjcBtwDYKZA@mail.gmail.com>
In-Reply-To: <CAHesJ5LgLi9-uGCk3J9TUkuyttysz3fzTaP+o57EjcBtwDYKZA@mail.gmail.com>
From: Ron Johnson <ronljohnsonjr@gmail.com>
Date: Mon, 23 Dec 2024 13:09:38 -0500
Message-ID: <CANzqJaD-MwXzvg97q0iLvAdkf=DnUMOq0Ex2_eNU7sTxEL7bfA@mail.gmail.com>
Subject: Re: Need help in database design
To: pgsql-general <pgsql-general@postgresql.org>
Content-Type: multipart/alternative; boundary="000000000000f9d2c60629f3e6d1"
Archived-At: <https://www.postgresql.org/message-id/CANzqJaD-MwXzvg97q0iLvAdkf%3DDnUMOq0Ex2_eNU7sTxEL7bfA%40mail.gmail.com>
Precedence: bulk

--000000000000f9d2c60629f3e6d1
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

If your queries all reference userid, then you only need indices on gdid
and userid.

On Mon, Dec 23, 2024 at 12:49=E2=80=AFPM Divyansh Gupta JNsThMAudy <
ag1567827@gmail.com> wrote:

> I have one confusion with this design if I opt to create 50 columns I nee=
d
> to create 50 index which will work with userid index in Bitmap on the oth=
er
> hand if I create a JSONB column I need to create a single index ?
>
> On Mon, 23 Dec 2024, 11:10=E2=80=AFpm Ron Johnson, <ronljohnsonjr@gmail.c=
om>
> wrote:
>
>> Given what you just wrote, I'd stick with 50 separate t* columns.
>> Simplifies queries, simplifies updates, and eliminates JSONB conversions=
.
>>
>> On Mon, Dec 23, 2024 at 12:29=E2=80=AFPM Divyansh Gupta JNsThMAudy <
>> ag1567827@gmail.com> wrote:
>>
>>> Values can be updated based on customer actions
>>>
>>> All rows won't have all 50 key value pairs always if I make those keys
>>> into columns the rows might have null value on the other hand if it is
>>> JSONB then the key value pair will not be there
>>>
>>> Yes in UI customers can search for the key value pairs
>>>
>>> During data population the key value pair will be empty array in case o=
f
>>> JSONB column or NULL in case of table columns, later when customer perf=
orms
>>> some actions that time the key value pairs will populate and update, ba=
sed
>>> on what action customer performs.
>>>
>>> On Mon, 23 Dec 2024, 10:51=E2=80=AFpm Divyansh Gupta JNsThMAudy, <
>>> ag1567827@gmail.com> wrote:
>>>
>>>> Let's make it more understandable, here is the table schema with 50
>>>> columns in it
>>>>
>>>> CREATE TABLE dbo.googledocs_tbl (
>>>> gdid int8 GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 MINVALUE 1
>>>> MAXVALUE 9223372036854775807 START 1 CACHE 1 NO CYCLE) NOT NULL,
>>>> userid int8 NOT NULL,
>>>> t1 int4 NULL,
>>>> t2 int4 NULL,
>>>> t3 int4 NULL,
>>>> t4 int4 NULL,
>>>> t5 int4 NULL,
>>>> t6 int4 NULL,
>>>> t7 int4 NULL,
>>>> t8 int4 NULL,
>>>> t9 int4 NULL,
>>>> t10 int4 NULL,
>>>> t11 int4 NULL,
>>>> t12 int4 NULL,
>>>> t13 int4 NULL,
>>>> t14 int4 NULL,
>>>> t15 int4 NULL,
>>>> t16 int4 NULL,
>>>> t17 int4 NULL,
>>>> t18 int4 NULL,
>>>> t19 int4 NULL,
>>>> t20 int4 NULL,
>>>> t21 int4 NULL,
>>>> t22 int4 NULL,
>>>> t23 int4 NULL,
>>>> t24 int4 NULL,
>>>> t25 int4 NULL,
>>>> t26 int4 NULL,
>>>> t27 int4 NULL,
>>>> t28 int4 NULL,
>>>> t29 int4 NULL,
>>>> t30 int4 NULL,
>>>> t31 int4 NULL,
>>>> t32 int4 NULL,
>>>> t33 int4 NULL,
>>>> t34 int4 NULL,
>>>> t35 int4 NULL,
>>>> t36 int4 NULL,
>>>> t37 int4 NULL,
>>>> t38 int4 NULL,
>>>> t39 int4 NULL,
>>>> t40 int4 NULL,
>>>> t41 int4 NULL,
>>>> t42 int4 NULL,
>>>> t43 int4 NULL,
>>>> t44 int4 NULL,
>>>> t45 int4 NULL,
>>>> t46 int4 NULL,
>>>> t47 int4 NULL,
>>>> t48 int4 NULL,
>>>> t49 int4 NULL,
>>>> t50 int4 NULL,
>>>> CONSTRAINT googledocs_tbl_pkey PRIMARY KEY (gdid),
>>>> );
>>>>
>>>> Every time when i query I will query it along with userid
>>>> Ex : where userid =3D 12345678 and t1 in (1,2,3) and t2 in (0,1,2)
>>>> more key filters if customer applies
>>>>
>>>> On the other hand if I create a single jsonb column the schema will
>>>> look like :
>>>>
>>>> CREATE TABLE dbo.googledocs_tbl (
>>>> gdid int8 GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 MINVALUE 1
>>>> MAXVALUE 9223372036854775807 START 1 CACHE 1 NO CYCLE) NOT NULL,
>>>> userid int8 NOT NULL,
>>>> addons_json jsonb default '{}'::jsonb
>>>> CONSTRAINT googledocs_tbl_pkey PRIMARY KEY (gdid),
>>>> );
>>>>
>>>> and the query would be like
>>>> where userid =3D 12345678 and ((addons_json @> {t1:1}) or  (addons_jso=
n @>
>>>> {t1:2}) or  (addons_json @> {t1:3})
>>>> more key filters if customer applies
>>>>
>>>>
>>>>
>>>> On Mon, Dec 23, 2024 at 10:38=E2=80=AFPM David G. Johnston <
>>>> david.g.johnston@gmail.com> wrote:
>>>>
>>>>>
>>>>>
>>>>> On Mon, Dec 23, 2024, 10:01 Divyansh Gupta JNsThMAudy <
>>>>> ag1567827@gmail.com> wrote:
>>>>>
>>>>>>
>>>>>> So here my question is considering one JSONB column is perfect or
>>>>>> considering 50 columns will be more optimised.
>>>>>>
>>>>> The relational database engine is designed around the column-based
>>>>> approach.  Especially if the columns are generally unchanging, combin=
ed
>>>>> with using fixed-width data types.
>>>>>
>>>>> David J.
>>>>>
>>>>>
>>
>> --
>> Death to <Redacted>, and butter sauce.
>> Don't boil me, I'm still alive.
>> <Redacted> lobster!
>>
>

--=20
Death to <Redacted>, and butter sauce.
Don't boil me, I'm still alive.
<Redacted> lobster!

--000000000000f9d2c60629f3e6d1
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div>If your queries all reference userid, then you only n=
eed indices on gdid and userid.</div><br><div class=3D"gmail_quote gmail_qu=
ote_container"><div dir=3D"ltr" class=3D"gmail_attr">On Mon, Dec 23, 2024 a=
t 12:49=E2=80=AFPM Divyansh Gupta JNsThMAudy &lt;<a href=3D"mailto:ag156782=
7@gmail.com">ag1567827@gmail.com</a>&gt; wrote:<br></div><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rg=
b(204,204,204);padding-left:1ex"><p dir=3D"ltr">I have one confusion with t=
his design if I opt to create 50 columns I need to create 50 index which wi=
ll work with userid index in Bitmap on the other hand if I create a JSONB c=
olumn I need to create a single index ?</p>
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Mon=
, 23 Dec 2024, 11:10=E2=80=AFpm Ron Johnson, &lt;<a href=3D"mailto:ronljohn=
sonjr@gmail.com" target=3D"_blank">ronljohnsonjr@gmail.com</a>&gt; wrote:<b=
r></div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex=
;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir=3D"ltr">=
<div>Given what you just wrote, I&#39;d stick with 50 separate t* columns.=
=C2=A0 Simplifies queries, simplifies updates, and eliminates JSONB convers=
ions.</div><br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_a=
ttr">On Mon, Dec 23, 2024 at 12:29=E2=80=AFPM Divyansh Gupta JNsThMAudy &lt=
;<a href=3D"mailto:ag1567827@gmail.com" rel=3D"noreferrer" target=3D"_blank=
">ag1567827@gmail.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_qu=
ote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,20=
4);padding-left:1ex"><p dir=3D"ltr">Values can be updated based on customer=
 actions</p>
<p dir=3D"ltr">All rows won&#39;t have all 50 key value pairs always if I m=
ake those keys into columns the rows might have null value on the other han=
d if it is JSONB then the key value pair will not be there</p>
<p dir=3D"ltr">Yes in UI customers can search for the key value pairs</p>
<p dir=3D"ltr">During data population the key value pair will be empty arra=
y in case of JSONB column or NULL in case of table columns, later when cust=
omer performs some actions that time the key value pairs will populate and =
update, based on what action customer performs.<br>
</p>
<br><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Mon=
, 23 Dec 2024, 10:51=E2=80=AFpm Divyansh Gupta JNsThMAudy, &lt;<a href=3D"m=
ailto:ag1567827@gmail.com" rel=3D"noreferrer" target=3D"_blank">ag1567827@g=
mail.com</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D=
"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-le=
ft:1ex"><div dir=3D"ltr">Let&#39;s make it more understandable, here is the=
 table schema with 50 columns in it=C2=A0<br><br>CREATE TABLE dbo.googledoc=
s_tbl (<br>	gdid int8 GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 MINV=
ALUE 1 MAXVALUE 9223372036854775807 START 1 CACHE 1 NO CYCLE) NOT NULL,<br>=
	userid int8 NOT NULL,<br>	t1 int4 NULL,<br>	t2 int4 NULL,<br>	t3 int4 NULL=
,<br>	t4 int4 NULL,<br>	t5 int4 NULL,<br>	t6 int4 NULL,<br>	t7 int4 NULL,<b=
r>	t8 int4 NULL,<br>	t9 int4 NULL,<br>	t10 int4 NULL,<br>	t11 int4 NULL,<br=
>	t12 int4 NULL,<br>	t13 int4 NULL,<br>	t14 int4 NULL,<br>	t15 int4 NULL,<b=
r>	t16 int4 NULL,<br>	t17 int4 NULL,<br>	t18 int4 NULL,<br>	t19 int4 NULL,<=
br>	t20 int4 NULL,<br>	t21 int4 NULL,<br>	t22 int4 NULL,<br>	t23 int4 NULL,=
<br>	t24 int4 NULL,<br>	t25 int4 NULL,<br>	t26 int4 NULL,<br>	t27 int4 NULL=
,<br>	t28 int4 NULL,<br>	t29 int4 NULL,<br>	t30 int4 NULL,<br>	t31 int4 NUL=
L,<br>	t32 int4 NULL,<br>	t33 int4 NULL,<br>	t34 int4 NULL,<br>	t35 int4 NU=
LL,<br>	t36 int4 NULL,<br>	t37 int4 NULL,<br>	t38 int4 NULL,<br>	t39 int4 N=
ULL,<br>	t40 int4 NULL,<br>	t41 int4 NULL,<br>	t42 int4 NULL,<br>	t43 int4 =
NULL,<br>	t44 int4 NULL,<br>	t45 int4 NULL,<br>	t46 int4 NULL,<br>	t47 int4=
 NULL,<br>	t48 int4 NULL,<br>	t49 int4 NULL,<br>	t50 int4 NULL,<br>	CONSTRA=
INT googledocs_tbl_pkey PRIMARY KEY (gdid),<br>);<br><br>Every time when i =
query I will query it along with userid=C2=A0<br>Ex : where userid =3D 1234=
5678 and t1 in (1,2,3) and t2 in (0,1,2)<br>more key filters if customer ap=
plies=C2=A0<br><br>On the other hand if I create a single jsonb column the =
schema will look like :<br><br>CREATE TABLE dbo.googledocs_tbl (<br>	gdid i=
nt8 GENERATED BY DEFAULT AS IDENTITY( INCREMENT BY 1 MINVALUE 1 MAXVALUE 92=
23372036854775807 START 1 CACHE 1 NO CYCLE) NOT NULL,<br>	userid int8 NOT N=
ULL,<br>	addons_json jsonb default &#39;{}&#39;::jsonb<br>	CONSTRAINT googl=
edocs_tbl_pkey PRIMARY KEY (gdid),<br>);<br><br>and the query would be like=
=C2=A0<br>where userid =3D 12345678 and ((addons_json=C2=A0@&gt; {t1:1}) or=
=C2=A0

(addons_json=C2=A0<a class=3D"gmail_plusreply" id=3D"m_2525891115250520179m=
_5704739134775453558m_4822255652052756050m_-1215567552791878704gmail-plusRe=
plyChip-0" rel=3D"noreferrer noreferrer">@&gt; {t1:2}) or=C2=A0</a>

(addons_json=C2=A0<a class=3D"gmail_plusreply" id=3D"m_2525891115250520179m=
_5704739134775453558m_4822255652052756050m_-1215567552791878704gmail-plusRe=
plyChip-0" rel=3D"noreferrer noreferrer">@&gt; {t1:3})<br>more key filters =
if customer applies=C2=A0<br><br><br></a></div><br><div class=3D"gmail_quot=
e"><div dir=3D"ltr" class=3D"gmail_attr">On Mon, Dec 23, 2024 at 10:38=E2=
=80=AFPM David G. Johnston &lt;<a href=3D"mailto:david.g.johnston@gmail.com=
" rel=3D"noreferrer noreferrer" target=3D"_blank">david.g.johnston@gmail.co=
m</a>&gt; wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin=
:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"=
><div dir=3D"auto"><br><br><div class=3D"gmail_quote" dir=3D"auto"><div dir=
=3D"ltr" class=3D"gmail_attr">On Mon, Dec 23, 2024, 10:01 Divyansh Gupta JN=
sThMAudy &lt;<a href=3D"mailto:ag1567827@gmail.com" rel=3D"noreferrer noref=
errer" target=3D"_blank">ag1567827@gmail.com</a>&gt; wrote:</div><blockquot=
e class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px s=
olid rgb(204,204,204);padding-left:1ex"><div dir=3D"auto"><p dir=3D"ltr"><b=
r></p><p dir=3D"ltr">So here my question is considering one JSONB column is=
 perfect or considering 50 columns will be more optimised.</p></div></block=
quote></div><div dir=3D"auto">The relational database engine is designed ar=
ound the column-based approach.=C2=A0 Especially if the columns are general=
ly unchanging, combined with using fixed-width data types.</div><div dir=3D=
"auto"><br></div><div dir=3D"auto">David J.</div><div dir=3D"auto"><br></di=
v><div class=3D"gmail_quote" dir=3D"auto"><blockquote class=3D"gmail_quote"=
 style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);p=
adding-left:1ex">
</blockquote></div></div>
</blockquote></div>
</blockquote></div>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr">Death to &lt;Redacted&gt;, and butter sauce.<div=
>Don&#39;t boil me, I&#39;m still alive.<br><div><div>&lt;Redacted&gt; lobs=
ter!</div></div></div></div></div></div>
</blockquote></div>
</blockquote></div><div><br clear=3D"all"></div><div><br></div><span class=
=3D"gmail_signature_prefix">-- </span><br><div dir=3D"ltr" class=3D"gmail_s=
ignature"><div dir=3D"ltr">Death to &lt;Redacted&gt;, and butter sauce.<div=
>Don&#39;t boil me, I&#39;m still alive.<br><div><div>&lt;Redacted&gt; lobs=
ter!</div></div></div></div></div></div>

--000000000000f9d2c60629f3e6d1--