MIME-Version: 1.0
References: <CACxu=vJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg@mail.gmail.com>
 <1342498.1729444411@sss.pgh.pa.us> <CACxu=vLXvpzN4X3k+9jsMt6ujuOvFVUSkA80t_cROSsF4y2jQQ@mail.gmail.com>
 <1445998.1729482404@sss.pgh.pa.us>
In-Reply-To: <1445998.1729482404@sss.pgh.pa.us>
From: Michel Pelletier <pelletier.michel@gmail.com>
Date: Mon, 21 Oct 2024 10:23:33 -0700
Message-ID: <CACxu=vKEF8Qa-OaADFxf0uMg-xw6gH_CNCWd2s+xaqh-gY4=xg@mail.gmail.com>
Subject: Re: Using Expanded Objects other than Arrays from plpgsql
To: Tom Lane <tgl@sss.pgh.pa.us>
Cc: pgsql-hackers@postgresql.org
Content-Type: multipart/alternative; boundary="000000000000b962750624ffebff"
Archived-At: <https://www.postgresql.org/message-id/CACxu%3DvKEF8Qa-OaADFxf0uMg-xw6gH_CNCWd2s%2Bxaqh-gY4%3Dxg%40mail.gmail.com>
Precedence: bulk

--000000000000b962750624ffebff
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

On Sun, Oct 20, 2024 at 8:46 PM Tom Lane <tgl@sss.pgh.pa.us> wrote:

> Michel Pelletier <pelletier.michel@gmail.com> writes:
> > On Sun, Oct 20, 2024 at 10:13 AM Tom Lane <tgl@sss.pgh.pa.us> wrote:
>

(from thread
https://www.postgresql.org/message-id/CACxu%3DvJaKFNsYxooSnW1wEgsAO5u_v1XYBacfVJ14wgJV_PYeg%40mail.gmail.com)


> >> But it seems like we could get an easy win by adjusting
> >> plpgsql_exec_function along the lines of
> >> ...
>
> > I tried this change and couldn't get it to work, on the next line:
> >     if (VARATT_IS_EXTERNAL_EXPANDED_RW(DatumGetPointer(var->value)))
> > var->value might not be a pointer, as it seems at least from my gdb
> > scratching, but say an integer.  This segfaults on non-array but
> > non-expandable datum.
>
>   We need the same test that exec_assign_value makes,
> !var->datatype->typbyval, before it's safe to apply DatumGetPointer.
> So line 549 needs to be more like
>
> -                    if (!var->isnull && var->datatype->typisarray)
> +                    if (!var->isnull && !var->datatype->typbyval)
>
> > Another comment that caught my eye was this one:
> > https://github.com/postgres/postgres/blob/master/src/pl/plpgsql/src/pl_exec.c#L8304
> > Not sure what the implication is there.
>
> Yeah, that's some more unfinished business.  I'm not sure if it
> matters to your use-case or not.
>
> BTW, we probably should move this thread to pgsql-hackers.


And here we are.  Thanks for your help on this, Tom.  For anyone joining
from the other thread: I'm writing a Postgres extension that wraps
the SuiteSparse:GraphBLAS API and provides new types for sparse and dense
matrices and vectors.  It's like a combination of numpy and scipy.sparse
for Postgres, with an emphasis on graph analytics that treats graphs as
sparse adjacency matrices and works on them with linear algebra.

I use the expandeddatum API to expand the compressed on-disk
representations of these objects into "live" in-memory objects managed by
SuiteSparse, and to flatten them back again.  All GraphBLAS objects are
opaque handles, and my expanded objects are essentially a box around such
a handle.  I use memory context callbacks to free the handles when the
memory context of the box is freed.  This works very well and I've made a
lot of progress on a clean algebraic API.  Here are the doctests for
matrices, all generated from live code:

https://onesparse.github.io/OneSparse/test_matrix_header/
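
The "box around a handle" arrangement can be modeled outside the backend.
This is a minimal standalone sketch, not the extension's code: Context,
Callback, Handle, Box and every function name below are hypothetical
stand-ins.  In a real extension the registration would go through the
backend's memory context reset callback mechanism instead of this toy list.

```c
#include <assert.h>
#include <stdlib.h>

/* Stand-in for a memory context: here it only tracks cleanup callbacks. */
typedef struct Callback {
    void (*func)(void *arg);
    void *arg;
    struct Callback *next;
} Callback;

typedef struct Context {
    Callback *callbacks;
} Context;

/* Stand-in for an opaque library handle (e.g. a matrix owned by a library). */
typedef struct Handle {
    int nvals;
} Handle;

/* The expanded object: a thin box around one handle. */
typedef struct Box {
    Context *owner;
    Handle *handle;
} Box;

static int handles_freed = 0;   /* instrumentation for the example only */

/* Cleanup callback: release the handle, then the box itself.  (The toy
 * context does not own allocations, so the box is freed here too.) */
static void box_release(void *arg)
{
    Box *box = arg;

    assert(box->handle != NULL);
    free(box->handle);
    free(box);
    handles_freed++;
}

/* Create a box in a context and register its cleanup callback. */
static Box *box_create(Context *cxt)
{
    Box *box = malloc(sizeof *box);
    Handle *handle = calloc(1, sizeof *handle);
    Callback *cb = malloc(sizeof *cb);

    if (box == NULL || handle == NULL || cb == NULL) {
        free(box);
        free(handle);
        free(cb);
        return NULL;
    }
    box->owner = cxt;
    box->handle = handle;
    cb->func = box_release;
    cb->arg = box;
    cb->next = cxt->callbacks;
    cxt->callbacks = cb;
    return box;
}

/* Tear down the context: run every registered callback exactly once. */
static void context_delete(Context *cxt)
{
    Callback *cb = cxt->callbacks;

    while (cb != NULL) {
        Callback *next = cb->next;

        cb->func(cb->arg);
        free(cb);
        cb = next;
    }
    cxt->callbacks = NULL;
}
```

Creating two boxes in one context and then calling context_delete on it
releases both handles without the caller ever freeing a box by hand, which
is the property the callbacks are there to provide.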

While benchmarking some simple graph algorithms written in plpgsql, I
noticed that my objects were constantly being flattened and expanded.  I
asked about this on pgsql-general (thread linked above); Tom gave me some
tips and suggested I move the thread here.

My non-expert summary: plpgsql only optimizes for expanded objects if they
are arrays.  Non-array expanded objects get flattened and re-expanded on
every use, which is very expensive for large matrices and vectors.
Ideally I'd like to expand my object once, use it throughout the function,
return it to another function that may use it, and flatten it only when it
goes to disk or when flattening is completely unavoidable.  The comment in
expandeddatum.h more or less sells this as one of the main features:

 * An expanded object is meant to survive across multiple operations, but
 * not to be enormously long-lived; for example it might be a local variable
 * in a PL/pgSQL procedure.  So its extra bulk compared to the on-disk format
 * is a worthwhile trade-off.

In my case it's not a question of bulk: the on-disk representation of a
matrix is not usable at compute time at all.  It has to be expanded (using
GraphBLAS's serialize/deserialize API) before it can take part in matrix
operations like matmul.  Most algorithms that use these objects iterate in
a loop, doing various algebraic operations that almost always involve a
matmul, until they converge on a stable solution or exhaust the input
elements.  Here, for example, is a "minimum parent BFS" that takes a graph
and a starting node and traverses the graph breadth first, computing a
vector of every node's minimum parent id.

CREATE OR REPLACE FUNCTION bfs(graph matrix, start_node bigint)
    RETURNS vector LANGUAGE plpgsql AS
    $$
    DECLARE
    bfs_vector vector = vector('int32');
    next_vector vector = vector('int32');
    BEGIN
        -- seed the traversal with the starting node
        bfs_vector = set_element(bfs_vector, start_node, 1);
        -- iterate until a step no longer changes the vector
        WHILE bfs_vector != next_vector LOOP
            next_vector = dup(bfs_vector);
            bfs_vector = vxm(bfs_vector, graph, 'any_secondi_int32',
                             w=>bfs_vector, accum=>'min_int32');
        END LOOP;
    RETURN bfs_vector;
    END;
    $$;
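
For readers who don't know GraphBLAS, the loop can be approximated in
plain C.  This is a rough sketch under loud assumptions, not what
SuiteSparse executes: it uses a small dense 0/1 adjacency matrix, a NONE
marker for "no entry", records the start node as its own parent, and
always picks the smallest candidate parent where the 'any' monoid would be
free to pick any of them.  The names bfs_step and min_parent_bfs are
invented for the example.

```c
#include <assert.h>
#include <string.h>

#define N 5          /* number of nodes in the example graph */
#define NONE (-1)    /* marks "no entry" in the parent vector */

/* One vector-times-matrix step: every node j with an edge k -> j, where k
 * already has an entry, gets k as a candidate parent.  The smaller of the
 * candidate and any existing entry is kept (the "min" accumulation). */
static void bfs_step(int graph[N][N], const int in[N], int out[N])
{
    memcpy(out, in, sizeof(int) * N);
    for (int k = 0; k < N; k++) {
        if (in[k] == NONE)
            continue;
        for (int j = 0; j < N; j++) {
            if (!graph[k][j])
                continue;
            if (out[j] == NONE || k < out[j])
                out[j] = k;
        }
    }
}

/* Repeat the step until the vector stops changing, mirroring the
 * WHILE ... LOOP structure of the plpgsql function. */
static void min_parent_bfs(int graph[N][N], int start, int parent[N])
{
    int prev[N];

    assert(start >= 0 && start < N);
    for (int i = 0; i < N; i++)
        parent[i] = NONE;
    parent[start] = start;

    do {
        memcpy(prev, parent, sizeof prev);
        bfs_step(graph, prev, parent);
    } while (memcmp(prev, parent, sizeof prev) != 0);
}
```

Note that each iteration copies the vector and compares two vectors.  In
this toy those are cheap memcpy and memcmp calls, but in plpgsql each
corresponding step is a point where an expanded object can end up being
flattened and expanded again.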

(If you're wondering "why would anyone do it this way?", it's because
SuiteSparse is optimized for parallel sparse matrix multiplication and has
a JIT compiler that can target multiple architectures, at the moment CPUs
and CUDA GPUs.  Reusing the linear algebra already prevalent in graph
theory means not having to think about low-level implementation issues,
and the code is portable from CPU to GPU or other accelerators.)

So, I made the two small changes Tom suggested above and I have them in a
side fork here:

https://github.com/postgres/postgres/compare/master...michelp:postgres-upstream:michelp-flatless#diff-0c35024d1576c347689c7abad68abd8562a0aa5f0d2c63d6d65df4b360b0e807
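
The reason for the typbyval test can be shown with a standalone model.
This is only a sketch with stand-in types: Datum, TypeInfo, Header and Var
here are simplified, hypothetical versions defined in the snippet, not the
real PostgreSQL or plpgsql definitions, and var_is_expanded_rw is an
invented name.  The point is just the order of the checks: a by-value
datum holds the value itself, so it must never be treated as a pointer.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in: one machine word holding either a value or a pointer. */
typedef uintptr_t Datum;

/* Hypothetical subset of the type information a variable carries. */
typedef struct TypeInfo {
    bool typbyval;      /* true: the Datum is the value, not a pointer */
} TypeInfo;

/* Hypothetical header found at the start of a pass-by-reference value. */
typedef struct Header {
    bool expanded_rw;   /* true: read-write pointer to an expanded object */
} Header;

typedef struct Var {
    Datum value;
    bool isnull;
    const TypeInfo *datatype;
} Var;

/* Returns true only for a non-null, pass-by-reference value whose header
 * is flagged as a read-write expanded object.  The typbyval check comes
 * first so that a plain integer is never dereferenced. */
static bool var_is_expanded_rw(const Var *var)
{
    assert(var != NULL && var->datatype != NULL);

    if (var->isnull || var->datatype->typbyval)
        return false;
    return ((const Header *) var->value)->expanded_rw;
}
```

With only an "is it an array?" test replaced by "is it anything at all?",
an integer variable holding, say, 42 would reach the dereference and crash,
which matches the segfault I saw before adding the guard.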

The good news is that my code still works.  The bad news is that there is
still a lot of flattening/expanding/freeing going on at multiple points in
each iteration of the algorithm.  I'll also note that in:

    BEGIN
        bfs_vector = set_element(bfs_vector, start_node, 1);

I'd prefer that not to be an assignment, since set_element mutates the
object.  (I eventually plan to support subscripting syntax like
bfs_vector[start_node] = 1.)

The same goes for:

            bfs_vector = vxm(bfs_vector, graph, 'any_secondi_int32',
                             w=>bfs_vector, accum=>'min_int32');

This matmul mutates bfs_vector, so I shouldn't need to assign the result
back.  At the moment the assignment seems necessary, because otherwise the
mutations are lost, but it costs a full flatten/expand cycle.

Short term, my goal is to optimize plpgsql so that my objects stay expanded
for the life of the function.  Long term, there's some "unfinished
business", to use Tom's words, around the expandeddatum API.  I'm not
really qualified to speak on these issues, but this is my understanding of
some of them:

  - plpgsql knows how to expand arrays and is hardwired for them, but how
would it know how to expand other expandable types?

  - exec_check_rw_parameter is also hardwired: it only optimizes expanded
objects for array append and prepend.  I suspect this has something to do
with my issue above about mutating objects in place.

I may have missed something, but hopefully that brings anyone interested
in this topic up to speed.

-Michel

--000000000000b962750624ffebff--