Content-Type: multipart/alternative;
 boundary="===============7060529155987209325=="
MIME-Version: 1.0
Subject: pg_infer 1.0.0 released -- transformer model knowledge as SQL
 relations
To: PostgreSQL Announce <pgsql-announce@lists.postgresql.org>
From: Greg Burd via PostgreSQL Announce <announce-noreply@postgresql.org>
Reply-To: greg@burd.me
Date: Fri, 22 May 2026 18:49:09 +0000
Message-ID: <177947574985.794.11485875377229486447@wrigleys.postgresql.org>
Auto-Submitted: auto-generated
Archived-At: 
 <https://www.postgresql.org/message-id/177947574985.794.11485875377229486447%40wrigleys.postgresql.org>
Precedence: bulk

--===============7060529155987209325==
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable

I am pleased to announce the first public release of [pg_infer](https://cod=
eberg.org/gregburd/pg_infer), a PostgreSQL 18+
extension that exposes the internals of small transformer language models -=
- gate activations, feature labels, learned associations, embeddings -- as =
SQL-queryable relations and a custom index access method.

pg_infer is not "natural language to SQL." It is not "SQL to natural langua=
ge." There is no chat interface, no agent loop, no prompt template generati=
ng queries. pg_infer brings model inference into the query plan as an opera=
tor the planner can cost, schedule, parallelize, and combine with ordinary =
predicates and joins. The model becomes a first-class data source -- a set =
of relations the planner can scan, filter, and join -- not an external serv=
ice the database talks to.

Quick example
-------------

    -- Register a vindex (extracted model knowledge):
    `SELECT infer_create_model('qwen05b', '/data/qwen-0.5b.vindex');`

    -- What does the model know about France?

```
    SELECT * FROM describe('France');
    --  relation  |  target   | confidence | layer
    -- -----------+-----------+------------+-------
    --  capital   | Paris     |       42.7 |    18
    --  language  | French    |       38.1 |    17
    --  continent | Europe    |       35.4 |    16
    -- ...
```

    -- `ORDER BY` model-knowledge similarity:
```
    CREATE INDEX ON documents USING infer (title)
       WITH (model =3D 'qwen05b');

    SELECT * FROM documents
     ORDER BY title <~> 'artificial intelligence'
     LIMIT 5;
```

The `<~>` operator is index-backed, supports
EXPLAIN (ANALYZE, BUFFERS), and composes with WHERE, JOIN,
aggregation, and partitioning the way any other operator does.

What pg_infer does that other extensions do not
-----------------------------------------------

  * pgvector / pgvectorscale stores user-supplied embedding
    vectors and answers nearest-neighbour distance queries.
    pg_infer goes a layer deeper: it stores the model itself
    (gate vectors, feature activations, label metadata) in
    WAL-logged 8KB pages, and answers questions like "does
    the model treat A and B as related?" -- not "are these
    two embeddings close?"

  * pg_search / RAG-style integrations turn user queries into
    embedding lookups against external vector stores. pg_infer
    exposes the model's internal structure to SQL: walk(prompt)
    returns per-layer feature activations; describe(entity)
    returns the relations the model has learned about an
    entity; implies(a, b) tests directional support.

  * pg_infer's index AM ships in two modes:
      - "model" indexes WAL-log the entire vindex inside
        PostgreSQL pages, so backup, replication, and
        point-in-time recovery cover the model the same way
        they cover your tables.
      - "column" indexes attach a model to a text column and
        make ORDER BY <~> on that column index-driven.

  * The mmap-backed local backend lets multiple PG backends
    share decoded model pages through the OS page cache; the
    remote backend (larql-server / larql-router over HTTP/2 or
    a Unix socket) shares one copy of the model across a host
    and supports layer-sharded routing. In-flight remote calls
    respond to pg_cancel_backend(...) within roughly 100ms.

CPU inference, BitNet, and idle-cluster compute
-----------------------------------------------

Database servers almost never have GPUs. They have a lot of fast cores, a l=
ot of RAM, and -- on most production deployments -- standby replicas, read-=
only physical replicas, logical subscribers, and DR hosts that spend most o=
f the day at single-digit CPU utilization while the primary takes the write=
 traffic.

pg_infer and the underlying larql crates target this hardware profile direc=
tly:

  * The default execution paths run efficiently on CPU, with
    BLAS-backed linear algebra (OpenBLAS) and f16 gate vectors
    that decode to f32 lazily.

  * pg_infer / larql support models in the Microsoft BitNet
    b1.58 family ("two-bit / 1.58-bit" ternary-weight
    transformers, https://arxiv.org/abs/2402.17764), which
    were specifically designed to run on commodity CPUs at
    competitive quality and dramatically lower memory and power
    cost than f16 baselines. Combined with f16 gate
    activations, this brings useful inference inside a
    PostgreSQL backend without any specialized accelerator.

  * The cluster model is the point. A typical PostgreSQL HA /
    DR / read-scale deployment has one busy primary and one or
    more largely idle physical replicas, plus, increasingly, a
    fleet of logical subscribers. Those replicas already pay
    for themselves in availability, but their CPUs are idle the
    vast majority of the time. With pg_infer's remote backend,
    larql-server runs on the replica hosts and serves model
    operators back to the primary's query plans -- the model is
    materialized once per host, the activation cache is shared,
    and the work happens on capacity you have already paid for.
    No GPU, no separate inference cluster, no extra network
    egress.

A few queries that are uniquely pg_infer
----------------------------------------

  * Model-aware document ranking that does not depend on
    keyword overlap or pre-computed embeddings:

```
      SELECT id, title
        FROM papers
       ORDER BY title <~> 'neural architecture search'
       LIMIT 10;
```

    This finds "AutoML for Deep Networks" because the model
    learned that relationship -- pg_trgm cannot, FTS cannot,
    and pgvector can only do so if you computed and stored
    embeddings ahead of time with a model whose semantics
    happen to agree with your query.

  * Joining model knowledge with relational data:

```
      SELECT c.id, c.name, p.title,
             p.title <~> c.research_interest AS dist
        FROM candidates c
        JOIN papers     p
          ON p.title <~> c.research_interest < 0.2
       WHERE c.country =3D 'DE';
```

    Standard SQL semantics, standard PostgreSQL planner, plus
    a model-driven join condition.

  * Probing what a model "knows" without running it:

```
      SELECT relation, target, confidence
        FROM describe('PostgreSQL')
       WHERE confidence > 30;
```

  * Auditing model behaviour over time. Because the vindex is
    stored in WAL-logged pages, point-in-time recovery on a
    pg_infer-using cluster gives you the model state at any
    historical moment alongside the data state. "What was the
    model saying about this entity at 03:14 UTC last Tuesday?"
    is a literal PITR + describe(...) question.

Acknowledgements
----------------

pg_infer would not exist without the LARQL project by Chris Hayuk (https://=
github.com/chrishayuk). LARQL pioneered the idea of making transformer mode=
l internals queryable -- extracting gate vectors, feature activations, and =
learned associations into a format ("vindex") that can be explored interact=
ively and expressed as a query language. The vindex format, the gate KNN al=
gorithm, and the feature-labeling pipeline all originate from LARQL; pg_inf=
er adapts them into a PostgreSQL access method, a WAL-logged storage layer,=
 and a planner-visible operator.

If the larql ideas resonate, please look at the original work and at Chris'=
s video walkthroughs explaining the vindex format, the gate-KNN algorithm, =
and the LARQL query language:

* Chris Hayuk on YouTube:    https://www.youtube.com/@chrishayuk
* Original LARQL repo:       https://github.com/chrishayuk/larql
* larql-rs (Rust port):      https://github.com/chrishayuk/chuk-larql-rs

Thank you, Chris, for the foundational work and for being open with the des=
ign.

A note on stability and feedback
--------------------------------

pg_infer is new software. The SQL surface, the index AM, the remote backend=
 protocol, and the test suite are stable enough to release at 1.0.0, and th=
e vindex on-disk format is stable forward; but the project is young, and th=
e combination of PostgreSQL + pgrx + transformer internals is unusual enoug=
h that there are certainly bugs that the existing tests do not yet provoke.=
 It is not a beta and not a research toy; it is real software released earl=
y, and it should be used with appropriate caution.

Bug reports, feature requests, and pull requests are very welcome -- especi=
ally reproductions, vindex compatibility issues, planner-cost regressions, =
and integration suggestions for other PostgreSQL extensions.

Links
-----

* Repository:    [https://codeberg.org/gregburd/pg_infer](https://codeberg.=
org/gregburd/pg_infer)=20
* Issues:        [https://codeberg.org/gregburd/pg_infer/issues](https://co=
deberg.org/gregburd/pg_infer/issues)
* LARQL:         [https://github.com/chrishayuk/larql](https://github.com/c=
hrishayuk/larql)=20
* larql-rs:      [https://github.com/chrishayuk/chuk-larql-rs](https://gith=
ub.com/chrishayuk/chuk-larql-rs)
* BitNet b1.58:  [https://arxiv.org/abs/2402.17764](https://arxiv.org/abs/2=
402.17764)
--===============7060529155987209325==
Content-Type: text/html; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: quoted-printable


<!doctype html>
<html>
  <head>
    <meta name=3D"viewport" content=3D"width=3Ddevice-width">
    <meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3DUTF-8=
">
    <title>pg_infer 1.0.0 released -- transformer model knowledge as SQL re=
lations</title>
    <style>

    @media only screen and (max-width: 620px) {
      table[class=3Dbody] h1 {
        font-size: 28px !important;
        margin-bottom: 10px !important;
      }
      table[class=3Dbody] p,
            table[class=3Dbody] ul,
            table[class=3Dbody] ol,
            table[class=3Dbody] td,
            table[class=3Dbody] span,
            table[class=3Dbody] a {
        font-size: 16px !important;
      }
      table[class=3Dbody] .wrapper,
            table[class=3Dbody] .article {
        padding: 10px !important;
      }
      table[class=3Dbody] .content {
        padding: 0 !important;
      }
      table[class=3Dbody] .container {
        padding: 0 !important;
        width: 100% !important;
      }
      table[class=3Dbody] .main {
        border-left-width: 0 !important;
        border-radius: 0 !important;
        border-right-width: 0 !important;
      }
      table[class=3Dbody] .btn table {
        width: 100% !important;
      }
      table[class=3Dbody] .btn a {
        width: 100% !important;
      }
      table[class=3Dbody] .img-responsive {
        height: auto !important;
        max-width: 100% !important;
        width: auto !important;
      }
    }

    @media all {
      .ExternalClass {
        width: 100%;
      }
      .ExternalClass,
            .ExternalClass p,
            .ExternalClass span,
            .ExternalClass font,
            .ExternalClass td,
            .ExternalClass div {
        line-height: 100%;
      }
      .apple-link a {
        color: inherit !important;
        font-family: inherit !important;
        font-size: inherit !important;
        font-weight: inherit !important;
        line-height: inherit !important;
        text-decoration: none !important;
      }
      #MessageViewBody a {
        color: inherit;
        text-decoration: none;
        font-size: inherit;
        font-family: inherit;
        font-weight: inherit;
        line-height: inherit;
      }
      .btn-primary table td:hover {
        background-color: #34495e !important;
      }
      .btn-primary a:hover {
        background-color: #34495e !important;
        border-color: #34495e !important;
      }
    }
    </style>
  </head>
  <body class=3D"" style=3D"background-color: #f6f6f6; font-family: sans-se=
rif; -webkit-font-smoothing: antialiased; font-size: 14px; line-height: 1.4=
; margin: 0; padding: 0; -ms-text-size-adjust: 100%; -webkit-text-size-adju=
st: 100%;">
    <table border=3D"0" cellpadding=3D"0" cellspacing=3D"0" class=3D"body" =
style=3D"border-collapse: separate; mso-table-lspace: 0pt; mso-table-rspace=
: 0pt; width: 100%; background-color: #f6f6f6;">
      <tr>
        <td style=3D"font-family: sans-serif; font-size: 14px; vertical-ali=
gn: top;">&nbsp;</td>
        <td class=3D"container" style=3D"font-family: sans-serif; font-size=
: 14px; vertical-align: top; display: block; Margin: 0 auto; max-width: 580=
px; padding: 10px; width: 580px;">
          <div class=3D"content" style=3D"box-sizing: border-box; display: =
block; Margin: 0 auto; max-width: 580px; padding: 10px;">


            <span class=3D"preheader" style=3D"color: transparent; display:=
 none; height: 0; max-height: 0; max-width: 0; opacity: 0; overflow: hidden=
; mso-hide: all; visibility: hidden; width: 0;"></span>
            <table class=3D"main" style=3D"border-collapse: separate; mso-t=
able-lspace: 0pt; mso-table-rspace: 0pt; width: 100%; background: #ffffff; =
border-radius: 3px;">


              <tr>
                <td class=3D"wrapper" style=3D"font-family: sans-serif; fon=
t-size: 14px; vertical-align: top; box-sizing: border-box; padding: 20px;">
                  <table border=3D"0" cellpadding=3D"0" cellspacing=3D"0" s=
tyle=3D"border-collapse: separate; mso-table-lspace: 0pt; mso-table-rspace:=
 0pt; width: 100%;">
                    <tr>
                      <td style=3D"font-family: sans-serif; font-size: 14px=
; vertical-align: top;">

<div>
<h1 style=3D"color: #000; font-family: sans-serif; line-height: 1.4; margin=
: 0; margin-bottom: 30px; font-size: 25px; font-weight: 300; text-align: ce=
nter">pg_infer 1.0.0 released -- transformer model knowledge as SQL relatio=
ns</h1>
</div>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">I am pleased to announce the first public r=
elease of <a href=3D"https://codeberg.org/gregburd/pg_infer" style=3D"color=
: #3498db; text-decoration: underline">pg_infer</a>, a PostgreSQL 18+
extension that exposes the internals of small transformer language models -=
- gate activations, feature labels, learned associations, embeddings -- as =
SQL-queryable relations and a custom index access method.</p>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_infer is not "natural language to SQL." =
It is not "SQL to natural language." There is no chat interface, no agent l=
oop, no prompt template generating queries. pg_infer brings model inference=
 into the query plan as an operator the planner can cost, schedule, paralle=
lize, and combine with ordinary predicates and joins. The model becomes a f=
irst-class data source -- a set of relations the planner can scan, filter, =
and join -- not an external service the database talks to.</p>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">Quick example</h2>
<pre><code>-- Register a vindex (extracted model knowledge):
`SELECT infer_create_model('qwen05b', '/data/qwen-0.5b.vindex');`

-- What does the model know about France?
</code></pre>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px"><code>SELECT * FROM describe('France');
    --  relation  |  target   | confidence | layer
    -- -----------+-----------+------------+-------
    --  capital   | Paris     |       42.7 |    18
    --  language  | French    |       38.1 |    17
    --  continent | Europe    |       35.4 |    16
    -- ...</code></p>
<pre><code>-- `ORDER BY` model-knowledge similarity:
</code></pre>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">```
    CREATE INDEX ON documents USING infer (title)
       WITH (model =3D 'qwen05b');</p>
<pre><code>SELECT * FROM documents
 ORDER BY title &lt;~&gt; 'artificial intelligence'
 LIMIT 5;
</code></pre>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">```</p>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">The <code>&lt;~&gt;</code> operator is inde=
x-backed, supports
EXPLAIN (ANALYZE, BUFFERS), and composes with WHERE, JOIN,
aggregation, and partitioning the way any other operator does.</p>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">What pg_infer does that other e=
xtensions do not</h2>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pgvector / pgvectorscale stores user-suppli=
ed embedding
    vectors and answers nearest-neighbour distance queries.
    pg_infer goes a layer deeper: it stores the model itself
    (gate vectors, feature activations, label metadata) in
    WAL-logged 8KB pages, and answers questions like "does
    the model treat A and B as related?" -- not "are these
    two embeddings close?"</p>
</li>
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_search / RAG-style integrations turn use=
r queries into
    embedding lookups against external vector stores. pg_infer
    exposes the model's internal structure to SQL: walk(prompt)
    returns per-layer feature activations; describe(entity)
    returns the relations the model has learned about an
    entity; implies(a, b) tests directional support.</p>
</li>
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_infer's index AM ships in two modes:</p>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">"model" indexes=
 WAL-log the entire vindex inside
    PostgreSQL pages, so backup, replication, and
    point-in-time recovery cover the model the same way
    they cover your tables.</li>
<li style=3D"list-style-position: inside; margin-left: 5px">"column" indexe=
s attach a model to a text column and
    make ORDER BY &lt;~&gt; on that column index-driven.</li>
</ul>
</li>
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">The mmap-backed local backend lets multiple=
 PG backends
    share decoded model pages through the OS page cache; the
    remote backend (larql-server / larql-router over HTTP/2 or
    a Unix socket) shares one copy of the model across a host
    and supports layer-sharded routing. In-flight remote calls
    respond to pg_cancel_backend(...) within roughly 100ms.</p>
</li>
</ul>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">CPU inference, BitNet, and idle=
-cluster compute</h2>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">Database servers almost never have GPUs. Th=
ey have a lot of fast cores, a lot of RAM, and -- on most production deploy=
ments -- standby replicas, read-only physical replicas, logical subscribers=
, and DR hosts that spend most of the day at single-digit CPU utilization w=
hile the primary takes the write traffic.</p>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_infer and the underlying larql crates ta=
rget this hardware profile directly:</p>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">The default execution paths run efficiently=
 on CPU, with
    BLAS-backed linear algebra (OpenBLAS) and f16 gate vectors
    that decode to f32 lazily.</p>
</li>
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_infer / larql support models in the Micr=
osoft BitNet
    b1.58 family ("two-bit / 1.58-bit" ternary-weight
    transformers, https://arxiv.org/abs/2402.17764), which
    were specifically designed to run on commodity CPUs at
    competitive quality and dramatically lower memory and power
    cost than f16 baselines. Combined with f16 gate
    activations, this brings useful inference inside a
    PostgreSQL backend without any specialized accelerator.</p>
</li>
<li style=3D"list-style-position: inside; margin-left: 5px">
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">The cluster model is the point. A typical P=
ostgreSQL HA /
    DR / read-scale deployment has one busy primary and one or
    more largely idle physical replicas, plus, increasingly, a
    fleet of logical subscribers. Those replicas already pay
    for themselves in availability, but their CPUs are idle the
    vast majority of the time. With pg_infer's remote backend,
    larql-server runs on the replica hosts and serves model
    operators back to the primary's query plans -- the model is
    materialized once per host, the activation cache is shared,
    and the work happens on capacity you have already paid for.
    No GPU, no separate inference cluster, no extra network
    egress.</p>
</li>
</ul>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">A few queries that are uniquely=
 pg_infer</h2>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">Model-aware doc=
ument ranking that does not depend on
    keyword overlap or pre-computed embeddings:</li>
</ul>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px"><code>SELECT id, title
        FROM papers
       ORDER BY title &lt;~&gt; 'neural architecture search'
       LIMIT 10;</code></p>
<pre><code>This finds "AutoML for Deep Networks" because the model
learned that relationship -- pg_trgm cannot, FTS cannot,
and pgvector can only do so if you computed and stored
embeddings ahead of time with a model whose semantics
happen to agree with your query.
</code></pre>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">Joining model k=
nowledge with relational data:</li>
</ul>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px"><code>SELECT c.id, c.name, p.title,
             p.title &lt;~&gt; c.research_interest AS dist
        FROM candidates c
        JOIN papers     p
          ON p.title &lt;~&gt; c.research_interest &lt; 0.2
       WHERE c.country =3D 'DE';</code></p>
<pre><code>Standard SQL semantics, standard PostgreSQL planner, plus
a model-driven join condition.
</code></pre>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">Probing what a =
model "knows" without running it:</li>
</ul>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px"><code>SELECT relation, target, confidence
        FROM describe('PostgreSQL')
       WHERE confidence &gt; 30;</code></p>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">Auditing model =
behaviour over time. Because the vindex is
    stored in WAL-logged pages, point-in-time recovery on a
    pg_infer-using cluster gives you the model state at any
    historical moment alongside the data state. "What was the
    model saying about this entity at 03:14 UTC last Tuesday?"
    is a literal PITR + describe(...) question.</li>
</ul>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">Acknowledgements</h2>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_infer would not exist without the LARQL =
project by Chris Hayuk (https://github.com/chrishayuk). LARQL pioneered the=
 idea of making transformer model internals queryable -- extracting gate ve=
ctors, feature activations, and learned associations into a format ("vindex=
") that can be explored interactively and expressed as a query language. Th=
e vindex format, the gate KNN algorithm, and the feature-labeling pipeline =
all originate from LARQL; pg_infer adapts them into a PostgreSQL access met=
hod, a WAL-logged storage layer, and a planner-visible operator.</p>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">If the larql ideas resonate, please look at=
 the original work and at Chris's video walkthroughs explaining the vindex =
format, the gate-KNN algorithm, and the LARQL query language:</p>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">Chris Hayuk on =
YouTube:    https://www.youtube.com/@chrishayuk</li>
<li style=3D"list-style-position: inside; margin-left: 5px">Original LARQL =
repo:       https://github.com/chrishayuk/larql</li>
<li style=3D"list-style-position: inside; margin-left: 5px">larql-rs (Rust =
port):      https://github.com/chrishayuk/chuk-larql-rs</li>
</ul>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">Thank you, Chris, for the foundational work=
 and for being open with the design.</p>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">A note on stability and feedbac=
k</h2>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">pg_infer is new software. The SQL surface, =
the index AM, the remote backend protocol, and the test suite are stable en=
ough to release at 1.0.0, and the vindex on-disk format is stable forward; =
but the project is young, and the combination of PostgreSQL + pgrx + transf=
ormer internals is unusual enough that there are certainly bugs that the ex=
isting tests do not yet provoke. It is not a beta and not a research toy; i=
t is real software released early, and it should be used with appropriate c=
aution.</p>
<p style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal; =
margin: 0; margin-bottom: 15px">Bug reports, feature requests, and pull req=
uests are very welcome -- especially reproductions, vindex compatibility is=
sues, planner-cost regressions, and integration suggestions for other Postg=
reSQL extensions.</p>
<h2 style=3D"color: #000; font-family: sans-serif; font-weight: 400; line-h=
eight: 1.4; margin: 0; margin-bottom: 30px">Links</h2>
<ul style=3D"font-family: sans-serif; font-size: 14px; font-weight: normal;=
 margin: 0; margin-bottom: 15px">
<li style=3D"list-style-position: inside; margin-left: 5px">Repository:    =
<a href=3D"https://codeberg.org/gregburd/pg_infer" style=3D"color: #3498db;=
 text-decoration: underline">https://codeberg.org/gregburd/pg_infer</a> </l=
i>
<li style=3D"list-style-position: inside; margin-left: 5px">Issues:        =
<a href=3D"https://codeberg.org/gregburd/pg_infer/issues" style=3D"color: #=
3498db; text-decoration: underline">https://codeberg.org/gregburd/pg_infer/=
issues</a></li>
<li style=3D"list-style-position: inside; margin-left: 5px">LARQL:         =
<a href=3D"https://github.com/chrishayuk/larql" style=3D"color: #3498db; te=
xt-decoration: underline">https://github.com/chrishayuk/larql</a> </li>
<li style=3D"list-style-position: inside; margin-left: 5px">larql-rs:      =
<a href=3D"https://github.com/chrishayuk/chuk-larql-rs" style=3D"color: #34=
98db; text-decoration: underline">https://github.com/chrishayuk/chuk-larql-=
rs</a></li>
<li style=3D"list-style-position: inside; margin-left: 5px">BitNet b1.58:  =
<a href=3D"https://arxiv.org/abs/2402.17764" style=3D"color: #3498db; text-=
decoration: underline">https://arxiv.org/abs/2402.17764</a></li>
</ul>

                      </td>
                    </tr>
                  </table>
                </td>
              </tr>

            </table>

            <div class=3D"footer" style=3D"clear: both; Margin-top: 10px; t=
ext-align: center; width: 100%;">
              <table border=3D"0" cellpadding=3D"0" cellspacing=3D"0" style=
=3D"border-collapse: separate; mso-table-lspace: 0pt; mso-table-rspace: 0pt=
; width: 100%;">
                <tr>
                  <td class=3D"content-block" style=3D"font-family: sans-se=
rif; vertical-align: top; padding-bottom: 10px; padding-top: 10px; font-siz=
e: 12px; color: #999999; text-align: center;">
                    <span class=3D"apple-link" style=3D"color: #999999; fon=
t-size: 12px; text-align: center;">
This email was sent to you from Greg Burd. It was delivered on their behalf=
 by
the PostgreSQL project. Any questions about the content of the message shou=
ld be
sent to Greg Burd.
</span>
		    <br><br>
You were sent this email as a subscriber of the <em>pgsql-announce</em> mai=
linglist, for
the content tag Related Open Source.
To unsubscribe from
further emails, or change which emails you want to receive, please click th=
e personal unsubscribe
link that you can find in the headers of this email, or visit
<a href=3D"https://lists.postgresql.org/unsubscribe/" style=3D"color: #3498=
db; text-decoration: underline">https://lists.postgresql.org/unsubscribe/</=
a>.

                  </td>
                </tr>
              </table>
            </div>

          </div>
        </td>
        <td style=3D"font-family: sans-serif; font-size: 14px; vertical-ali=
gn: top;">&nbsp;</td>
      </tr>
    </table>
  </body>
</html>

--===============7060529155987209325==--