DDX for PostgreSQL — Community

OVERVIEW

Every PostgreSQL contribution we can extract from public sources, attributed to a person, indexed by release. We pull from:

Identity resolution is conservative: the same email always maps to one person, while the same name with a different email produces two clusters, each flagged needs_review for operator confirmation. 246 of 2,453 clusters currently need review (mostly the same human appearing under two different email addresses).
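The clustering rule above can be sketched in a few lines of Python. This is an illustration only, not the production pass: the function name, the (name, email) record shape, and the case-insensitive comparison of emails and names are all assumptions made for the example.

```python
from collections import defaultdict


def cluster_identities(records):
    """Cluster (name, email) pairs by email and flag same-name collisions.

    Returns {normalized_email: {"names": set, "needs_review": bool}}.
    """
    clusters = {}
    emails_by_name = defaultdict(set)
    for name, email in records:
        key = email.strip().lower()  # assumption: emails compare case-insensitively
        cluster = clusters.setdefault(key, {"names": set(), "needs_review": False})
        cluster["names"].add(name)
        emails_by_name[name.strip().lower()].add(key)
    # Same name under several emails: keep the clusters separate,
    # but flag every one of them for operator confirmation.
    for keys in emails_by_name.values():
        if len(keys) > 1:
            for key in keys:
                clusters[key]["needs_review"] = True
    return clusters


# Hypothetical sample data.
records = [
    ("Alice Example", "alice@work.example"),
    ("Alice Example", "alice@home.example"),
    ("A. Example", "alice@work.example"),
    ("Bob Sample", "bob@corp.example"),
]
clusters = cluster_identities(records)
```

Here the two "Alice Example" addresses stay in separate clusters and both are flagged, while "A. Example" joins the work-address cluster unflagged because it shares that email.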

METHOD

Per-person pages live at /community/<hash>/ where <hash> is the first 8 hex chars of sha256(canonical_email). The same person resolves to the same hash regardless of which alias triggered the link, which keeps URLs stable across re-syncs and avoids exposing plaintext email addresses in the URL bar.
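A minimal sketch of the URL scheme in Python. The hashing step follows the description above; the alias table, the helper names, and the choice to hash the UTF-8 bytes of the email exactly as stored are assumptions for illustration.

```python
import hashlib

# Hypothetical alias table: alias email -> canonical email.
ALIASES = {"alice@home.example": "alice@work.example"}


def canonical_email(email: str) -> str:
    """Map an alias to its canonical address; unknown emails map to themselves."""
    return ALIASES.get(email, email)


def person_path(email: str) -> str:
    """Build the per-person path from the first 8 hex chars of sha256(canonical_email)."""
    canonical = canonical_email(email)
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()
    return f"/community/{digest[:8]}/"


path_from_alias = person_path("alice@home.example")
path_from_canonical = person_path("alice@work.example")
```

Because the alias is resolved before hashing, both calls return the same path, and the path contains only hex characters.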

Raw data is in PostgreSQL on fra (community.contributions_raw, community.persons, community.contributions_buildfarm) plus the :person/* and :contribution/* namespaces in the mentat triple store. The 6 community MCP verbs — list_contributors, get_contributor, find_contributions, release_contributors, contribution_summary, discussion_links — query the relational tables directly for speed; the mentat triples are there for cross-namespace joins (e.g. "reviewers of patches authored by X").

If your name is missing or merged with someone else's, please tell us — the community.committer_aliases hard-override table is hand-curated and takes precedence over the auto-clustering pass.
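The precedence rule can be pictured as a two-step lookup. This is a sketch under assumptions: both tables are modelled as plain dictionaries from email to a person identifier, and the function and identifier names are hypothetical.

```python
def resolve_person(email, overrides, auto_clusters):
    """Return the person id for an email, preferring the hand-curated override."""
    if email in overrides:
        return overrides[email]       # hand-curated entry wins
    return auto_clusters.get(email)   # otherwise fall back to auto-clustering


# Hypothetical data: auto-clustering split one person into two ids.
auto_clusters = {
    "alice@work.example": "person-1",
    "alice@home.example": "person-2",
}
overrides = {"alice@home.example": "person-1"}  # operator merged the two

merged = resolve_person("alice@home.example", overrides, auto_clusters)
untouched = resolve_person("alice@work.example", overrides, auto_clusters)
unknown = resolve_person("nobody@nowhere.example", overrides, auto_clusters)
```

A reported correction then only needs a new override row; the auto-clustering output can stay as it is.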

HOW THE BY-RELEASE NUMBERS ARE COMPUTED

The release column an individual contribution falls into is decided by a date heuristic against the commit's committed_at timestamp:

This is approximate, not branch-aware: a backport commit that landed on REL_18_STABLE in 2026 is currently classified as PG19-dev, because the classifier looks at committed_at rather than at the branch the commit lives on. Branch-aware classification is on the roadmap.

For each release, the columns (people, commits, authored / reviewed / tested / reported / committed / reverted) come from grouping community.contributions_raw by (release, role) and counting distinct commit SHAs per role; revert detection runs separately against ag_git_commits.message.
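The per-role counting can be sketched in Python as a group-by over (release, role) with a set of SHAs per group. The row shape (release, role, commit_sha) is an assumption for the example; the actual columns of community.contributions_raw are not described here.

```python
from collections import defaultdict


def count_by_release_role(rows):
    """Count distinct commit SHAs for each (release, role) pair."""
    shas = defaultdict(set)
    for release, role, sha in rows:
        shas[(release, role)].add(sha)  # the set drops repeated SHAs
    return {key: len(value) for key, value in shas.items()}


# Hypothetical rows: (release, role, commit_sha).
rows = [
    ("PG18", "authored", "a1"),
    ("PG18", "authored", "a1"),      # same commit listed twice, counted once
    ("PG18", "reviewed", "a1"),
    ("PG18", "authored", "b2"),
    ("PG19-dev", "authored", "c3"),
]
counts = count_by_release_role(rows)
# counts[("PG18", "authored")] == 2
```

Counting distinct SHAs rather than rows means a commit that appears several times under the same role contributes only once to that role's column.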

SCOPE, REVERT DETECTION, DEDUPLICATION

Three deliberate choices behind these numbers:

VERIFY IT YOURSELF

The full underlying dataset and the aggregation algorithm are published as a self-contained PostgreSQL script: /community/data.sql. A fresh copy is published on each refresh and contains:

Load it into any PostgreSQL 14+ database with psql -f data.sql and run the same queries the page uses. If the numbers don't match, that's a bug; please tell us.

FEEDBACK WELCOME

This page is a community resource and we want to credit everyone correctly. If an alias is missing, two identities should be merged, a release column looks wrong, or you spot a contribution we missed, please tell us. The goal is to reflect community participation accurately: the more accurate the data, the more useful the page.