Received: from localhost (wm.hub.org [200.46.204.128]) by postgresql.org (Postfix) with ESMTP id E46979FB21D; Wed, 25 Oct 2006 06:38:27 -0300 (ADT) Received: from postgresql.org ([200.46.204.71]) by localhost (mx1.hub.org [200.46.204.128]) (amavisd-new, port 10024) with ESMTP id 18960-04; Wed, 25 Oct 2006 09:38:15 +0000 (UTC) X-Greylist: from auto-whitelisted by SQLgrey- X-Greylist: from auto-whitelisted by SQLgrey- Received: from bugaboo.mu (ns1.bugaboo.mu [213.133.111.57]) by postgresql.org (Postfix) with ESMTP id D76009FB253; Wed, 25 Oct 2006 06:38:14 -0300 (ADT) Received: from [192.168.77.26] (p54BDC1DC.dip0.t-ipconnect.de [::ffff:84.189.193.220]) (AUTH: CRAM-MD5 markus@bluegap.ch) by bugaboo.mu with esmtp; Wed, 25 Oct 2006 11:38:12 +0200 id 00E07D64.453F3084.000036A1 Message-ID: <453F3083.6080302@bluegap.ch> Date: Wed, 25 Oct 2006 11:38:11 +0200 From: Markus Schiltknecht User-Agent: Thunderbird 1.5.0.7 (X11/20060927) MIME-Version: 1.0 To: Bruce Momjian CC: Hannu Krosing , PostgreSQL-documentation , PostgreSQL-development Subject: Re: [HACKERS] Replication documentation addition References: <200610250255.k9P2tRM09844@momjian.us> In-Reply-To: <200610250255.k9P2tRM09844@momjian.us> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Scanned: Maia Mailguard 1.0.1 X-Spam-Status: No, hits=0.855 tagged_above=0 required=5 tests=FORGED_RCVD_HELO, SARE_SPEC_REPLICA X-Spam-Level: X-Archive-Number: 200610/73 X-Sequence-Number: 3794 Hi, Bruce Momjian wrote: > I have updated the text. Please let me know what else I should change. > I am unsure if I should be mentioning commercial PostgreSQL products in > our documentation. I support your POV and vote for not including any pointers to commercial extensions in the official documentation. If at all, they should go to 'external-projects.sgml', where PostGIS, PgAdmin and other projects are mentioned. I can't really get excited about the exclusion of the term 'replication', because it's what most people are looking for. It's a well known term. Sorry if it sounded that way, but I've not meant to avoid that term. The newly created terms 'Query Broadcast Load Balancing' or even worse 'Multi-Master Load Balancing' are more confusing than helpful, because these terms do not exist. (See the googlefight in [1]) Can we name the chapter "Fail-over, Load-Balancing and Replication Options"? That would fit everything and contain the necessary buzz words. Also, I'm still missing Multi- vs Single-Master, which are also commonly used terms. IMHO, it does not make sense to speak of a synchronous replication for a 'Shared Disk Fail Over'. It's not replication, because there's no replica. The Data Partitioning paragraph should probably mention it's close relation with data partitioning across table spaces (and make the differences clear). What you call 'Query Broadcast Load Balancing' is also a multi-master replication, thus naming only the later 'Multi-Master Load Balancing' misleading. I'd propose to add a subsection 'Synchronous, Multi-Master Replication' and explain the different possibilities on how to do that: * Query-Based * with 2PC * Distributed SHMEM * (perhaps mention the optimized Postgres-R algorithm ;-) What you called 'Single-Query Clustering' is probably better known as 'Parallel Query Execution'. It can be combined with all types of replication (every combination of async / sync and Single- / Multi-Master). It's maybe load balancing, but it depends on some form of replication to distribute the data first. I liked Chris Browns documentation in [2] which was clearer regarding replication (which can be used to do fail-over, load-balancing, data-partitioning or parallel query execution). I'd like to keep all those things a little more separate to get them clear. Regards Markus [1]: Googlefight: "Multi-Master Load Balancing" vs "Multi-Master Replication": http://tinyurl.com/y3k76r [2]: Chris Browns proposal for a replication documentation: http://archives.postgresql.org/pgsql-patches/2006-08/msg00026.php