Received: from maia.hub.org (unknown [200.46.208.211]) by mail.postgresql.org (Postfix) with ESMTP id D945B632F50 for ; Thu, 11 Feb 2010 09:07:53 -0400 (AST) Received: from mail.postgresql.org ([200.46.204.86]) by maia.hub.org (mx1.hub.org [200.46.208.211]) (amavisd-maia, port 10024) with ESMTP id 24502-02-9 for ; Thu, 11 Feb 2010 13:07:35 +0000 (UTC) X-Greylist: from auto-whitelisted by SQLgrey-1.7.6 Received: from outmail128141.authsmtp.com (outmail128141.authsmtp.com [62.13.128.141]) by mail.postgresql.org (Postfix) with ESMTP id 901B7632CEB for ; Thu, 11 Feb 2010 09:06:53 -0400 (AST) Received: from mail-c194.authsmtp.com (mail-c194.authsmtp.com [62.13.128.121]) by punt8.authsmtp.com (8.14.2/8.14.2/Kp) with ESMTP id o1BD6kPZ085592; Thu, 11 Feb 2010 13:06:46 GMT Received: from [10.0.1.26] (74-92-138-153-WashingtonDC.hfc.comcastbusiness.net [74.92.138.153]) (authenticated bits=0) by mail.authsmtp.com (8.14.2/8.14.2/Kp) with ESMTP id o1BD6e3Y015494; Thu, 11 Feb 2010 13:06:41 GMT Subject: Re: Re: [COMMITTERS] pgsql: Make standby server continuously retry restoring the next WAL From: Simon Riggs To: Heikki Linnakangas Cc: Fujii Masao , PostgreSQL-development In-Reply-To: <4B73FB99.4080403@enterprisedb.com> References: <20100127152751.3B2047541B9@cvs.postgresql.org> <3f0b79eb1002092105r21e009d3v468496058ba04392@mail.gmail.com> <4B726120.80007@enterprisedb.com> <1265884657.7341.1192.camel@ebony> <4B73F678.8070109@enterprisedb.com> <1265891248.7341.1346.camel@ebony> <4B73FB99.4080403@enterprisedb.com> Content-Type: text/plain Date: Thu, 11 Feb 2010 13:06:39 +0000 Message-Id: <1265893599.7341.1454.camel@ebony> Mime-Version: 1.0 X-Mailer: Evolution 2.26.1 Content-Transfer-Encoding: 7bit X-Server-Quench: 4c195f09-170e-11df-80b9-0022640b883e X-Report-Spam: If SPAM / abuse - report it at: http://www.authsmtp.com/abuse X-AuthRoute: OCdxZQATClZOTQEd DAteCiN5VAwpPBRK HVkIKg5MJUcNSQVJ NksadRtFaQRba1xT HGQLWlREUV57WWV/ bwsfagFDa0hQXgZi TklMQU1XHAJ3AVJe B2xqVExxHgVHfXp5 YQhlX3dYEkApdE94 FE5dCGwBZTJ9aWFL Bl1Qd1FdbQNKfB1D blAtXHsONCtlM3Bw LC8aFBMcBw5qYB5Y EEk+BnU3ZGc3IhMG fCVKBTo0BElXDwA6 LBFuMUIVGkobIw0p PEE/VEh6exQVDBFf GVxKHTRdNhEbSjYx HEs7R0MFDDpHQCFT SgEoL1dODyxOEhVx ICMA X-Authentic-SMTP: 61633235383639.1015:706/Kp X-AuthFastPath: 255 X-Virus-Status: No virus detected - but ensure you scan with your own anti-virus system. X-Virus-Scanned: Maia Mailguard 1.0.1 X-Spam-Status: No, hits=-2.599 tagged_above=-10 required=5 tests=AWL=0.000, BAYES_00=-2.599 X-Spam-Level: X-Archive-Number: 201002/846 X-Sequence-Number: 157189 On Thu, 2010-02-11 at 14:44 +0200, Heikki Linnakangas wrote: > Simon Riggs wrote: > > On Thu, 2010-02-11 at 14:22 +0200, Heikki Linnakangas wrote: > >> Simon Riggs wrote: > >>> On Wed, 2010-02-10 at 09:32 +0200, Heikki Linnakangas wrote: > >>>> Hmm, so after running restore_command, check the file size and if it's > >>>> too short, treat it the same as if restore_command returned non-zero? > >>>> And it will be retried on the next iteration. Works for me, though OTOH > >>>> it will then fail to complain about a genuinely WAL file that's > >>>> truncated for some reason. I guess there's no way around that, even if > >>>> you have a script as restore_command that does the file size check, it > >>>> will have the same problem. > >>> Are we trying to re-invent pg_standby here? > >> That's not the goal, but we seem to need some of the same functionality > >> in the backend now. > > > > I think you need to say why... > > See the quoted paragraph above. We should check the file size, so that > we will not fail if the WAL file is just being copied into the archive > directory. We can read, but that's not an explanation. By giving terse answers in that way you are giving the impression that you don't want discussion on these points. If you were running pg_standby as the restore_command then this error wouldn't happen. So you need to explain why running pg_standby cannot solve your problem and why we must fix it by replicating code that has previously existed elsewhere. -- Simon Riggs www.2ndQuadrant.com