Received: from maia.hub.org (maia-2.hub.org [200.46.204.251]) by mail.postgresql.org (Postfix) with ESMTP id 62195B5DBC9 for ; Tue, 14 Jun 2011 10:02:07 -0300 (ADT) Received: from mail.postgresql.org ([200.46.204.86]) by maia.hub.org (mx1.hub.org [200.46.204.251]) (amavisd-maia, port 10024) with ESMTP id 25613-03 for ; Tue, 14 Jun 2011 13:02:00 +0000 (UTC) X-Greylist: domain auto-whitelisted by SQLgrey-1.7.6 Received: from mail-px0-f174.google.com (mail-px0-f174.google.com [209.85.212.174]) by mail.postgresql.org (Postfix) with ESMTP id 4D409B5D80B for ; Tue, 14 Jun 2011 10:02:00 -0300 (ADT) Received: by pxi15 with SMTP id 15so3846170pxi.33 for ; Tue, 14 Jun 2011 06:02:00 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:cc:content-type :content-transfer-encoding; bh=18qLnXxM9DtilZ8sNDqgxT4Yv4V49YOqjNUqN4R9+wE=; b=RGE580pT2xFAqLEmBM3+ugz1HRcxrSSbRWJVODIIZmgK7u2pV6nXAJzjEMmGDNPW6x 8tuucIjd7lvQEb2ISzAUqoZG18Z8KE1brvVxBlictKftsyvksgkFx9pJ+UlE31ry2IfC FsCxnbTw92D1uuWlK425cYLAojagojdGpvXSI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=jGOB4HUcTmFVDUXXK5jRMl79zVOO1Kj7wfZD/E40/FiZPtjvqXngILdPpFJwn365wA 019YDNn7ifmS5liTGQRF/yd5Nfz4qo1z+vH5Z4Eu6wd8sFT+cRAXkkr8QUgs+7hR82eh caHQFwffzS38/8RmLoJXiQUJuutgTmN5rpIHM= MIME-Version: 1.0 Received: by 10.68.27.138 with SMTP id t10mr2954527pbg.245.1308056519854; Tue, 14 Jun 2011 06:01:59 -0700 (PDT) Received: by 10.68.56.70 with HTTP; Tue, 14 Jun 2011 06:01:59 -0700 (PDT) In-Reply-To: <1307989511.2862.810.camel@core2> References: <4DE89072.7070305@usit.uio.no> <1307989511.2862.810.camel@core2> Date: Tue, 14 Jun 2011 09:01:59 -0400 Message-ID: Subject: Re: Information about WAL Configuration needs an update From: Robert Haas To: Rafael Martinez Cc: pgsql-docs@postgresql.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Virus-Scanned: Maia Mailguard 1.0.1 X-Spam-Status: No, hits=-1.898 tagged_above=-5 required=5 tests=BAYES_00=-1.9, FREEMAIL_FROM=0.001, RFC_ABUSE_POST=0.001 X-Spam-Level: X-Archive-Number: 201106/69 X-Sequence-Number: 6843 On Mon, Jun 13, 2011 at 2:25 PM, Rafael Martinez wrote: > On Mon, 2011-06-13 at 13:24 -0400, Robert Haas wrote: >> On Fri, Jun 3, 2011 at 3:42 AM, Rafael Martinez >> > >> > You can see the graph with the generation of WAL files + some extra >> > information for this test here: http://folk.uio.no/rafael/total_wal/ >> > >> > What do you think? Shouldn't we update the documentation with some >> > information about this? >> >> Perhaps, but we'd have to think of something intelligent to say about >> it first. =A0We can't remove the old WAL files until we successfully >> checkpoint, and so I think if checkpoints are taking a very long to >> complete or failing altogether, there's actually no upper bound. =A0I >> don't think we have any kind of "hard stop" where, if no log space is >> available, we just refuse to process write transactions - such a thing >> would seem to be rather dangerous. >> > > Well, a good start will be to try to identify or describe the situations > where checkpoints can take very long to complete or fail altogether. > > I have the first one: Creating a large GIN index on a tsvector column. I > don't know why, maybe somebody who knows postgres internals can explain > why a creation of an index can create this situation. I think we're discussing this on the wrong list. It sounds to me like you have a performance or configuration problem (which likely has nothing to do with GIN indexes specifically) that you haven't fully diagnosed or understood (and I don't understand it either, at least not based on the information so far provided) and because that problem is manifesting itself as an excess of WAL files, you're homing in on this part of the documentation. And it may very well be that we need some better documentation here, because I too have seen a few systems lately with quite a lot of WAL files floating around for no immediately obvious reason, but we can't document what is going on until we understand it. If you're interested in troubleshooting this further, I think you should post to pgsql-performance and try to get some help understanding what is happening. If we get to the point where we have a clear explanation for what is occurring, then we can work out where and how to document it. --=20 Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company