X-Original-To: pgsql-hackers-postgresql.org@localhost.postgresql.org Received: from localhost (av.hub.org [200.46.204.144]) by postgresql.org (Postfix) with ESMTP id 4D34F9DCB52 for ; Thu, 8 Dec 2005 18:44:39 -0400 (AST) Received: from postgresql.org ([200.46.204.71]) by localhost (av.hub.org [200.46.204.144]) (amavisd-new, port 10024) with ESMTP id 77592-07 for ; Thu, 8 Dec 2005 18:44:40 -0400 (AST) X-Greylist: from auto-whitelisted by SQLgrey- Received: from candle.pha.pa.us (candle.pha.pa.us [64.139.89.126]) by postgresql.org (Postfix) with ESMTP id 8FC329DCB78 for ; Thu, 8 Dec 2005 18:44:36 -0400 (AST) Received: (from pgman@localhost) by candle.pha.pa.us (8.11.6/8.11.6) id jB8MiYT02161; Thu, 8 Dec 2005 17:44:34 -0500 (EST) From: Bruce Momjian Message-Id: <200512082244.jB8MiYT02161@candle.pha.pa.us> Subject: Re: Upcoming PG re-releases In-Reply-To: To: Gavin Sherry Date: Thu, 8 Dec 2005 17:44:34 -0500 (EST) CC: Peter Eisentraut , pgsql-hackers@postgresql.org X-Mailer: ELM [version 2.4ME+ PL121 (25)] MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII X-Virus-Scanned: by amavisd-new at hub.org X-Spam-Status: No, score=0.009 required=5 tests=[AWL=0.009] X-Spam-Score: 0.009 X-Spam-Level: X-Archive-Number: 200512/459 X-Sequence-Number: 77307 Gavin Sherry wrote: > On Tue, 6 Dec 2005, Bruce Momjian wrote: > > > > > Exactly what does vim do that iconv does not? Fuzzy encoding sounds > > scary to me. > > > > Right. It actually makes assumptions about the source encoding. People who > care about their data need, unfortunately, to spend a bit of time on this > problem. I've been discussing the same issue on the slony1 mailing list, > because the issue can affect people's ability upgrade using slony1. > > http://gborg.postgresql.org/pipermail/slony1-general/2005-December/003430.html > > It would be good if had the script I suggest in the email: > > A script which identifies non-utf-8 characters and provides some > context, line numbers, etc, will greatly speed up the process of > remedying the situation. I think the best we can do is the "iconv -c with the diff" idea, which is already in the release notes. I suppose we could merge the iconv and diff into a single command, but I don't see a portable way to output the iconv output to stdout., /dev/stdin not being portable. -- Bruce Momjian | http://candle.pha.pa.us pgman@candle.pha.pa.us | (610) 359-1001 + If your life is a hard drive, | 13 Roberts Road + Christ can be your backup. | Newtown Square, Pennsylvania 19073