X-Original-To: pgsql-www-postgresql.org@localhost.postgresql.org Received: from localhost (mx1.hub.org [200.46.208.251]) by postgresql.org (Postfix) with ESMTP id B774B9FA42B; Fri, 12 May 2006 19:37:29 -0300 (ADT) Received: from postgresql.org ([200.46.204.71]) by localhost (mx1.hub.org [200.46.208.251]) (amavisd-new, port 10024) with ESMTP id 64639-05; Fri, 12 May 2006 19:37:25 -0300 (ADT) X-Greylist: from auto-whitelisted by SQLgrey- X-Greylist: from auto-whitelisted by SQLgrey- Received: from anchor-post-36.mail.demon.net (anchor-post-36.mail.demon.net [194.217.242.86]) by postgresql.org (Postfix) with ESMTP id ED7169FA15C; Fri, 12 May 2006 19:37:23 -0300 (ADT) Received: from mailgate.vale-housing.co.uk ([194.217.48.34] helo=vale-housing.co.uk) by anchor-post-36.mail.demon.net with esmtp (Exim 4.42) id 1FegGA-000Adh-LC; Fri, 12 May 2006 22:37:22 +0000 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable X-MimeOLE: Produced By Microsoft Exchange V6.5 Subject: Re: developer.pgadmin.org/nagios.pgadmin.org - Diskfailure Date: Fri, 12 May 2006 23:37:21 +0100 Message-ID: X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: [pgsql-www] developer.pgadmin.org/nagios.pgadmin.org - Diskfailure Thread-Index: AcZ2Eh5Ad9rqI96dQoaxCu/p2WXGagAATgiA From: "Dave Page" To: "Jeff MacDonald" Cc: , "PostgreSQL WWW" X-Virus-Scanned: Maia Mailguard 1.0.1 X-Archive-Number: 200605/52 X-Sequence-Number: 10015 =20 > -----Original Message----- > From: Jeff MacDonald [mailto:jam@zoidtechnologies.com]=20 > Sent: 12 May 2006 23:19 > To: Dave Page > Cc: Jeff MacDonald > Subject: Re: [pgsql-www]=20 > developer.pgadmin.org/nagios.pgadmin.org - Diskfailure >=20 > On Fri, 2006-05-12 at 22:47 +0100, Dave Page wrote: > > The machine hosting the developer.pgadmin.org and=20 > nagios.pgadmin.org=20 > > vservers is currently having serious filesystem problems, which are=20 > > causing disk intensive operations (like rsync, tar) to segfault for=20 > > currently unknown reasons. >=20 > do a memory test, swap as needed, see if that solves the=20 > problem..=20 I'll try just replacing it - I have some unopened sticks for that mobo. FWIW, a reboot with a forced fsck found no errors at all and the box is currently working OK, but I have now found errors similar to the following: May 12 21:11:29 barbas rsyncd[32134]: rsync: writefd_unbuffered failed to write 4 bytes: phase "send_file_entry" [sender]: Broken pipe (32) May 12 21:11:29 barbas rsyncd[32134]: rsync error: error in rsync protocol data stream (code 12) at io.c(1126) [sender] May 12 22:13:52 barbas kernel: kernel BUG at page_alloc.c:142! May 12 22:13:52 barbas kernel: invalid operand: 0000 May 12 22:13:52 barbas kernel: CPU: 1 May 12 22:13:52 barbas kernel: EIP: 0010:[] Not tainted May 12 22:13:52 barbas kernel: EFLAGS: 00010286 May 12 22:13:52 barbas kernel: eax: d9e18100 ebx: c262c140 ecx: c262c140 edx: 00000000 May 12 22:13:52 barbas kernel: esi: c262c140 edi: 00000000 ebp: 00000000 esp: d50d5edc May 12 22:13:52 barbas kernel: ds: 0018 es: 0018 ss: 0018 May 12 22:13:52 barbas kernel: Process rsync (pid: 32141, stackpage=3Dd50d5000) May 12 22:13:52 barbas kernel: Stack: d50d5ee8 c0133ab0 00001000 c262c140 e3a59d44 00006000 c01348e9 00000000 May 12 22:13:52 barbas kernel: 00000000 00001000 c262c140 e3a59d44 00000000 c013423d d50d5f7c c262c140 May 12 22:13:52 barbas kernel: 00000000 00001000 00001000 00000001 00000000 0000013b e3a59c80 c01347f0 May 12 22:13:52 barbas kernel: Call Trace: [] [] [] [] [] May 12 22:13:52 barbas kernel: [] [] [] [] May 12 22:13:52 barbas kernel: May 12 22:13:52 barbas kernel: Code: 0f 0b 8e 00 6b ba 37 c0 e9 ba fd ff ff 8b 69 60 85 ed 0f 85 Could well be a duff stick I guess, given where it died. > the quicker solution may be to just put the backup=20 > machine into production rather than running exhaustive memory tests. Yes, well it was going into it anyway to get it out of the current 3U chassis and into a 1U one with full OOB management. The only problem is that I'm still awaiting delivery of a cable for the external tape drive in the rack so I can only do rsync/scp backups until that arrives. Regards, Dave.