X-Original-To: pgsql-www-postgresql.org@localhost.postgresql.org Received: from localhost (unknown [200.46.204.144]) by svr1.postgresql.org (Postfix) with ESMTP id EDE833A46E4 for ; Thu, 4 Nov 2004 13:05:23 +0000 (GMT) Received: from svr1.postgresql.org ([200.46.204.71]) by localhost (av.hub.org [200.46.204.144]) (amavisd-new, port 10024) with ESMTP id 45584-06 for ; Thu, 4 Nov 2004 13:05:15 +0000 (GMT) Received: from imap.cs.msu.su (imap.cs.msu.su [158.250.10.39]) by svr1.postgresql.org (Postfix) with ESMTP id 2C6D93A4666 for ; Thu, 4 Nov 2004 13:05:17 +0000 (GMT) Received: from [10.3.34.136] (pc724-lin.cmc.msu.ru [10.3.34.136]) by imap.cs.msu.su (8.12.11/8.12.11) with ESMTP id iA4D5GWv004562; Thu, 4 Nov 2004 16:05:16 +0300 (MSK) (envelope-from borz_off@cs.msu.su) Message-ID: <418A2869.5090003@cs.msu.su> Date: Thu, 04 Nov 2004 16:02:33 +0300 From: Alexey Borzov User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.7.3) Gecko/20040910 X-Accept-Language: ru, en-us, en MIME-Version: 1.0 To: Dave Page Cc: pgsql-www@postgresql.org Subject: Re: Mirror.php performance References: <418A07C4.2060103@cs.msu.su> In-Reply-To: <418A07C4.2060103@cs.msu.su> Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Scanned: ClamAV 0.80/572/Wed Nov 3 13:48:18 2004 clamav-milter version 0.80j on imap.cs.msu.su X-Virus-Status: Clean X-Virus-Scanned: by amavisd-new at hub.org X-Spam-Status: No, hits=0.0 tagged_above=0.0 required=5.0 tests= X-Spam-Level: X-Archive-Number: 200411/95 X-Sequence-Number: 5826 Hi, Alexey Borzov wrote: >> Nov 04 09:16:04 mirror [info] Mirroring finished. 1027 page(s) saved, >> 1346 second(s) spent >> >> It appears to have saved everything in the root directory afaict, and >> the 7.4 static docs, but nothing else. >> >> Any ideas? > > Ouch. It did the same for me, will look into this: seems as if some > links are dropped / not followed. Fixed. Turned out the regexes to extract links from pages were broken and some of the links (including the main menu, unfortunately) were thus not crawled.