X-Original-To: pgsql-www-postgresql.org@localhost.postgresql.org Received: from localhost (unknown [200.46.204.2]) by svr1.postgresql.org (Postfix) with ESMTP id 2DD38D1D25D for ; Sat, 31 Jan 2004 06:18:20 +0000 (GMT) Received: from svr1.postgresql.org ([200.46.204.71]) by localhost (neptune.hub.org [200.46.204.2]) (amavisd-new, port 10024) with ESMTP id 26296-08 for ; Sat, 31 Jan 2004 02:18:19 -0400 (AST) Received: from ganymede.hub.org (u46n208.hfx.eastlink.ca [24.222.46.208]) by svr1.postgresql.org (Postfix) with ESMTP id AAB92D1D1B2 for ; Sat, 31 Jan 2004 02:18:18 -0400 (AST) Received: by ganymede.hub.org (Postfix, from userid 1000) id B2A8134288; Sat, 31 Jan 2004 02:14:34 -0400 (AST) Received: from localhost (localhost [127.0.0.1]) by ganymede.hub.org (Postfix) with ESMTP id AFEAC3403F; Sat, 31 Jan 2004 02:14:34 -0400 (AST) Date: Sat, 31 Jan 2004 02:14:34 -0400 (AST) From: "Marc G. Fournier" X-X-Sender: scrappy@ganymede.hub.org To: Josh Berkus Cc: Oleg Bartunov , Dave Page , pgsql-www@postgresql.org Subject: Re: Postgresql.org search engine. In-Reply-To: <200401302201.15585.josh@agliodbs.com> Message-ID: <20040131020112.E29077@ganymede.hub.org> References: <03AF4E498C591348A42FC93DEA9661B87206B3@mail.vale-housing.co.uk> <20040131014838.U29077@ganymede.hub.org> <200401302201.15585.josh@agliodbs.com> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Scanned: by amavisd-new at postgresql.org X-Archive-Number: 200401/321 X-Sequence-Number: 3560 On Fri, 30 Jan 2004, Josh Berkus wrote: > Guys, > > > Do you have software to do this, including all the inter-posting > > references and followups? Or do you propose we write this all from > > scratch? > > Robert Bernier apparently wrote something to break up mail for inclusion in a > database, and should be able to help in a couple months. Josh Drake is also > willing to help, and has already done a prototype wiithout header searching. Dumping mail into a database isn't that hard to do ... there are several projects on the 'Net right now doing that, including one that connects a POP3 daemon into the database to download the mail ... in fact, from what I recall of fts.postgresql.org, isn't that what Oleg/Teodor's stuff does? I'm kinda curious here ... exactly what problem are we trying to solve here? Me, I'm just trying to clean up the archives so that when someone gets their search results, they don't all show the same 'text', which I've already accomplished ... Dave is working on improving the speed of the searches, which he has accomplished with ASPseek ... If I can figure out how to get the Date: of the posting into the Last-Modified field (I know *how* it should work, but last time I tried it ended up generating a whack of errors), then that should satisfy Oleg's beef ... Oleg, one question ... what do you recommend setting max-age to for Cache-control? Right now, I have it set to 30 days ... too long? not long enough? ---- Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664