X-Original-To: pgsql-www-postgresql.org@localhost.postgresql.org Received: from localhost (unknown [200.46.204.2]) by svr1.postgresql.org (Postfix) with ESMTP id 2208BD1D1D6; Sat, 31 Jan 2004 12:45:47 +0000 (GMT) Received: from svr1.postgresql.org ([200.46.204.71]) by localhost (neptune.hub.org [200.46.204.2]) (amavisd-new, port 10024) with ESMTP id 14554-07; Sat, 31 Jan 2004 08:45:47 -0400 (AST) Received: from ra.sai.msu.su (ra.sai.msu.su [158.250.29.2]) by svr1.postgresql.org (Postfix) with ESMTP id 08645D1D141; Sat, 31 Jan 2004 08:45:37 -0400 (AST) Received: from ra (ra [158.250.29.2]) by ra.sai.msu.su (8.12.10/8.12.10) with ESMTP id i0VCjTYJ001210; Sat, 31 Jan 2004 15:45:29 +0300 (MSK) Date: Sat, 31 Jan 2004 15:45:28 +0300 (MSK) From: Oleg Bartunov X-X-Sender: megera@ra.sai.msu.su To: "Marc G. Fournier" Cc: Josh Berkus , Dave Page , pgsql-www@postgresql.org Subject: Re: Postgresql.org search engine. In-Reply-To: <20040131020112.E29077@ganymede.hub.org> Message-ID: References: <03AF4E498C591348A42FC93DEA9661B87206B3@mail.vale-housing.co.uk> <20040131014838.U29077@ganymede.hub.org> <200401302201.15585.josh@agliodbs.com> <20040131020112.E29077@ganymede.hub.org> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Virus-Scanned: by amavisd-new at postgresql.org X-Archive-Number: 200401/326 X-Sequence-Number: 3565 On Sat, 31 Jan 2004, Marc G. Fournier wrote: > On Fri, 30 Jan 2004, Josh Berkus wrote: > > > Guys, > > > > > Do you have software to do this, including all the inter-posting > > > references and followups? Or do you propose we write this all from > > > scratch? > > > > Robert Bernier apparently wrote something to break up mail for inclusion in a > > database, and should be able to help in a couple months. Josh Drake is also > > willing to help, and has already done a prototype wiithout header searching. > > Dumping mail into a database isn't that hard to do ... there are several > projects on the 'Net right now doing that, including one that connects a > POP3 daemon into the database to download the mail ... in fact, from what > I recall of fts.postgresql.org, isn't that what Oleg/Teodor's stuff does? > > I'm kinda curious here ... exactly what problem are we trying to solve > here? > > Me, I'm just trying to clean up the archives so that when someone gets > their search results, they don't all show the same 'text', which I've > already accomplished ... Dave is working on improving the speed of the > searches, which he has accomplished with ASPseek ... > > If I can figure out how to get the Date: of the posting into the > Last-Modified field (I know *how* it should work, but last time I tried it > ended up generating a whack of errors), then that should satisfy Oleg's > beef ... > > Oleg, one question ... what do you recommend setting max-age to for > Cache-control? Right now, I have it set to 30 days ... too long? not > long enough? in my experience Cache-control is not effective, because it's HTTP/1.1 feature and a lot of users come through proxy which still doesn't support HTTP/1.1 Last-Modified header is the most universal way. Check http://www.mnot.net/cache_docs/#CACHE-CONTROL > > ---- > Marc G. Fournier Hub.Org Networking Services (http://www.hub.org) > Email: scrappy@hub.org Yahoo!: yscrappy ICQ: 7615664 > Regards, Oleg _____________________________________________________________ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83