Date: Tue, 19 Oct 2004 07:35:44 +0400 (MSD)
From: Oleg Bartunov
To: John Hansen
Cc: "Joshua D. Drake", PostgreSQL www
Subject: Re: mailing list offer
In-Reply-To: <1098155495.15522.19.camel@localhost.localdomain>
References: <41746DC7.2060003@commandprompt.com> <1098155495.15522.19.camel@localhost.localdomain>

On Tue, 19 Oct 2004, John Hansen wrote:

>
> A fully integrated solution would be sweet. One that combines the
> features of the archives, the search engine, and the lists themselves.
>
> You should be aware tho, that iirc the current tally is closing in on
> 300.000 emails. Does anyone have any installations with that much data
> indexed by tsearch2, who would be able to give us some indicative
> reports on query time?
>
> Mnogosearch was scrapped due to it's speed (or lack thereof), and iirc,
> tsearch2 closely resembles the same type of mechanism for full text
> searches. Just something to think about.

In principle, www.pgsql.ru/db/mw (formerly fts.postgresql.org) was designed
to be an integrated solution, but time has passed and we have no spare time
to continue developing it (we only reinstalled fts.postgresql.org).

As for tsearch2 performance, it is slow on the full archive, but reasonably
fast when searching the last month (the default) or the last year of the
archive.

We have tested our idea using a read-only archive and an online news
archive, and it worked very fast. The main idea is to have a common
infrastructure (parsers, dictionaries, ranking) for an inverted index (the
read-only part) and a tsearch2 index for the changing part. A running
daemon (tsearchd) accepts text for indexing; all metadata is stored in
PostgreSQL, so searches can be constrained using metadata. Tsearchd is just
a working prototype and much work is still needed.

>
> Btw, With regards to the forums, I imagine they'll come back over the
> weekend...
>
>> What are the thoughts on this?
>>
>> Sincerely,
>>
>> Joshua D. Drake
>
> ... John
>
>
> ---------------------------(end of broadcast)---------------------------
> TIP 6: Have you searched our list archives?
>
> http://archives.postgresql.org
>

Regards,
Oleg
_____________________________________________________________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: oleg@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83