public inbox for [email protected]  
help / color / mirror / Atom feed
From: Joshua D. Drake <[email protected]>
To: PostgreSQL WWW <[email protected]>
Subject: A counter productive conversation about search.
Date: Mon, 28 Aug 2006 20:12:28 -0700
Message-ID: <[email protected]> (raw)

Hello,

Now that I have effectively slapped myself silly by being rude to Tom 
about search. Let me bring up some points about search and see if there 
is a way to resolve them.

The problem:

Search really isn't that good. Tom has good results with it, but I am 
guessing that because he is looking for specific things, likely just in 
archives as I doubt he often searches the documentation ;).

A quick search on google:

site:archives.postgresql.org index bloat

archives.postgresql.org/pgsql-performance/2005-04/msg00617.php
archives.postgresql.org/pgsql-performance/2005-04/msg00594.php
archives.postgresql.org/pgsql-performance/2005-04/msg00608.php

archives.postgresql.org:

http://archives.postgresql.org/pgsql-performance/2005-04/msg00575.php
http://archives.postgresql.org/pgsql-general/2004-12/msg00288.php
http://archives.postgresql.org/pgsql-general/2005-07/msg00186.php

site:www.postgresql.org create index
www.postgresql.org/docs/7.4/static/sql-createindex.html
www.postgresql.org/docs/8.1/static/sql-createindex.html
www.postgresql.org/files/documentation/books/aw_pgsql/node216.html

search.postgresql.org:
http://www.postgresql.org/files/documentation/books/aw_pgsql/node216.html
http://www.postgresql.org/files/documentation/books/pghandbuch/html/sql-createindex.html
http://developer.postgresql.org/~petere/past-events/lsm2003-slides/foil20.html

The first search is "reasonable" between the two, although it does not 
appear to correctly follow the thread path.

The second search to me is completely wrong. CREATE INDEX should always 
return the current documentation first. I can forgive google for showing 
7.4 first because it has been around longer and yet is still widely in use.

I have on multiple occasions brought up the idea of another search 
engine. I wrote the pgsql.ru guys and asked if they would share their 
code. To their benefit they said they would be willing but didn't have 
the time to install it for us. I told them I would be happy to muscle 
through it if they would just answer some emails. I never heard back.

Other options include lucene, and rolling our own.

Rolling our own really wouldn't be that hard "if" we can create a 
reasonably smart web page grabber. We have all the tools (tsearch2 and 
pg_pgtrm) to easily do the searches.

So is anyone up for helping develop a page grabber?

Sincerely,

Joshua D. Drake









-- 

    === The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
    Providing the most comprehensive  PostgreSQL solutions since 1997
              http://www.commandprompt.com/





view thread (15+ messages)  latest in thread

reply

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Reply to all the recipients using the --to and --cc options:
  reply via email

  To: [email protected]
  Cc: [email protected]
  Subject: Re: A counter productive conversation about search.
  In-Reply-To: <[email protected]>

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox