From: Oleg Bartunov <[email protected]>
To: Tom Lane <[email protected]>
Cc: Bruce Momjian <[email protected]>
Cc: Laurenz Albe <[email protected]>
Cc: James Addison <[email protected]>
Cc: PostgreSQL WWW <[email protected]>
Subject: Re: Mailing list search engine: surprising missing results?
Date: Tue, 25 Jan 2022 14:04:09 +0300
Message-ID: <CAF4Au4yttKJ1KAP-cO+HMLQ2_66vmx0dLTBUbE4W8Aa64foafg@mail.gmail.com>
In-Reply-To: <[email protected]>
References: <CALDQ5NxzgeXHRCD4dS_6qz+nn01ivi3i1ZEtD2DmC779i0=iSQ@mail.gmail.com>
<[email protected]>
<[email protected]>
<[email protected]>
On Mon, Jan 24, 2022 at 11:47 PM Tom Lane <[email protected]> wrote:
> Bruce Momjian <[email protected]> writes:
> > On Mon, Jan 24, 2022 at 08:27:41AM +0100, Laurenz Albe wrote:
> >> The reason is that the 'moore' in 'boyer-moore' is stemmed, since it
> >> is at the end of the word, while the 'moore' in 'Boyer-Moore-Horspool'
> >> isn't:
>
> > Wow, he showed me this problem earlier but I never suspected it was a
> > stemming issue because I never considered proper nouns could be
> > stem-adjusted, but it is obvious they can.
>
> I wonder if we should change that so that components of a compound
> word are consistently stemmed the same way.
>
Something like this
SELECT to_tsvector('english', 'Boyer-Moore-Horspool');
to_tsvector
----------------------------------------------------------
'boyer':2 'boyer-moore-horspool':1 'boyer-moore':1 'moore-horspool':1
'horspool':4 'moor':3
(1 row)
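
The listing above can be read as "every contiguous run of two or more
components, plus each component stemmed on its own". What follows is only a
rough sketch of that reading, not PostgreSQL's parser code. The names
compound_lexemes and toy_stem are hypothetical, toy_stem is a stand-in for the
real English stemmer (it only drops a trailing "e"), and leaving the
sub-compounds unstemmed simply mirrors the listing rather than settling how
they should be treated.

```python
def toy_stem(word):
    """Hypothetical stand-in for the English stemmer: drop one trailing 'e'."""
    return word[:-1] if word.endswith("e") else word


def compound_lexemes(word, stem):
    """Map each lexeme of a hyphenated word to a position, as in the listing.

    Every contiguous run of two or more components is emitted at position 1
    (left unstemmed here, mirroring the listing); each single component is
    stemmed with the same function and gets positions 2, 3, ...
    """
    parts = word.lower().split("-")
    lexemes = {}
    # Sub-compounds: all contiguous runs of at least two components.
    for start in range(len(parts)):
        for end in range(start + 2, len(parts) + 1):
            lexemes["-".join(parts[start:end])] = 1
    # Single components, always passed through the same stemmer.
    for offset, part in enumerate(parts):
        lexemes[stem(part)] = offset + 2
    return lexemes


if __name__ == "__main__":
    for lexeme, position in sorted(compound_lexemes("Boyer-Moore-Horspool", toy_stem).items()):
        print(f"'{lexeme}':{position}")
```

With this reading, 'Boyer-Moore-Horspool' and 'boyer-moore' share the lexemes
'boyer-moore', 'boyer' and 'moor', so a search for the shorter compound would
have something to match against in the longer one.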
>
> regards, tom lane
>
>
>
--
Postgres Professional: http://www.postgrespro.com
The Russian Postgres Company