Date: Mon, 24 Jan 2022 17:15:32 -0500
From: Bruce Momjian
To: Tom Lane
Cc: Laurenz Albe, James Addison, pgsql-www@postgresql.org
Subject: Re: Mailing list search engine: surprising missing results?
In-Reply-To: <2150096.1643057249@sss.pgh.pa.us>

On Mon, Jan 24, 2022 at 03:47:29PM -0500, Tom Lane wrote:
> Bruce Momjian writes:
> > On Mon, Jan 24, 2022 at 08:27:41AM +0100, Laurenz Albe wrote:
> >> The reason is that the 'moore' in 'boyer-moore' is stemmed, since it
> >> is at the end of the word, while the 'moore' in 'Boyer-Moore-Horspool'
> >> isn't:
> >
> > Wow, he showed me this problem earlier but I never suspected it was
> > a stemming issue because I never considered proper nouns could be
> > stem-adjusted, but it is obvious they can.
>
> I wonder if we should change that so that components of a compound
> word are consistently stemmed the same way.

I don't see the value in a change --- it might break the same number of
cases it fixes.

-- 
Bruce Momjian https://momjian.us
EDB https://enterprisedb.com

If only the physical world exists, free will is an illusion.