public inbox for [email protected]  
help / color / mirror / Atom feed
From: PG Doc comments form <[email protected]>
To: [email protected]
Cc: [email protected]
Subject: Add NOTICE about non-NFC-characters and clues for solution
Date: Fri, 16 Nov 2018 10:54:25 +0000
Message-ID: <[email protected]> (raw)

The following documentation comment has been logged on the website:

Page: https://www.postgresql.org/docs/11/unaccent.html
Description:

Seems a bug, because the `select unaccent('Iglésias')` result in accented
"iglésias" again... It is correct because length('Iglésias') is 9 instead
8. 

The problem is not rare as you can check by pageviews of
https://stackoverflow.com/questions/24863716  
The solution is to feed database with good UTF8 (NFC characteres).  
 
**SUGGESTION**:  add a notice for reders, about the aparent bug with non-NFC
input, showing examples and clues about solutions.

REF: https://en.wikipedia.org/wiki/Unicode_equivalence#Example


reply

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Reply to all the recipients using the --to and --cc options:
  reply via email

  To: [email protected]
  Cc: [email protected], [email protected], [email protected]
  Subject: Re: Add NOTICE about non-NFC-characters and clues for solution
  In-Reply-To: <[email protected]>

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox