Received: from malur.postgresql.org ([217.196.149.56]) by arkaria.postgresql.org with esmtp (Exim 4.80) (envelope-from ) id 1XbXv7-0002RO-JQ for pgsql-www@arkaria.postgresql.org; Tue, 07 Oct 2014 16:47:30 +0000 Received: from localhost ([127.0.0.1] helo=postgresql.org) by malur.postgresql.org with smtp (Exim 4.80) (envelope-from ) id 1XbXv6-0006Yk-Ff for pgsql-www@arkaria.postgresql.org; Tue, 07 Oct 2014 16:47:28 +0000 Received: from makus.postgresql.org ([2001:4800:1501:1::229]) by malur.postgresql.org with esmtps (TLS1.2:DHE_RSA_AES_256_CBC_SHA256:256) (Exim 4.80) (envelope-from ) id 1XbXv5-0006Ye-PG for pgsql-www@postgresql.org; Tue, 07 Oct 2014 16:47:28 +0000 Received: from mail-ig0-x234.google.com ([2607:f8b0:4001:c05::234]) by makus.postgresql.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:256) (Exim 4.80) (envelope-from ) id 1XbXv2-0006Y5-5l for pgsql-www@postgresql.org; Tue, 07 Oct 2014 16:47:26 +0000 Received: by mail-ig0-f180.google.com with SMTP id uq10so5067239igb.13 for ; Tue, 07 Oct 2014 09:47:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=juffo.org; s=google; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-type; bh=0glnIUbZ3q/xfYYdrnyEjsSBv0PNG95HpTHeycbYQWA=; b=J94Xj+Wl8s/69I+keB2MgAJL+GUv9U52BUNPqXUbhW6w8snbv0TdoDJeHmiXk7mPh6 cYt+n6I6SEIEdVDH+g9FiEVAvLTBVrQJtWyUSOjviJVoPjcwu6NS5Qo4T4jqecHtVMNc HBCCaFxHQUzxA1YhoYFijkBiIm6ugrxnJWGPA= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-type; bh=0glnIUbZ3q/xfYYdrnyEjsSBv0PNG95HpTHeycbYQWA=; b=lSddgRd5mbWxhE/CPYA6gpuy3b9+OMNGcyYa+Nih/HomTRFl34y3ji+oO8wu3JM1FC Bj7UbVP/2xtQAqzdMup9y0jl7oPWAmXfv8I1Rh7Vkdqj11V9+FIKzLjgCRhKemAwwtWa TJ1kNyMgFw71piEVhux3VlKNc0mb+yrPLypZJr5MF301JfJM0W6ZdnhavnXVFjOCqDpw 9GBN6j+NAvcCh+E/t2Lz90BcHvP2S3D3bKJUJZEkHVdW+ANt/LVo00nH5by+vdjrc8Rx f/NSGPEyawOyjvY7pzNKpXBNW9gdBQcPcHdkiog0TF9BDeG35AqOmzkSd+6utSJcoW1c jchQ== X-Gm-Message-State: ALoCoQntGcSMOt5Mpr4s+7VRUD4Gu0/ubqvioKb0d+g3i3aqwhde1O8q2TOMrEReM5uvNZi49yTC X-Received: by 10.50.66.170 with SMTP id g10mr7436823igt.10.1412700442887; Tue, 07 Oct 2014 09:47:22 -0700 (PDT) MIME-Version: 1.0 Received: by 10.107.14.75 with HTTP; Tue, 7 Oct 2014 09:46:52 -0700 (PDT) In-Reply-To: <20140827150012.GM14956@momjian.us> References: <04E4F5A6-6526-4DDC-A9E5-2991E3B2ED83@cantoute.com> <20140827150012.GM14956@momjian.us> From: Marti Raudsepp Date: Tue, 7 Oct 2014 19:46:52 +0300 Message-ID: Subject: Re: [DOCS] suggestion about SEO on www.postgresql.org/docs To: Bruce Momjian Cc: Antony , PostgreSQL www , Magnus Hagander Content-Type: text/plain; charset=UTF-8 X-Pg-Spam-Score: -2.0 (--) List-Archive: List-Help: List-ID: List-Owner: List-Post: List-Subscribe: List-Unsubscribe: X-Mailing-List: pgsql-www Precedence: bulk Sender: pgsql-www-owner@postgresql.org On Wed, Aug 27, 2014 at 6:00 PM, Bruce Momjian wrote: > Are we using the rel="canonical" suggestion in our web docs now? Apparently not. I looked into this and I'm not 100% certain we should do it. But if we decide so, I'm willing to code up a patch. https://tools.ietf.org/html/rfc6596 states: ==== 8< ==== The target (canonical) IRI MUST identify content that is either duplicative or a superset of the content at the context (referring) IRI. Authors who declare the canonical link relation ought to anticipate that applications such as search engines can: o Index content only from the target IRI (i.e., content from the context IRIs will be likely disregarded as duplicative). o Consolidate IRI properties, such as link popularity, to the target IRI. o Display the target IRI as the representative IRI. ==== 8< ==== We certainly want property 2, but property 1 suggests that older versions of docs are dropped from search engines altogether. It's not clear whether they are that strict in reality -- does anyone know? This would not be a problem if we also retained notes about earlier supported versions in the current version, which would make our latest version a "superset" of earlier ones. But I believe we very rarely remove material from docs, so I believe the upsides outweigh the cons. ---- Another question is whether we should make "interactive" point to "static" -- again, actually the interactive one is the superset, since static doesn't include user comments. But do we care about search engines indexing comments anyway? They're not present in sitemap.xml either and I've never landed on the interactive version when coming from Google. My proposal: 1. Doc pages that are *older* than current, and exist in the current version have canonical URL /docs/current/static/pagename.html 2. If it doesn't exist in current, we link to the last version that includes this page, like /docs/8.4/static/install-win32.html 3. Newer versions (devel/beta) should perhaps point to itself and not /current/? This would make new features googleable for testers. The doc links use rel=nofollow when linking to them, so they're already ranked lower by search engines. It appears there are already lots of places that hardcode the http://www.postgresql.org/ URL, so it makes sense to use absolute URLs for canonical too? Did I miss anything? Regards, Marti -- Sent via pgsql-www mailing list (pgsql-www@postgresql.org) To make changes to your subscription: http://www.postgresql.org/mailpref/pgsql-www