From: Florents Tselai <[email protected]>
To: Ian Lawrence Barwick <[email protected]>
Cc: Amine Tengilimoglu <[email protected]>
Cc: [email protected] >> PG-General Mailing List <[email protected]>
Subject: Re: How to read an external pdf file from postgres?
Date: Mon, 10 Feb 2025 15:01:48 +0200
Message-ID: <[email protected]> (raw)
In-Reply-To: <CAB8KJ=hBm3QwtOq9_gM+=a-2Kx2crz3yh+vxsckEEE+EdsjdwQ@mail.gmail.com>
References: <CADTdw-x=c_=OFyh3DoCkaygGAkChi36O=acZHQ2-F_DgQGJGpQ@mail.gmail.com>
<CAB8KJ=hBm3QwtOq9_gM+=a-2Kx2crz3yh+vxsckEEE+EdsjdwQ@mail.gmail.com>
> On 12 Jan 2022, at 4:35 PM, Ian Lawrence Barwick <[email protected]> wrote:
>
> On Wed, 12 Jan 2022 at 20:16, Amine Tengilimoglu <[email protected]> wrote:
>>
>> Hi;
>>
>> I want to read an external PDF file from Postgres. The PDF file will exist on disk; Postgres only knows its full path on disk, stored as metadata. Is there any software or extension that can be used for this? Or do we have to develop software for it? What is the best approach? I'd appreciate it if anyone with experience could make suggestions.
>
> By "read" do you mean "open the file and meaningfully extract data from it"? If
> so, speaking from prior experience, don't. And if you really have to, make sure
> the source PDF is guaranteed to be in a well-defined, predictable format
> enforceable by contract law and/or people with sharp pointy sticks. I have
> successfully suppressed the memories of whatever it is I once had to do with
> reading data from PDFs, but though the data was eventually imported into
> PostgreSQL, there was a lot of mangling probably involving a Perl module (other
> languages are probably available) before it got anywhere near the database.
>
>
> Regards
>
> Ian Barwick
>
> --
> EnterpriseDB: https://www.enterprisedb.com
>
>
https://github.com/Florents-Tselai/pgpdf