public inbox for [email protected]
help / color / mirror / Atom feedHow to read an external pdf file from postgres?
4+ messages / 4 participants
[nested] [flat]
* How to read an external pdf file from postgres?
@ 2022-01-12 11:16 Amine Tengilimoglu <[email protected]>
0 siblings, 3 replies; 4+ messages in thread
From: Amine Tengilimoglu @ 2022-01-12 11:16 UTC (permalink / raw)
To: pgsql-general
Hi;
I want to read an external pdf file from postgres. pdf file will
exist on the disk. postgres only know the disk full path as metadata. Is
there any software or extension that can be used for this? Or do we have to
develop software for it? Or what is the best approach for this? I'd
appreciate it if anyone with experience could make suggestions.
Thanks.
^ permalink raw reply [nested|flat] 4+ messages in thread
* Re: How to read an external pdf file from postgres?
@ 2022-01-12 11:25 Peter Eisentraut <[email protected]>
parent: Amine Tengilimoglu <[email protected]>
2 siblings, 0 replies; 4+ messages in thread
From: Peter Eisentraut @ 2022-01-12 11:25 UTC (permalink / raw)
To: Amine Tengilimoglu <[email protected]>; pgsql-general
On 12.01.22 12:16, Amine Tengilimoglu wrote:
> I want to read an external pdf file from postgres. pdf file will
> exist on the disk. postgres only know the disk full path as metadata. Is
> there any software or extension that can be used for this? Or do we have
> to develop software for it? Or what is the best approach for this? I'd
> appreciate it if anyone with experience could make suggestions.
You could write a function in PL/Perl or PL/Python to open and read the
file and process the PDF data, using some third-party module that surely
exists somewhere.
^ permalink raw reply [nested|flat] 4+ messages in thread
* Re: How to read an external pdf file from postgres?
@ 2022-01-12 14:32 Дмитрий Иванов <[email protected]>
parent: Amine Tengilimoglu <[email protected]>
2 siblings, 0 replies; 4+ messages in thread
From: Дмитрий Иванов @ 2022-01-12 14:32 UTC (permalink / raw)
To: Amine Tengilimoglu <[email protected]>; +Cc: pgsql-general
What are you going to do with the data?
If you want to analyze it in some way, I can't think of a better option
with a Python function. Or do you just want to transfer them? There are
options here too, but in this case I like Python better.
--
Regards, Dmitry!
ср, 12 янв. 2022 г. в 16:16, Amine Tengilimoglu <[email protected]
>:
> Hi;
>
> I want to read an external pdf file from postgres. pdf file will
> exist on the disk. postgres only know the disk full path as metadata. Is
> there any software or extension that can be used for this? Or do we have to
> develop software for it? Or what is the best approach for this? I'd
> appreciate it if anyone with experience could make suggestions.
>
> Thanks.
>
^ permalink raw reply [nested|flat] 4+ messages in thread
* Re: How to read an external pdf file from postgres?
@ 2022-01-12 14:35 Ian Lawrence Barwick <[email protected]>
parent: Amine Tengilimoglu <[email protected]>
2 siblings, 0 replies; 4+ messages in thread
From: Ian Lawrence Barwick @ 2022-01-12 14:35 UTC (permalink / raw)
To: Amine Tengilimoglu <[email protected]>; +Cc: pgsql-general
2022年1月12日(水) 20:16 Amine Tengilimoglu <[email protected]>:
>
> Hi;
>
> I want to read an external pdf file from postgres. pdf file will exist on the disk. postgres only know the disk full path as metadata. Is there any software or extension that can be used for this? Or do we have to develop software for it? Or what is the best approach for this? I'd appreciate it if anyone with experience could make suggestions.
By "read" do you mean "open the file and meaningful extract data from it"? If
so, speaking from prior experience, don't. And if you really have to, make sure
the source PDF is guaranteed to be in a well-defined, predictable format
enforceable by contract law and/or people with sharp pointy sticks. I have
successfully suppressed the memories of whatever it is I once had to do with
reading data from PDFs, but though the data was eventually imported into
PostgreSQL, there was a lot of mangling probably involving a Perl module (other
languages are probably available) before it got anywhere near the database.
Reagrds
Ian Barwick
--
EnterpriseDB: https://www.enterprisedb.com
^ permalink raw reply [nested|flat] 4+ messages in thread
end of thread, other threads:[~2022-01-12 14:35 UTC | newest]
Thread overview: 4+ messages (download: mbox mbox.gz follow: Atom feed)
-- links below jump to the message on this page --
2022-01-12 11:16 How to read an external pdf file from postgres? Amine Tengilimoglu <[email protected]>
2022-01-12 11:25 ` Peter Eisentraut <[email protected]>
2022-01-12 14:32 ` Дмитрий Иванов <[email protected]>
2022-01-12 14:35 ` Ian Lawrence Barwick <[email protected]>
This inbox is served by agora; see mirroring instructions
for how to clone and mirror all data and code used for this inbox