[Slackbuilds-users] Copying PDF-1.7 text using -14.2

Jude DaShiell jdashiel at panix.com
Mon Sep 14 17:53:25 UTC 2020


Although Adobe provides accessibility options that can be enabled for PDF
files, most PDF creators do not use them.  When creators do use the
accessibility options, printing and converting the files usually becomes
easier, except when the PDF file contains nothing but images.
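
As a rough illustration of why an image-only PDF yields no text, here is a
minimal Python sketch.  The two content streams are hand-written and
uncompressed, and the helper name shown_strings is made up for this example.
Real PDFs normally compress their streams and often use font encodings that
a simple pattern match like this will not decode.

```python
import re

# Two hand-written, uncompressed page content streams (illustrative only).
# Text page: BT/ET bracket a text object, Tf selects a font, Td moves the
# text position, and Tj paints a string at that position.
TEXT_PAGE = b"BT /F1 12 Tf 72 720 Td (Hello) Tj 40 0 Td (world) Tj ET"

# Scanned page: the only thing painted is an image XObject (Do operator).
IMAGE_PAGE = b"q 612 0 0 792 0 0 cm /Im0 Do Q"


def shown_strings(stream):
    """Return the literal strings painted by Tj operators.

    Assumption: strings contain no escapes or nested parentheses, and
    TJ arrays are not handled. This is a sketch, not a PDF parser.
    """
    matches = re.findall(rb"\(([^()\\]*)\)\s*Tj", stream)
    return [m.decode("latin-1") for m in matches]


if __name__ == "__main__":
    print(shown_strings(TEXT_PAGE))
    print(shown_strings(IMAGE_PAGE))
```

The text page gives ['Hello', 'world'] and the image page gives an empty
list.  Note that the space between the two words is not stored anywhere; it
exists only as a position offset, so an extractor has to guess at it.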

On Mon, 14 Sep 2020, Richard Ellis via SlackBuilds-users wrote:

> Date: Mon, 14 Sep 2020 13:15:18
> From: Richard Ellis via SlackBuilds-users <slackbuilds-users at slackbuilds.org>
> To: slackbuilds-users at slackbuilds.org
> Cc: Richard Ellis <rellis at dp100.com>
> Subject: Re: [Slackbuilds-users] Copying PDF-1.7 text using -14.2
>
> On Mon, Sep 14, 2020 at 09:57:13AM -0700, Rich Shepard wrote:
> >On Mon, 14 Sep 2020, Alexander Verbovetsky wrote:
> >>Maybe there is no text inside, just picture?  PDF is a container, not a
> >>format.
> >
> >Scientific journal articles are primarily text with occasional plots or other
> >images.  And I believe that PDF stands for "Portable Document Format."
>
> The Adobe name does expand to those words, but the word "Document" in the name
> bears no resemblance to how the data inside the PDF document produces a visual
> output.
>
> The internal structure of a PDF is better described as "electronic paper"
> than as a "document".  PDF, internally, is simply a way of specifying
> how to physically position "visual things" on a virtual sheet of paper, and
> whether you have any luck extracting "text" later depends very much upon
> how the creating software generated the internal PDF data.
