[Slackbuilds-users] Copying PDF-1.7 text using -14.2

Jude DaShiell jdashiel at panix.com
Mon Sep 14 17:55:48 UTC 2020


One way to gauge how likely it is that a PDF file can be printed or
converted is to run a full accessibility check on it first and, if
possible, fix any errors the report turns up.
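
Short of a full accessibility report, a rough first check is whether the
file contains any text-drawing operators at all, which speaks to the
"just a picture?" question raised in the thread below. The following is
only a minimal sketch under stated assumptions: the function name
has_text_operators is made up for illustration, it handles only
uncompressed or Flate-compressed content streams, and a positive result
does not guarantee the text can be copied out sensibly (fonts with custom
encodings can still produce garbage).

```python
import re
import zlib

# Body of each PDF stream object, i.e. the bytes between "stream" and "endstream".
STREAM_RE = re.compile(rb"stream\r?\n(.*?)endstream", re.DOTALL)
# Text-showing operators in a content stream: "(...) Tj" or "[...] TJ".
TEXT_OP_RE = re.compile(rb"\)\s*Tj|\]\s*TJ")


def has_text_operators(pdf_bytes: bytes) -> bool:
    """Heuristic: True if any stream appears to draw text (hypothetical helper)."""
    for match in STREAM_RE.finditer(pdf_bytes):
        data = match.group(1)
        try:
            # Assumption: compressed streams use the common FlateDecode filter.
            data = zlib.decompressobj().decompress(data)
        except zlib.error:
            pass  # not zlib data; scan the raw bytes instead
        # BT opens a text object; Tj/TJ actually show a string.
        if b"BT" in data and TEXT_OP_RE.search(data):
            return True
    return False
```

Usage would be along the lines of
has_text_operators(open("paper.pdf", "rb").read()), where paper.pdf is a
placeholder name. A False result suggests a scanned, image-only file that
would need OCR before any text could be copied.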

On Mon, 14 Sep 2020, Jude DaShiell wrote:

> Date: Mon, 14 Sep 2020 13:53:25
> From: Jude DaShiell <jdashiel at panix.com>
> Reply-To: SlackBuilds.org Users List <slackbuilds-users at slackbuilds.org>
> To: Richard Ellis via SlackBuilds-users <slackbuilds-users at slackbuilds.org>
> Subject: Re: [Slackbuilds-users] Copying PDF-1.7 text using -14.2
>
> Although Adobe has accessibility options that can be enabled for PDF
> files, most PDF creators do not use them.  When creators do use them,
> printing and converting files usually become easier, except when the
> PDF file contains nothing but images.
>
> On Mon, 14 Sep 2020, Richard Ellis via SlackBuilds-users wrote:
>
> > Date: Mon, 14 Sep 2020 13:15:18
> > From: Richard Ellis via SlackBuilds-users <slackbuilds-users at slackbuilds.org>
> > To: slackbuilds-users at slackbuilds.org
> > Cc: Richard Ellis <rellis at dp100.com>
> > Subject: Re: [Slackbuilds-users] Copying PDF-1.7 text using -14.2
> >
> > On Mon, Sep 14, 2020 at 09:57:13AM -0700, Rich Shepard wrote:
> > >On Mon, 14 Sep 2020, Alexander Verbovetsky wrote:
> > >>Maybe there is no text inside, just picture?  PDF is a container, not a
> > >>format.
> > >
> > >Scientific journal articles are primarily text with occasional plots or other
> > >images.  And I believe that PDF stands for "Portable Document Format."
> >
> > The Adobe name does expand to those words, but the word "Document" in the name
> > bears no resemblance to how the data inside the PDF document produces a visual
> > output.
> >
> > The internal structure of a PDF is basically better described as "electronic
> > paper" than as a "document".  PDF, internally, is simply a way of specifying
> > how to physically position "visual things" on a virtual sheet of paper, and
> > whether you have any luck extracting "text" later depends very much upon
> > how the creating software generated the internal PDF data.
