[consulting] From Highly Formatted Print to Web -- Best Solution?

SHEN Liang shenzhuxi at gmail.com
Sat Apr 9 13:31:57 UTC 2011


check my module
http://drupal.org/project/fileviewer

On Fri, Apr 8, 2011 at 11:19 PM,  <Daniel at snakehill.net> wrote:
> Here's a few ideas from Cheryl, our designer.
>
> 1. There are online services that take the PDF (which can be created from
> the Quark doc). Most appear to be paid but there is one called YUDU that
> has a free option but they also have a paid option ($180 a year) with more
> features.
>
> 2. We can create an online page-turning PDF from InDesign. We would have
> to convert the Quark file and have the newest version of InDesign (CS5).
> Even the newest version of Quark does not have this ability.
>
> So with either option there would be a link on the Bulletin page going to
> the PDF. This is the simplest, most cost effective way to do this.
>
> Dan
>
>
>
> From:        consulting-request at drupal.org
> To:        consulting at drupal.org
> Date:        04/08/2011 08:00 AM
> Subject:        consulting Digest, Vol 63, Issue 4
> Sent by:        consulting-bounces at drupal.org
> ________________________________
>
>
> Send consulting mailing list submissions to
>                 consulting at drupal.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
>                 http://lists.drupal.org/mailman/listinfo/consulting
> or, via email, send a message with subject or body 'help' to
>                 consulting-request at drupal.org
>
> You can reach the person managing the list at
>                 consulting-owner at drupal.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of consulting digest..."
>
>
> Today's Topics:
>
>   1. From Highly Formatted Print to Web -- Best Solution?
>      (Shai Gluskin)
>   2. Re: From Highly Formatted Print to Web -- Best
> Solution?
>      (Fred Jones)
>   3. Re: From Highly Formatted Print to Web -- Best
> Solution? (Ted)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Thu, 7 Apr 2011 13:06:38 -0400
> From: Shai Gluskin <shai at content2zero.com>
> Subject: [consulting] From Highly Formatted Print to Web -- Best
>                 Solution?
> To: "A list for Drupal consultants and Drupal service/hosting
>                 providers"                 <consulting at drupal.org>
> Message-ID: <BANLkTikLBOa_Nx3fPzLG-_yFc8YbYiCKhw at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi Folks,
>
> I've got a client who is going to be serially publishing a book to its web
> site, one page/day. Each day will be its own node.
>
> The print version is *highly *formatted. A real pain. The text is either in
> pdf or Word, but the fussing I've done so far when pasting into TinyMce
> doesn't produce good results.
>
> Here is my idea that I want feedback on: They should take a screen shot of
> the whole page (the pages are small, like 5in. x 7in.) for each day and
> upload the image. In order to get search-engine traffic for those pages, I
> thought they should just copy and paste the text from the pdf into a text
> field, losing all the formatting. Using css, I'll make sure the text field
> displays off-screen. This way they get the cleanest/easiest data entry and
> best-looking presentation while still allowing search engines to drive
> traffic to the page based on the text contents of a field that will be
> offscreen.
>
> Does that make sense?
>
> Any other ideas?
>
> Shai
> -------------- next part --------------
> An HTML attachment was scrubbed...
> URL:
> http://lists.drupal.org/pipermail/consulting/attachments/20110407/7c3248af/attachment-0001.html
>
> ------------------------------
>
> Message: 2
> Date: Thu, 7 Apr 2011 20:14:19 +0300
> From: Fred Jones <fredthejonester at gmail.com>
> Subject: Re: [consulting] From Highly Formatted Print to Web -- Best
>                 Solution?
> To: "A list for Drupal consultants and Drupal service/hosting
>                 providers"                 <consulting at drupal.org>
> Message-ID: <BANLkTim+rTeqb6ZOU7irGc9oYa7DfSsz9Q at mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
>> Does that make sense?
>
> Google, at least, indexes PDFs.
>
> Aside from that, it sounds reasonable to me.
>
>
> ------------------------------
>
> Message: 3
> Date: Thu, 07 Apr 2011 14:01:29 -0400
> From: Ted <ted-drupalists at webfirst.com>
> Subject: Re: [consulting] From Highly Formatted Print to Web -- Best
>                 Solution?
> To: A list for Drupal consultants and Drupal service/hosting providers
>                 <consulting at drupal.org>
> Message-ID: <4D9DFBF9.6050904 at webfirst.com>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> On 4/7/2011 1:06 PM, Shai Gluskin wrote:
>> Here is my idea that I want feedback on: They should take a screen
>> shot of the whole page (the pages are small, like 5in. x 7in.) for
>> each day and upload the image. In order to get search-engine traffic
>> for those pages, I thought they should just copy and paste the text
>> from the pdf into a text field, losing all the formatting. Using css,
>> I'll make sure the text field displays off-screen. This way they get
>> the cleanest/easiest data entry and best-looking presentation while
>> still allowing search engines to drive traffic to the page based on
>> the text contents of a field that will be offscreen.
>>
>> Does that make sense?
>>
>> Any other ideas?
>>
>
> We've used a pdf-based toolchain to emulate the google docs pdf
> quickview function, including "highlighting" and copying text.
>
> http://pdftoxml.sourceforge.net/ - Gives you the coordinates and
> dimensions of each piece of text
> pdftoppm - Exports each page to a PNG image (with appropriate arguments)
> convert - ImageMagick tools to convert PNGs to indexed format, make
> thumbnails, etc.
>
> You'll most likely want to use some XSLT to cut down the size of the XML
> file from pdftoxml. These tools can be used to automate your idea above
> (with or without support for highlighting/copying the book text).
>
> Ted
>
>
>
> ------------------------------
>
> _______________________________________________
> consulting mailing list
> consulting at drupal.org
> http://lists.drupal.org/mailman/listinfo/consulting
>
>
> End of consulting Digest, Vol 63, Issue 4
> *****************************************
>
>
>
>
>
> _______________________________________________
> consulting mailing list
> consulting at drupal.org
> http://lists.drupal.org/mailman/listinfo/consulting
>
>


More information about the consulting mailing list