Automatic web page screenshots via script

From: smoker (amd_at_headru.sh)
Date: 12/30/03


Date: Tue, 30 Dec 2003 14:03:49 +0000

Hi,
I have been trying to find a way to grab screenshots of webpages via
scripting.

So far, I have htmldoc fetching the pages and converting them to PDF, and
then ImageMagick (with Ghostscript) converting the PDF files to JPEG images.
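
For anyone wanting to reproduce the setup, the two steps could be sketched
roughly as below. This is only an illustration in Python, not the Perl script
actually in use. The function names, the "page" basename, the 96 dpi density
and the choice of htmldoc's --webpage mode are all assumptions made for the
example, and it assumes the htmldoc and ImageMagick "convert" binaries are on
the PATH (newer ImageMagick releases call the binary "magick").

```python
import subprocess
from pathlib import Path


def build_commands(url, workdir, basename="page", density=96):
    """Return the two argv lists for the HTML -> PDF -> JPEG pipeline.

    All names and defaults here are hypothetical, chosen for illustration.
    """
    workdir = Path(workdir)
    pdf = workdir / f"{basename}.pdf"
    jpg = workdir / f"{basename}.jpg"

    # Step 1: htmldoc fetches the URL and renders it to PDF.
    # --webpage treats the input as one unstructured page,
    # -t sets the output format and -f the output file.
    to_pdf = ["htmldoc", "--webpage", "-t", "pdf", "-f", str(pdf), url]

    # Step 2: ImageMagick rasterises the PDF (via Ghostscript).
    # "[0]" selects only the first page of the PDF.
    to_jpg = ["convert", "-density", str(density), f"{pdf}[0]", str(jpg)]

    return to_pdf, to_jpg


def grab(url, workdir, timeout=60):
    """Run both steps in order; raises if either tool fails or times out."""
    to_pdf, to_jpg = build_commands(url, workdir)
    for argv in (to_pdf, to_jpg):
        subprocess.run(argv, check=True, timeout=timeout)
    return to_jpg[-1]  # path of the JPEG that was written
```

Calling grab("http://example.com/", "/tmp/shots") would then leave
/tmp/shots/page.jpg behind, provided both tools succeed. Keeping the command
construction separate from the execution makes it easy to log or inspect the
exact command lines before running them.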

This works fairly well as far as grabbing and creating the images goes,
but:

1) It is very CPU intensive (99% user CPU in top for 20 - 30 seconds).
2) htmldoc can't / won't wait for ASP pages, Java, etc. to finish loading
before grabbing the page.
3) It has poor support for CSS, frames, etc.

I have tried making htmldoc produce PostScript output instead, but with no
noticeable difference in the quality of the output.

Somebody suggested using Mozilla to achieve this, but as I am running
this process on a web server via Perl scripting, I don't see how an X
program is going to help.

Does anybody have any alternatives or suggestions?

TIA

alan


