Re: [opensuse] PDF



Neil wrote:
Unless you do some very strange things, 5 pages isn't going to load
unacceptably slowly in PDF.

Agreed, I'm not concerned with load speed here. The author does like
to use hi-res graphics, but that's not an issue.

But I also place the source PDFs on the org's web site and keep
the older versions there for archival purposes. This works
well since the source is in color and there are frequently color
photos that we can't afford to distribute in paper form. It would
be nice to be able to index the archived newsletters for historical
reference (it's a museum), but PDF doesn't work well for indexing
text.

What do you mean exactly?

If you'd like to search the PDF (or a set of PDFs) for the occurrence
of words (like Google does), Beagle might be able to help. I can't do
more than point you to it.

Yes, like Google does. I want the PDFs available so that search
engine spiders will index the text content.

If you mean the reader would be able to search within the file and
jump to the correct section with a useful index, then that is already
available in PDF. For example, if you use the heading styles correctly
in OOo then the generated PDF has an index by default. It uses the
headings to determine what are chapters and paragraphs and inserts
those into the PDF.

That's useful to know, but it's not what I'm talking about. I also
don't have the source documents for these PDFs.
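
Without the source documents, one rough way to see whether a spider
could index these files at all is to check whether each PDF carries
extractable text. The sketch below assumes the pdftotext command-line
tool is installed; the function names and the 20-word threshold are
made up for illustration, and a pass here doesn't guarantee any
particular search engine will index the file.

```python
import shutil
import subprocess
import sys
from pathlib import Path


def extract_text(pdf_path):
    """Return the text pdftotext extracts, or None if the tool is missing."""
    if shutil.which("pdftotext") is None:
        return None
    # Passing "-" as the output file sends the extracted text to stdout.
    result = subprocess.run(
        ["pdftotext", str(pdf_path), "-"],
        capture_output=True, text=True, check=False,
    )
    return result.stdout if result.returncode == 0 else ""


def looks_indexable(text, min_words=20):
    """Heuristic (assumed threshold): enough words suggests a text layer."""
    return len(text.split()) >= min_words


if __name__ == "__main__":
    # Directory holding the archived newsletters; defaults to the current one.
    archive = Path(sys.argv[1]) if len(sys.argv) > 1 else Path(".")
    for pdf in sorted(archive.glob("*.pdf")):
        text = extract_text(pdf)
        if text is None:
            print("pdftotext not found on PATH")
            break
        if looks_indexable(text):
            print(f"{pdf.name}: text found")
        else:
            print(f"{pdf.name}: little or no text (scanned images?)")
```

Files reported as having little or no text are probably page images,
and those would need OCR before any indexer could read them.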

Regards,
Lew


