Re: OT : Approximate / fast math libraries ?



Btw, I usually use -O3 rather than -O2. I agree with the other poster: I am not sure getting rid of precision is a great idea.

Trotter

----- Original Message ----
From: Chris Jones <jonesc@xxxxxxxxxxxxxxxxx>
To: For users of Fedora <fedora-list@xxxxxxxxxx>
Sent: Friday, August 31, 2007 2:41:56 PM
Subject: Re: OT : Approximate / fast math libraries ?

Hi,

Thanks for your feedback.

What exactly is your need? Contact me off-list and maybe I can
help. Have you profiled your code? I have found that people
often do not actually know where their code is spending its
time. I once sped up an app which was universally acknowledged
to be slow "because it uses floating point." I sped it up 3x.

Yes, I have profiled the code, quite extensively, using the valgrind/calltree
application. From this I know this that I'm know I've tidying up this to the
point where its hard to find big improvements, the cpu time is fairly well
spread around, not isolated in a few places. So am now looking a a few places
where math calls are taking more time than I would hope. I'm not going to get
factors in speed in the overall application, but I hope in a few places
things can be improvemed a lot locally.

Also, the project is not small, massive in fact, and I'm only writting one
small part. If you are interested you can find it here

http://lhcb-release-area.web.cern.ch/LHCb-release-area/DOC/brunel/releases/latest/doxygen/index.html

It also has to be supported on a *lot* of hardware. Basically gcc 3.2.3 based
Scientific Linux 3 machines, gcc 3.4.6 SL4 machines (32 and 64 bit) and (not
my decision), windows VC 7.1. I cannot rely on for instance SSE math calls
etc.

Taking an example from another thread, one place I'm trying to understand is
where I use atan2 see

http://www.hep.phy.cam.ac.uk/~jonesc/atan2.png

for the profiler output. atan2 is taking 50% of the time of this method. Not
here I don't need that much precision on the result - say +- O(2*pi/100).
Anything you can suggest here - The code is here

http://www.hep.phy.cam.ac.uk/~jonesc/RichPhotonRecoUsingCKEstiFromRadius.cpp

( note though its full of internal classes etc...)

cheers Chris




I modified the parsing routines it used, not the floating point.

Mike
--
p="p=%c%s%c;main(){printf(p,34,p,34);}";main(){printf(p,34,p,34);}
Oppose globalization and One World Governments like the UN.
This message made from 100% recycled bits.
You have found the bank of Larn.
I can explain it for you, but I can't understand it for you.
I speak only for myself, and I am unanimous in that!


--
fedora-list mailing list
fedora-list@xxxxxxxxxx
To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list






____________________________________________________________________________________
Pinpoint customers who are looking for what you sell.
http://searchmarketing.yahoo.com/

--
fedora-list mailing list
fedora-list@xxxxxxxxxx
To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list



Relevant Pages