Re: htaccess file
- From: Barely Audible <somewhere@xxxxxxxxxxxxxxx>
- Date: Sun, 30 Dec 2007 22:28:31 +0000
David Bolt wrote:
On Sun, 30 Dec 2007, Barely Audible wrote:-
<big snip>
Something I missed, the first time I read the list:
RewriteCond %{HTTP_USER_AGENT} ^.*Ants.*[NC,OR]^^
There should be a space between the * and [ .
RewriteCond %{HTTP_USER_AGENT} ^.*[Ww]eb[Bb]andit.*[NC,OR]^^^^ ^^^^
Are also unnecessary. The NC tells Apache to ignore the case so you
could just use webbandit and it would match.
Another thing you'll probably find, is that including wget isn't going
to help very much when someone does go to use it to grab your site. It
is very easy to disguise wget by telling it to supply another user agent
header, and anyone using it will most likely have looked up the options
using the man page, where they would have also seen this:
--random-wait
Some web sites may perform log analysis to identify retrieval programs such
as Wget by looking for statistically significant similarities in the time
between requests. This option causes the time between requests to vary
between 0 and 2 * wait seconds, where wait was specified using the --wait
option, in order to mask Wget's presence from such analysis.
A recent article in a publication devoted to development on a popular conâ
sumer platform provided code to perform this analysis on the fly. Its
author suggested blocking at the class C address level to ensure automated
retrieval programs were blocked despite changing DHCP-supplied addresses.
The --random-wait option was inspired by this ill-advised recommendation to
block many unrelated users from a web site due to the actions of one.
Using the options:
--wait=30 --random-wait --user-agent="$something_resembling_a_browser_user_agent"
along with others required, and you probably won't even know they're
doing it.
Thanks Dave - I have learnt a lot!
I'm still getting the 500 error though :-(
.
- References:
- Re: htaccess file
- From: David Bolt
- Re: htaccess file
- From: David Bolt
- Re: htaccess file
- Prev by Date: Re: dependency hell
- Next by Date: Re: Window title bar disappeared
- Previous by thread: Re: htaccess file
- Next by thread: NTP not working on bootup
- Index(es):
Relevant Pages
|