Re: htaccess file
- From: David Bolt <blacklist-me@xxxxxxxxxx>
- Date: Sun, 30 Dec 2007 21:39:19 +0000
On Sun, 30 Dec 2007, Barely Audible wrote:-
<big snip>
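Something I missed the first time I read the list:
RewriteCond %{HTTP_USER_AGENT} ^.*Ants.*[NC,OR]
                                       ^^
There should be a space between the * and the [ . Without it, Apache
reads [NC,OR] as part of the pattern (a character class) rather than as
flags.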
RewriteCond %{HTTP_USER_AGENT} ^.*[Ww]eb[Bb]andit.*[NC,OR]
                                  ^^^^  ^^^^
The [Ww] and [Bb] character classes are also unnecessary. The NC flag
tells Apache to ignore case, so you could just use webbandit and it
would match.
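Putting both points together, the two conditions might look something like the sketch below. It assumes they are the last two entries of a longer OR-chained list. The closing RewriteRule is only a guess at what the snipped list ends with, so treat it as a placeholder.

```apache
RewriteEngine On

# Flags are a separate third argument, so a space must precede the [ .
RewriteCond %{HTTP_USER_AGENT} ^.*Ants.* [NC,OR]

# With NC, a plain lowercase pattern matches WebBandit, webbandit, WEBBANDIT, ...
# The last condition in an OR chain carries no OR flag.
RewriteCond %{HTTP_USER_AGENT} ^.*webbandit.* [NC]

# Assumed final rule (not in the quoted list): refuse the request with 403.
RewriteRule ^ - [F]
```

The leading ^.* and trailing .* are kept as they were in the original list. An unanchored pattern such as webbandit on its own should match the same user agents.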
Another thing you'll probably find is that including wget isn't going
to help very much when someone does use it to grab your site. It
is very easy to disguise wget by telling it to supply a different user agent
header, and anyone using it will most likely have looked up the options
in the man page, where they would also have seen this:
--random-wait
Some web sites may perform log analysis to identify retrieval programs such
as Wget by looking for statistically significant similarities in the time
between requests. This option causes the time between requests to vary
between 0 and 2 * wait seconds, where wait was specified using the --wait
option, in order to mask Wget's presence from such analysis.
A recent article in a publication devoted to development on a popular
consumer platform provided code to perform this analysis on the fly. Its
author suggested blocking at the class C address level to ensure automated
retrieval programs were blocked despite changing DHCP-supplied addresses.
The --random-wait option was inspired by this ill-advised recommendation to
block many unrelated users from a web site due to the actions of one.
With the options:
--wait=30 --random-wait --user-agent="$something_resembling_a_browser_user_agent"
along with any others required, you probably won't even know they're
doing it.
Regards,
David Bolt
--
www.davjam.org/lifetype/ www.distributed.net: OGR@100Mnodes, RC5-72@15Mkeys
SUSE 10.1 32bit | openSUSE 10.2 32bit | openSUSE 10.3 32bit | openSUSE 11.0a0
SUSE 10.1 64bit | openSUSE 10.2 64bit |
RISC OS 3.6 | TOS 4.02 | openSUSE 10.3 PPC |RISC OS 3.11