What a great resource for ideas and information.
I have been plying with .htaccess to try to block email spiders.
The info posted here has been of great help, but it only blocks those that advertise their presence.
I have downloaded a couple of email spiders, (first that came up on a search) and both of these still work through my site quite happily. They show up in the logs as Exlorer 6. I guess there is no way around this other than to have no emails on the web site?
Best regards,
Mark Empson
<snip>
[edited by: jdMorgan at 4:56 pm (utc) on Nov. 30, 2003]
[edit reason] No sigs or URLs, please [/edit]
Welcome to WebmasterWorld [webmasterworld.com]!
Take a look at this thread [webmasterworld.com] for an additional technique to stop site exploits. User-agent screening has the advantage of efficiency, in that you catch many intruders with one test. However, the ones you describe must be blocked by IP address, and further, a few must be blocked by forwarded IP address if they come through proxies.
Jim
Picked this up today:
|
The first thing that came to mind was, what's this preson doing requesting phpinfo.php? The next item that comes to mind is block access. And then, in doing that, that would mean including myself if ever I want to check my own details. This is for a hosted web site.
The question here is what to do with something like this?
Your post is on-topic as a potential addition to the list of troublesome user-agents. Any further discussion of *how* to deal with this specific problem probably does need its own thread, though.
Jim
Thanks for the follow up and the suggestion.
We'll chug on in what seems to be an ever-changing landscape.
I have a question concerning a RedirectMatch issue in my .htaccess file.
I have a hidden link to a non-existant file, which we will call example.html, embedded in a section containing all of the site links. Here is what the link resembles:
<a ngentmemekent="example.html" onclick="return false"> </a>. The onclick false action is a safety net for visual readers so they don't accidently trigger the banning redirect. Because there is no text for the link it is invisible on the displayed web page. The link is redirected by .htaccess to my banning script, which we will call ban.pl, for this example. The url cleansed code appears below:
RedirectMatch example\.html /cgi-bin/ban.pl
"GET /example.html HTTP/1.0" 302 219 "-" "Mozilla/4.0 (compatible; MSIE 5.0; Windows NT)"
I'd appreciate any help in getting this right.
TIA, Wiz