Posted by Augusto Ellacuriaga on July 3, 2008 | 9 comments
In a prior post I explained the main reasons to block undesirable bots, scrapers and crawlers: preventing content theft, malware injection, and vulnerability scanning on a website. If you haven’t read that post, I encourage you to do so.
One of the solutions provided was a change to the .htaccess file to block crawlers based on User-Agent or IP address. As a result, folks have been asking me how to identify which User-Agent (UA) or IP address to block...continue
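As a rough illustration of how candidates for blocking might be spotted, here is a minimal Python sketch that tallies requests per IP address and per User-Agent from access-log lines. It assumes the server writes Apache's "combined" log format; the `count_clients` helper, the `LOG_LINE` pattern and the sample entries are hypothetical and made up for this example, not taken from the post.

```python
import re
from collections import Counter

# Assumed layout (Apache "combined" format):
# IP ident user [time] "request" status bytes "referer" "user-agent"
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "[^"]*" \d{3} \S+ "[^"]*" "(?P<ua>[^"]*)"'
)

def count_clients(lines):
    """Return (ip_counts, ua_counts) for the lines that match LOG_LINE."""
    ip_counts, ua_counts = Counter(), Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        ip_counts[match.group("ip")] += 1
        ua_counts[match.group("ua")] += 1
    return ip_counts, ua_counts

# Made-up sample entries, for illustration only.
SAMPLE = [
    '203.0.113.5 - - [03/Jul/2008:10:00:00 +0000] "GET / HTTP/1.1" 200 512 "-" "ExampleScraper/1.0"',
    '203.0.113.5 - - [03/Jul/2008:10:00:01 +0000] "GET /about HTTP/1.1" 200 734 "-" "ExampleScraper/1.0"',
    '198.51.100.7 - - [03/Jul/2008:10:00:02 +0000] "GET / HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
]

ip_counts, ua_counts = count_clients(SAMPLE)
print(ip_counts.most_common(1))  # busiest IP address in the sample
print(ua_counts.most_common(1))  # most frequent User-Agent in the sample
```

A high request count alone does not prove a client is hostile, so treat the totals as a shortlist to review by hand before adding anything to .htaccess.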
Posted by Augusto Ellacuriaga on June 25, 2008 | 10 comments
Spam has become a major problem for many of us who deal with content development on a regular basis. It takes different forms, affects performance, and compromises the security of a site.
There are bots that specialize in spamming by inserting links and unsolicited information into the input fields and forms of websites. The comment section of a blog and the subscription or price-match forms on eCommerce sites are very susceptible to these types of attacks...continue