Cleantalk Spam Protection

I've just added CleanTalk spam protection to a CSC v2 website. Seems to work well so far, so I thought it would be good to have a discussion topic for sharing user experience .
Its a paid service, but really cheap considering the time it saves.

Apart from the automated spam filtering, one of the fantastic advantages is that other verification methods can be turned off. With many internet users now using VPN's the Google reCapture more often requires full "click all the pictures with cars" validation for known VPN IP addresses, which is just another buyer objection to avoid if possible.

Anyone had experiences, any disadvantages of using this service?

Looks good. recapture is getting pretty annoying.

How do you install it for cscart? Is it an addon?

Update, !!!
CleanTalk has stuffed our website, so I dont recommend installing just yet.

Still trying to figure out the issue.
If you do try it, check website operation thoughourly before purchasing CleanTalk!

Ive now disabled CleanTalk until a fix is found.

To disable the plugin please delete strings after //Cleantalk on the start and on the end of every index.php file on your site.

Disappointing that it broke our website, given what it promises to achieve.
Its not an addon, but needs to be uploaded to the website root folder , then run the install script. There are plug-ins for other carts so it would be a good project for an addon dev I think.
I have a support ticket open with CleanTalk.. so will report back on the outcome.

The best way to mitigate spam is to just accept that its there and design your application in such a way that the caused load doesnt matter.

What we found to be the biggest burden is price monitoring companies. They crawl your website every 5 minutes or so to see if your price changes. Even though it is illegal, they maintain to do their unnoticed practices. The best way to check for this is to look for recurring IP addresses at certain intervals.

But take my word for it, spam is by far not your biggest concern.

@poppedweb
I very much disagree with your first comment as I feel you've overlooked the benefit of being able to disable all captcha.

Regarding the burden of price monitoring crawlers:
I had not considered this.. which raises the question as to whether there is an CSC addon to detect and block these?

On a VPS, I wonder if fail2ban or similar could be configured to detect these, but for csc on a shared server, any code would need to be under the CSC folder. Any suggestions?

But take my word for it, spam is by far not your biggest concern.

It is my biggest annoyance. If captcha is not turned on for contact us forms, a lot of bot spam comes through

The best way to mitigate spam is to just accept that its there and design your application in such a way that the caused load doesnt matter.

It's really quite annoying if you have to delete 30 spam reviews per day (we have about 6, with google recaptcha turned on - if I turn it off this jumps to 100s).

It's really quite annoying if you have to delete 30 spam reviews per day (we have about 6, with google recaptcha turned on - if I turn it off this jumps to 100s).

There is a multitude of strategies you can deploy to mitigate such attacks, the simplest one being a simple e-mail check. Another one is implementing it so that only registered users can write a review and only about the products they bought. Unregistered users could receive an email with a token that corresponds to their order to also write a review.

Then there is the classical IP range detector which groups reviews so that if spam is detected, you can easily delete it.

There are many smart ways to make handling with spam much easier. Most of the time these require some sort of investment, but you should never hinder your customer.

@poppedweb
I very much disagree with your first comment as I feel you've overlooked the benefit of being able to disable all captcha.

Regarding the burden of price monitoring crawlers:
I had not considered this.. which raises the question as to whether there is an CSC addon to detect and block these?

On a VPS, I wonder if fail2ban or similar could be configured to detect these, but for csc on a shared server, any code would need to be under the CSC folder. Any suggestions?

Such addons should not be required. The simplest thing I found to be working was disabling common tools used for downloading webpages to parse them. These include: python, wget and curl.

Besides, such practices are illegal. If you update your terms of service and do find a company crawling your website, you can easily sue them and get a reimbursement for all damages done.

@poppedweb

Well .. Neither python nor wget are enabled on this particular server and I'm sure that curl is needed.
Extract from CSC documentation:

Server Configuration Requirements:
"cURL support should be enabled. You need this PHP extension to ensure support of secure connections, some payment systems such as PayPal and Authorize.Net, and real-time shipping rate calculators for FedEx and DHL/Airborne."

Whilst alternative suggestions to CleanTalk are welcome, IMHO suggesting that the way to deal with companies crawling your website by law suet is off-topic, not realistic, at best left for another thread.

The aim of CleanTalk is to prevent spam in first place, so again, suggesting we should just delete spam as it comes in is also unhelpful. This thead is about "CleanTalk Spam Protection" not "Spam Reviewing and Deletion"...

@poppedweb

Well .. Neither python nor wget are enabled on this particular server and I'm sure that curl is needed.
Extract from CSC documentation:

Also, whilst alternative suggestions to CleanTalk are welcome, IMHO suggesting that the way to deal with companies crawling your website by law suet is off-topic, not realistic, at best left for another thread.

You completely misunderstood me. You can disable these tools by banning user agents (as they all have a specific user agent). This way other people can not use such tools to extract info from your website.

Thanks for the clarification, I'll need to research banning user agents. The purpose of using something like Cleartalk is to have things done automatically, blocking spam based on known IPaddress. I don't think it deals with User agents. It looks like the .htacces file needs editing to ban user agents, but I'm guessing that illegal site crawlers would change frequently?
Who/what determines the user agent name a crawler will use, I mean, for example, what prevents an illegal site crawler from pretending to be a Mozilla browser ?

I guess that's why you suggest detection based on behavior ie look for recurring IP addresses at certain intervals.

Thus some sort of Automated process is needed.

Perhaps a good place to start: https://github.com/mitchellkrogza/apache-ultimate-bad-bot-blocker/blob/master/_htaccess_versions/htaccess-mod_setenvif.txt

Thanks for the clarification, I'll need to research banning user agents. The purpose of using something like Cleartalk is to have things done automatically, blocking spam based on known IPaddress. I don't think it deals with User agents. It looks like the .htacces file needs editing to ban user agents, but I'm guessing that illegal site crawlers would change frequently?
Who/what determines the user agent name a crawler will use, I mean, for example, what prevents an illegal site crawler from pretending to be a Mozilla browser ?

I guess that's why you suggest detection based on behavior ie look for recurring IP addresses at certain intervals.

Thus some sort of Automated process is needed.

"I guess that's why you suggest detection based on behavior", we mainly use a machine model from SageMaker for this. It compares average traffic with bot traffic and flags it accordingly. It will then apply a hard rate limit but will still allow scraping in the rare event that we flag a customer. We simply take the hit as we automatically scale our application anyways (using kubernetes).

Regarding the user agents, the key behind the fact why people can get banned using the default user agent, is that any request made through Python, gets a python user agent by default. Same applies for curl and most of the other open source alternatives. This should already warn companies that they should not index your domain as they will get an error (which will certainly puzzle their developers).

The next mitigation would be to check your logs every now and then for recurring IP addresses at set intervals. If these do have a separate user agent, you can try sending them a message (using reverse IP lookup you get their domains most of the times).

If all does not work you can just go nuclear and ban their IP ranges and even send them legal warnings that they should not index your domain as it is against your terms of service. But do make sure that you have some kind of 'Fair Use' policy, as you are otherwise left to dust: https://resources.distilnetworks.com/all-blog-posts/is-web-scraping-illegal-depends-on-what-the-meaning-of-the-word-is-is

Cleantalk is a very good service, but its a shame that it doesnt work for CS-Cart websites. The Cleantalk Universal php plugin alters index.php and then the site no longer works.

It needs a plugin like there is for WooCommerce, Magento, OpenCart, etc.

Hi guys

I have good news :grin:

Cleantalk support CS-Cart now in the new universal Plugin php-uni-2.5.3.zip

https://cleantalk.org/help/install-uni

Pls contact the Support of Cleantalk and ask for the download link. On the website they have currently php-uni-2.5.2.zip available but you need php-uni-2.5.3.zip

After the installation, you have to create a new block type 'HTML block' and add it to all pages.

https://docs.cs-cart.com/latest/user_guide/look_and_feel/changing_attributes/add_custom_js_code.html

2) Code for your blocks



I use CleanTalk php-uni-2.5.3.zip with Multi-Vendor 4.13.3 and it works fine on my site.

Let me know if you have any more questions.

Good news, everyone!

Thank you for all the feedback, guys! So, we are here to announce to you our updated universal Anti-Spam plugin Uni 2.5.4 that is now compatible with CS-Cart. You can download it here and install, following our guide.

Feel free to use the plugin and if you have any questions, post it here or email us at welcome@cleantalk.org – we'll be happy to help.

1 Like