TABLE OF CONTENTS


What is PerimeterX?

Perimeterx Anti-Bot system is a protection system some websites use for blocking web scraping. One example at the moment  is https://www.ssense.com/.


Our View on PerimeterX

Identifying PerimeterX 

With Wappalyzer Chrome extension

Under Security


...Or in the code

Usually looking for "Perimeterx" string in "inspect page" will reveal its presence if this string is found somewhere in the code or in javascript files.


Reccomended approach to PerimeterX


During the execution of the scraper it happens, after some pages, that a challenge like the one in the picture is trigged, blocking the execution. 

It's needed a fully browser to not trigger the captcha, adding some random movement of the mouse and timers before moving to another page.



1. BEST CHOICE:


Playwright + proxy rotation


2. SECOND BEST:

Selenium

 


Reference and documentation

Perimeterx:https://www.perimeterx.com/products/bot-defender

Playwright:https://playwright.dev/python/

How Perimeterx works: https://www.trickster.dev/post/how-does-perimeterx-bot-defender-work/