How To Scrape Google Search Results Data In Python Easily Google search engine results pages (SERPs) can provide alot of important data for you and your business but you most likely wouldn't want to scrape it manually. After all, there might be multiple queries you're interested in, and the corresponding results should be monitored
Article Beating Google ReCaptcha and the funCaptcha using AWS Rekognition Project Voight-KampffOriginally found HERE. Beating Google's reCaptcha using AWS Rekognition. Part of project Touch-Captcha (두 터치). I did this because I cannot promote a better Captcha without first beating the industry standard. Nothing special here. Credit goes to the ML researchers who developed the
Good IP vs Bad IP? In the past I've mentioned "Good IPs" and "Bad IPs". So what makes an IP Bad? Well, it comes down to what are other people doing on that IP? If you're using a cheap/free/crappy VPN or Proxy chances are you're sharing that
Let's Talk Behavior Analysis These days people are using more behavior analysis and other buzzwords to analyze how people interact with their site. While probably not the first, HotJar is a common and well known tool for tracking the behaviors of users. Common tools include heatmaps that track
Article Sanitize Your Data Was reading this article today about Genius filing a lawsuit against Google for violating Genius's Terms of Use. The article served as a reminder why you should sanitize the data you scrape. This is a simple thing that Genius did to determine who was
But It's Dumb Some bots are dumb bots and that's okay! I no longer play Rainbow 6 Siege, but when I did I wanted the cool skins, guns, and all the operators. As you played you got an in game currency called 'Renown'. Well, you had to
Article Instacart shoppers besieged by bots that snatch lucrative orders Instacart shoppers besieged by bots that snatch lucrative ordersWhile bots aren’t a new problem for Instacart, the recent deluge is different because it comes at a time of white-knuckled expansion for the startup.Kartikay MehrotraThe Seattle TimesLisa Marsh’s job shopping and delivering
Are You a Bot? Let's talk Captchas for a second. They suck even when you're human. Use a VPN? Captcha. Privacy plugins? Captcha. Botting, scrapping, etc? Oh yeah. So when are you getting a captcha? Well I recently came across this handy dandy website which gives you a
Privacy? Not Here. For a while I've been using a browser extension that returns a fake canvas value to websites that request it. This ensures fewer sites are able to track me, especially ones that use something like FingerprintJS, which is able to track the same browser