December 15, 2019

255 words 2 mins read

s0md3v/goop

Google Search Scraper


repo name	s0md3v/goop
repo link	https://github.com/s0md3v/goop
homepage
language	Python
size (curr.)	30 kB
stars (curr.)	461
created	2019-08-02
license	GNU General Public License v3.0

Note: It no longer works. Google team told me it’s not a legitimate issue when I reported it to them but now they just silently fixed it.

Introduction
How it works?
Usage
- Installation
- Example
Legal

Introduction

goop can perform google searches without being blocked by the CAPTCHA or hitting any rate limits.

How it works?

Facebook provides a debugger tool for its scraper. Interestingly, Google doesn’t limit the requests made by this debugger (whitelisted?) and hence it can be used to scrap the google search results without being blocked by the CAPTCHA.
Since facebook is involved, a facebook session Cookie must be supplied to the library with each request.

Usage

Installation

pip install goop

Example

from goop import goop

page_1 = goop.search('red shoes', '<your facebook cookie>')
page_2 = goop.search('red shoes', '<your facebook cookie>', page='1')
include_omitted_results = goop.search('red shoes', '<your facebook cookie>', page='8', full=True)

A dict of following structure is returned

{
    "0": {
        "url": "https://example.com",
        "text": "Example webpage",
        "summary": "This is an example webpage whose aim is to demonstrate the usage of ..."
    },
    "1": {
...

cli.py demonstrates the usage by performing google searches from the terminal with the following command

python cli.py <query> <number_of_pages>

goop-cli

Legal & Disclaimer

Scraping google search results is illegal. This library is merely a proof of concept of the bypass. The author isn’t responsible for the actions of the end users.

s0md3v/goop

Contents

Introduction

How it works?

Usage

Installation

Example

Legal & Disclaimer

google-research/football

google/TensorNetwork

asvcode/Vision_UI

jarun/googler

google-research/tensor2robot

google-research/morph-net

hardikvasa/google-images-download

mindslab-ai/voicefilter

AntonioErdeljac/Google-Machine-Learning-Course-Notes