June 1, 2020

bisguzar/twitter-scraper

Scrape the Twitter Frontend API without authentication.


repo name	bisguzar/twitter-scraper
repo link	https://github.com/bisguzar/twitter-scraper
homepage
language	Python
size (curr.)	177 kB
stars (curr.)	2317
created	2018-02-22
license	MIT License

Twitter Scraper

Twitter’s API is annoying to work with and has many limitations. Luckily, their frontend (JavaScript) has its own API, which I reverse-engineered. No API rate limits. No restrictions. Extremely fast.

You can use this library to get the text of any user’s Tweets trivially.

Prerequisites

Before you begin, ensure you have met the following requirements:

Internet Connection
Python 3.6+

Installing twitter-scraper

If you want to use the latest version, install from source. To install twitter-scraper from source, follow these steps:

Linux and macOS:

git clone https://github.com/bisguzar/twitter-scraper.git
cd twitter-scraper
sudo python3 setup.py install

Alternatively, you can install the released package from PyPI with pip:

pip3 install twitter_scraper

Using twitter_scraper

Just import twitter_scraper and call functions!

→ function get_tweets(query: str [, pages: int]) -> dictionary

get_tweets fetches tweets from a profile or from a hashtag. It takes a username or hashtag as its first parameter (a string) and the number of pages to scan as its second parameter (an integer).

Keep in mind:

The first parameter must start with # (the number sign) if you want to get tweets from a hashtag.
The pages parameter is optional.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from twitter_scraper import get_tweets
>>> 
>>> for tweet in get_tweets('twitter', pages=1):
...     print(tweet['text'])
... 
spooky vibe check
…
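The two query forms described above (a plain username, or a term prefixed with #) can be sketched as a small helper. The names build_query and print_tweets are hypothetical and not part of twitter_scraper; only get_tweets and its pages parameter come from the library as documented here.

```python
def build_query(term: str, hashtag: bool = False) -> str:
    """Return a query string in the form get_tweets expects.

    Hypothetical helper, not part of twitter_scraper: it strips any
    leading '#' or '@' and adds a single '#' for hashtag queries.
    """
    term = term.strip().lstrip("#@")
    if not term:
        raise ValueError("term must not be empty")
    return f"#{term}" if hashtag else term


def print_tweets(term: str, hashtag: bool = False, pages: int = 1) -> None:
    """Print the text of each tweet for the query (needs network access)."""
    # Imported lazily so build_query stays usable without the package.
    from twitter_scraper import get_tweets

    for tweet in get_tweets(build_query(term, hashtag), pages=pages):
        print(tweet["text"])
```

Calling print_tweets("python", hashtag=True) would scan one page of the #python hashtag, while print_tweets("twitter") would scan one page of that profile.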

get_tweets produces one dictionary per tweet, with the following keys:

Key	Type	Description
tweetId	string	Tweet’s identifier, visit twitter.com/USERNAME/ID to view tweet.
userId	string	Tweet’s userId
username	string	Tweet’s username
tweetUrl	string	Tweet’s URL
isRetweet	boolean	True if it is a retweet, False otherwise
isPinned	boolean	True if it is a pinned tweet, False otherwise
time	datetime	Published date of tweet
text	string	Content of tweet
replies	integer	Replies count of tweet
retweets	integer	Retweet count of tweet
likes	integer	Like count of tweet
entries	dictionary	Has hashtags, videos, photos and urls keys. Each value is a list
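As a rough illustration of working with these keys, the sketch below filters out retweets and pinned tweets and then ranks the rest by like count. The function names and the sample dictionaries are made up for the example; only the key names (isRetweet, isPinned, likes, text, tweetId) are taken from the table above.

```python
from typing import Any, Dict, Iterable, List

Tweet = Dict[str, Any]


def original_tweets(tweets: Iterable[Tweet]) -> List[Tweet]:
    """Keep tweets that are neither retweets nor pinned."""
    return [
        t for t in tweets
        if not t.get("isRetweet") and not t.get("isPinned")
    ]


def most_liked(tweets: Iterable[Tweet], n: int = 3) -> List[Tweet]:
    """Return the n tweets with the highest 'likes' value."""
    return sorted(tweets, key=lambda t: t.get("likes", 0), reverse=True)[:n]


# Invented sample data shaped like the documented tweet dictionaries.
sample = [
    {"tweetId": "1", "text": "first", "isRetweet": False, "isPinned": False, "likes": 5},
    {"tweetId": "2", "text": "second", "isRetweet": True, "isPinned": False, "likes": 50},
    {"tweetId": "3", "text": "third", "isRetweet": False, "isPinned": False, "likes": 12},
]

top = most_liked(original_tweets(sample), n=1)
print(top[0]["text"])  # third
```

With real data, the list passed in would be built from get_tweets, for example list(get_tweets('twitter', pages=1)).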

→ function get_trends() -> list

You can get the Trends of your area simply by calling get_trends(). It will return a list of strings.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from twitter_scraper import get_trends
>>> get_trends()
['#WHUTOT', '#ARSSOU', 'West Ham', '#AtalantaJuve', '#バビロニア', '#おっさんずラブinthasky', 'Southampton', 'Valverde', '#MMKGabAndMax', '#23NParoNacional']
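Since get_trends() returns plain strings, and hashtag queries to get_tweets must start with #, one possible follow-up is to separate hashtag trends from plain topics. This is a minimal sketch; split_trends is a hypothetical helper, and the sample list reuses a few values from the output above.

```python
from typing import Iterable, List, Tuple


def split_trends(trends: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Split trend strings into (hashtags, plain topics), keeping order."""
    hashtags: List[str] = []
    topics: List[str] = []
    for trend in trends:
        if trend.startswith("#"):
            hashtags.append(trend)
        else:
            topics.append(trend)
    return hashtags, topics


# A few entries copied from the example output above.
sample = ['#WHUTOT', '#ARSSOU', 'West Ham', 'Southampton', 'Valverde']
hashtags, topics = split_trends(sample)
print(hashtags)  # ['#WHUTOT', '#ARSSOU']
print(topics)    # ['West Ham', 'Southampton', 'Valverde']
```

Each entry in hashtags already has the leading # that a hashtag query needs.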

→ class Profile(username: str) -> class instance

You can get the personal information of a profile, such as birthday and biography, if it exists and is public. The class takes a username parameter and returns an instance. Access the information through its attributes.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from twitter_scraper import Profile
>>> profile = Profile('bugraisguzar')
>>> profile.location
'Istanbul'
>>> profile.name
'Buğra İşgüzar'
>>> profile.username
'bugraisguzar'

→ .to_dict() -> dict

to_dict is a method of the Profile class. It returns the profile data as a Python dictionary.

Python 3.7.3 (default, Mar 26 2019, 21:43:19) 
[GCC 8.2.1 20181127] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from twitter_scraper import Profile
>>> profile = Profile("bugraisguzar")
>>> profile.to_dict()
{'name': 'Buğra İşgüzar', 'username': 'bugraisguzar', 'birthday': None, 'biography': 'geliştirici@peptr', 'website': 'bisguzar.com', 'profile_photo': 'https://pbs.twimg.com/profile_images/1199305322474745861/nByxOcDZ_400x400.jpg', 'banner_photo': 'https://pbs.twimg.com/profile_banners/1019138658/1555346657/1500x500', 'likes_count': 2512, 'tweets_count': 756, 'followers_count': 483, 'following_count': 255, 'is_verified': False, 'is_private': False, 'user_id': '1019138658'}
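Because to_dict() returns an ordinary dictionary, it can be serialized with the standard json module. The sketch below is one way to do that; profile_to_json is a hypothetical helper, and the sample dictionary copies a subset of the fields shown above. Passing default=str is a precaution in case some field is not a JSON-native type, which is an assumption rather than something the output above confirms.

```python
import json
from typing import Any, Dict


def profile_to_json(profile_dict: Dict[str, Any]) -> str:
    """Serialize a profile dictionary to pretty-printed JSON.

    ensure_ascii=False keeps non-ASCII names readable; default=str is a
    fallback for values json cannot encode directly (an assumption).
    """
    return json.dumps(
        profile_dict,
        ensure_ascii=False,
        indent=2,
        sort_keys=True,
        default=str,
    )


# Subset of the fields from the example output above.
sample = {
    "name": "Buğra İşgüzar",
    "username": "bugraisguzar",
    "birthday": None,
    "followers_count": 483,
    "is_verified": False,
}

text = profile_to_json(sample)
print(text)
```

With a live profile, the argument would be Profile('bugraisguzar').to_dict().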

Contributing to twitter-scraper

To contribute to twitter-scraper, follow these steps:

Fork this repository.
Create a branch with a clear name: git checkout -b <branch_name>.
Make your changes and commit them: git commit -m '<commit_message>'
Push to the original branch: git push origin <project_name>/<location>
Create the pull request.

Alternatively see the GitHub documentation on creating a pull request.

Contributors

Thanks to the following people who have contributed to this project:

@kennethreitz (author)
@bisguzar (maintainer)
@lionking6792
@ozanbayram

Contact

If you want to contact me you can reach me at @bugraisguzar.

License

This project uses the following license: MIT.
