-
Notifications
You must be signed in to change notification settings - Fork 1
Twitter Tools
This list provides an overview of useful data collection tools that can be used for research on Twitter. If you face problems or issues with one of the applications on the list, feel free to post an Issue. It helps us to maintain this list.
Most of these Twitter tools connect to official Twitter APIs and therefore need an API key from Twitter. You can retrieve an API key from Twitter easily, just follow the documentation. You are bound to the restrictions given by Twitter. You can read about the rate limits here. Version 2 of the API will be more restrictive (at least it looks like that at time of writing). As an academic, you can apply for access to the new academic track (Twitter Academic API Track Application) though to elevate your access levels to 10 million tweets per month and access to the 'historic' archive, i.e., tweets older than 7 days.
Some of the tools are scrapers, which do not use the official APIs. Please be aware that the use of these tools might violate Twitter's Terms of Service. Despite being public, Twitter data can be very personal. Ensure to inform yourself thoroughly in order to follow data protection laws and ethical guidelines that apply to your research before starting your data collection.
API? | Last Tested | Language | Interfaces | Comments | |
---|---|---|---|---|---|
Facepager | V1/V2 | 2/2/2021 | - | GUI | No programming needed |
focalevents | Academic | Not Tested | Python | CLI | depends on PostgreSQL |
twacapic | Academic | 7/4/2021 | Python | CLI | early development |
twarc | V1/V2/Academic | 2/2/2021 | Python | CLI / Python Module | Programming possible |
TwitterAPI | V1/V2/Academic | 2/2/2021 | Python | Python Module | Programming needed |
Twint | Scraper | 2/2/2021 | Python | Python Module | Programming needed |
Twitterscraper | V1 | 2/2/2021 | Python | Python Module | Programming needed |
tweepy | V1/V2 | 2/2/2021 | Python | Python Module | Programming needed |
rtweet | V1 | Not Tested | R | R Module | Programming needed |
twitter-explorer | V1/V2/Academic | 19/01/2023 | Python | GUI | No Programming needed |
cta-tool | V2/Academic | 13/12/2021 | Python | Python Module | programming needed, collecting and counting conversations, MongoDB |
Twitter Downloader | Academic | 25/05/2022 | - | GUI | no programming needed, access to Tweets only |
The Digital Methods Initiative Twitter Capture and Analysis Toolset [DMI-TCAT] is a set of tools to retrieve and collect tweets from Twitter and to analyze them in various ways. It is written mostly in PHP and runs in a webserver (LAMP) environment...
DMI-TCAT provides robust and reproducible data capture and analysis, and interlinks with existing analytical software. Captured data sets can be refined in different ways (search queries, exclusions, date range, etc.) and the resulting selections of tweets can be analyzed in various ways, mainly by outputting files in standard formats (CSV for tabular files and GEXF for network files).
The big plus of DMI-TCAT is that it is organized around a MySQL database, which can run 24/7 robustly over months to years. However, setting up DMI-TCAT on a server requires some command line skills.
SMO-TMAS allows users to pull tweets of specified Twitter handles and tweets containing specified keywords by querying Twitter's REST API GET search/tweets endpoint and statuses/user_timeline endpoint as well as Twitter's STREAM API. The collected tweets can be downloaded as .csv file and SMO-TMAS also provides data analysis components that can be used to analyze and visualize the collected data right away...
known issues and limitations:
- is limited by the Twitter API limitations
Notable Features:
- Ideal for small datasets
- Accessible through the web
- No local installation needed
Installation via: Accessible through a Web Application, no local installation needed.
TAGS is a free Google Sheet template which lets you setup and run automated collection of search results from Twitter...
known issues and limitations:
- limited to search queries
Notable Features:
- easy to use, without command line
Installation via: An Google account is needed to install this sheets
Twitter explorer is an openly accessible interface that can collect, transform and visualize Twitter data. It can create a complex network without prior programming skills. twitter explorer consists of mainly three parts which are collector, visualizer, and explorer...
Notable Features:
- Collector :
- Collects Twitter using API search.
- Save the data in jsonl format.
- Visualizer:
- Timeline plotting
- Can generate network depending on retweet and Hashtags networks
- Can aggregate data based on node degree
- Plot can be export in .gml/.csv/.gv
- Explorer:
- Display network using an html format with interactive dashboard in a browser.
- Show information about datasets, number of nodes, node size, node scaling, users metadata on click.
Installation via: An installation package is available for Windows, Linux and MacOS
Requires python 3.6 or above .
# replace XXX by release number
cd ~/Downloads/twitter-explorer-vXXX
pip install -r requirements.txt
After installation we can collect data using streamlit
streamlit run collector.py
VOSONDash is an interactive R Shiny web application for the visualisation and analysis of social network data. The app has a dashboard layout with sections for visualising and manipulating network graphs, performing text analysis, displaying network metrics and the collection of network data using the vosonSML R package...
known issues and limitations:
- is limited by the Twitter API limitations
Notable Features:
- R application that connect to different social media APIs
- Built-in visual analysis, accessible through an web interface
- Cross-platform analysis
Installation via: CRAN
Twitter Downloader is a tool from Twitter that allows researchers with Academic Access to configure search queries for tweets via a web tool and load them as CSV or JSON files...
known issues and limitations:
- requires Academic Access
- Access to Tweets only
Notable Features:
- GUI: no programming skills needed
- allows full access to the Twitter archive going back to 2006 (deleted tweets always excluded)
Hydrator is an Electron based desktop application for hydrating Twitter ID datasets. Twitter's Terms of Service do not allow the full JSON for datasets of tweets to be distributed to third parties. However they do allow datasets of tweet IDs to be shared. Hydrator helps you turn these tweet IDs back into JSON and also CSV from the comfort of your desktop...
Notable Features:
- Program with a Graphic User Interface (GUI) making it easy to use for inexperienced users.
Installation via: An installation package is available for Windows, Linux and MacOS
This Wiki is curated by the Social Media Observatory, which is hosted by the Leibniz Institute for Media Research | Hans-Bredow-Institut and supported by the Research Institute Social Cohesion. A pretty version can be found here.
- Instagram-Tools
- Twitter-Tools
- Wikipedia-Tools
- Facebook-Tools
- YouTube-Tools
- Telegram-Tools
- Smaller Platform Tools
- Cross-Platform Tools
- General News Scrapers
- Secure Storage and Archiving
- Data Anonymization Tools
- Data Publishing