This library supports a Twitter bot that posts Google Streetview pictures of every property in an SQLite database. Existing instances of the bot: @everylotnyc, @everylotchicago, @everylotsf and @everylotla. Since maps are instruments of power, these bots is a way of generating a tension between two different modes establishing power in urban space. Read more about that.
Set up will be easier with at least a basic familiarity with the command line. A knowledge of GIS will be helpful.
- A fresh Twitter account and a Twitter app, with registered keys
- A Google Streetview API token.
- A SQLite db with a row for every address you're interested in
- A place to run the bot (like a dedicated server or an Amazon AWS EC2 instance)
Creating a new Twitter account should be straightforward. To create a Twitter app, register at developer.twitter.com/apps. Once you have an app, you'll need to register your account with the app. Twitter has details on that process.
Once you have the keys, save them in a file called bots.yaml
that looks like this:
apps:
everylot:
consumer_key: 123456890123456890
consumer_secret: 123456890123456890123456890123456890
users:
example_user_name:
key: 123456890123456890-123456890123456890123456890123456890
secret: 1234568901234568901234568901234568901234568
app: everylot
Change example_user_name
to your Twitter account screen name. The app name can also be anything you want, but the name in the apps section must match the value of the users's app
key.
This file can be in json
format, if you wish.
Visit the Google Street View Image API page and click get a key. Make sure that your account and key have both street view and geocoding enabled.
Once you have the key, save it on its own line in your bots.yaml
file like so:
streetview: 123ABC123ABC123ABC123ABC
apps: [etc]
users: [etc]
Now, you'll need an SQLite database of addresses. At a minimum, the address database just needs an id field and list of addresses. It's helpful to also have a lat/lon coordinates, since if the Google API can't find a nearby address, the bot will use lat/lon instead.
One way to get this database is to download geodata and convert to to SQLite. Visit your county's open data page (if it has one). Ideally, you'll end up with the data in Shapefile format, which is actually four or five files that look like:
Parcels_2015.dbf Parcels_2015.prj Parcels_2015.shp Parcels_2015.shx Parcels_2015.shp.xml
While you're at it, make sure to download the metadata and carefully note the fields you'll want to track. At a minimum, you'll need an ID field and an address field. The address may be broken into several parts, that's fine. A field that tracts the number of floors would be nice, too.
Now, you'll need to transform that Shapefile into an SQLite database. The database should have a table named lots
with these fields: id
, lat
, lon
, tweeted
(the last should be set to zero initially). You must also have some fields that represent the address, like address
, city
and state
. Or, you might have address_number
, street_name
and city
. Optionally, a floors
field is useful for pointing the Streetview "camera".
(In the commands below, note that you don't have to type the "$", it's just there to mark the prompt where you enter the command.)
One way to create a the SQLite database is with GDAL command line tools. If you're on OS X and don't have a GIS handy, install Homebrew. Then, paying attention to the fields you noted, do something like this:
# this may take a while, you're installing a big software library
$ brew install gdal
# Convert the layer to Google's projection and remove unneeded columns.
# Check the file carefully, its field names will differ from this example.
$ ogr2ogr -f SQLite Parcels_2015_4326.db Parcels_2015.shp -t_srs EPSG:4326 -select taxid,addr,floors
$ ogr2ogr -f SQLite lots.db Parcels_2015_4326.db -nln lots \
-sql "SELECT taxid AS id, addr AS address, floors, \
ROUND(X(ST_Centroid(GeomFromWKB(Geometry))), 5) lon, \
ROUND(Y(ST_Centroid(GeomFromWKB(Geometry))), 5) lat, \
0 tweeted \
FROM Parcels_2015_4326 ORDER BY taxid ASC"
If you don't want to install GDAL, you can use other command line tools (e.g. mapshaper
) or a GIS like QGIS or ArcGIS to create a CSV and load it into SQLite:
# Convert a CSV (`lots.csv`) to SQLite with two steps
$ sqlite3 -separator , lots.db "import 'stdin' tmp" < lots.csv
$ sqlite lots.db "CREATE TABLE lots AS SELECT taxid AS id, CONVERT(lat, NUMERIC) AS lat,
CONVERT(lon, NUMERIC) AS lon, address, city, state, 0 AS tweeted FROM tmp;"
However you create the SQLite db, add an index (skip this step if have a very small database or like waiting for commands to run):
$ sqlite3 lots.db "CREATE INDEX i ON lots (id);"
Install this repository. First download or clone the repo, and open the folder up in terminal. Then run:
$ python setup.py install
You'll now have a command available called everylot
. It works like this:
$ everylot SCREEN_NAME DATABASE.db --config bots.yaml
This will look in DATABASE.db
for a table called lots, then sort that table by id
and grab the first untweeted row. The bot then checks where Google thinks this address is, and make sure it's close to the coordinates in the table. Then it wil use the address (or the coordinates, if they seem more reliable) to find a Streetview image, then post a tweet with this image to SCREEN_NAME
's timeline. It will need the authorization keys in bots.yaml
to do all this stuff. After posting, the bots saves the ID of the tweet it made to the tweeted
field of that row.
Everylot
will, by default, try to use address
, city
and state
fields from the database to search Google, then post to Twitter just the address
field.
You can customize this based on the layout of your database and the results you want. everylot
has two options just for this:
--search-format
controls how address will be generated when searching Google--print-format
controls how the address will be printed in the tweet
The format arguments are strings that refer to fields in the database in {brackets}
. The default search-format
is {address}, {city}, {state}
, and the default print-format
is {address}
.
Search Google using the address
field and the knowledge that all our data is in Kalamazoo, Michigan:
everylot everylotkalamazoo ./kalamazoo.db --config ./bots.yaml --search-format '{address}, Kalamazoo, MI'
Search Google using an address broken-up into several fields:
$ everylot everylotwallawalla walla2.db --config bots.yaml \
--search-format '{address_number} {street_direction} {street_name} {street_suffix}, Walla Walla, WA'
You might leave off the city and state when tweeting because that's obvious to your followers:
$ everylot everylotwallawalla walla2.db --config bots.yaml \
--search-format '{address_number} {street_direction} {street_name} {street_suffix}, Walla Walla, WA' \
--print-format '{address_number} {street_direction} {street_name} {street_suffix}'
While testing, it might be helpful to use the --verbose
and --dry-run
options. Use the --id
option to force everylot
to post a particular property:
everylot everylotpoughkeepsie pkpse.db --config bots.json --verbose --dry-run --id 12345
Now, you just need a place for the bot to live. This needs to be a computer that's always connected to the internet, and that you can set up to run tasks for you. You could use a virtual server hosted at a vendor like Amazon AWS, Linode or DigitalOcean, or space on a web server.
Put the bots.yaml
file and your database in the same folder on your server, then download this repository and install it as above.
Next, you want to set up the bot to tweet regularly. If this is a Linux machine, you can do this with crontab:
crontab -e
1,31 * * * * $HOME/.local/bin/everylot screen_name $HOME/path/to/lots.db -s '{address} Anytown USA'
If you want to use nano instead of vim as your editor, use EDITOR=nano crontab -e
and just paste the second line inside.
(Note that you can omit the bots.yaml
config file argument if it's located in the home directory.)
This walks through the steps of creating an example bot. It uses text-based command line commands, but most of these tasks could be done in programs with graphic interfaces.
First step is to find the data: google "Baltimore open data", search for parcels on data.baltimorecity.gov.
# Also works to download through the browser and unzip with the GUI
$ curl -G https://data.baltimorecity.gov/api/geospatial/rb22-mgti \
-d method=export -d format=Shapefile -o baltimore.zip
$ unzip baltimore.zip
Archive: baltimore.zip
inflating: geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.shp
inflating: geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.shx
inflating: geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.dbf
inflating: geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.prj
# Get a simpler name
$ mv geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.shp baltimore.shp
$ mv geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.shx baltimore.shx
$ mv geo_export_9f6b494d-b617-4065-a8e7-23adb09350bc.dbf baltimore.dbf
# Find the address and ID fields. It looks like we'll want to use a combination of
# blocknum and parcelnum to get a unique ID for each property
$ ogrinfo baltimore.shp baltimore -so
INFO: Open of 'baltimore.shp'
using driver 'ESRI Shapefile' successful.
...
parcelnum: String (254.0)
...
blocknum: String (254.0)
fulladdr: String (254.0)
...
# Create an SQLite database, reprojecting the geometries to WGS84. Keep only the desired fields
$ ogr2ogr -f SQLite baltimore_raw.db baltimore.shp baltimore -t_srs EPSG:4326
-nln baltimore -select parcelnum,blocknum,fulladdr
# Convert feature centroid to integer latitude, longitude
# Pad the block number and parcel number so sorting works
# Result will have these columns: id, address, lon, lat, tweeted
$ ogr2ogr -f SQLite baltimore.db baltimore_raw.db -nln lots -nlt NONE -dialect sqlite
-sql "WITH A as (
SELECT blocknum,
parcelnum,
fulladdr AS address,
ST_Centroid(GeomFromWKB(Geometry)) centroid
FROM baltimore
WHERE blocknum IS NOT NULL AND parcelnum IS NOT NULL
) SELECT (substr('00000' || blocknum, -5, 5)) || (substr('000000000' || parcelnum, -9, 9)) AS id,
address,
ROUND(X(centroid), 5) lon, ROUND(Y(centroid), 5) lat,
0 tweeted
FROM A;"
# Add an index
$ sqlite3 baltimore.db "CREATE INDEX i ON lots (id);"
$ everylot everylotbaltimore baltimore.db --search-format "{address}, Baltimore, MD"