This is a fork of the everylotbot library by Neil Freeman. It was created by Angus B. Grieve-Smith, Timm Dapper, Laura Silver and Elber Carneiro for the 2016 Trees Count! Data Jam sponsored by the New York City Parks Department.
everytreebot supports a Twitter bot that posts Google Street View pictures of every tree in a SQLite database. It is currently running at @everytreenyc.
Existing instances of everylotbot: @everylotnyc, @everylotchicago, @everylotsf and @everylotla. Since maps are instruments of power, these bots are a way of generating tension between two different modes of establishing power in urban space. Read more about that.
Setup will be easier with at least basic familiarity with the command line. You will need:
- A fresh Twitter account and a Twitter app, with registered keys
- A Google Street View API token.
- A SQLite db with a row for every address you're interested in
- A place to run the bot (like a dedicated server or an Amazon AWS EC2 instance)
Creating a Twitter account should be straightforward. To create a Twitter app, register at apps.twitter.com/. Once you have an app, you'll need to register your account with the app. Twitter has details.
Visit the Google Street View Image API page and get a key.
For the NYC Trees Count! Data Jam, Laura Silver created a range of phrases and templates to be chosen semi-randomly when composing tweets. These are available in the EveryTreeNYC_Phrases.csv file in this repository. You will need to specify the full path to that file.
Once you have the keys and path, save them in a file called bots.yaml that looks like this:
consumer_key: 123456890123456890
consumer_secret: 123456890123456890123456890123456890
key: 123456890123456890-123456890123456890123456890123456890
secret: 1234568901234568901234568901234568901234568
app: everylot
streetview: 1234567890asdfghjklQWERTYUIOP
phrasefile: /home/example/everylot/data/phrases.csv
This file can be in json format, if you wish.
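For reference, the same configuration in JSON (using the placeholder values from the YAML example above; the exact keys the loader accepts are assumed to match) would look like this:

```json
{
  "consumer_key": "123456890123456890",
  "consumer_secret": "123456890123456890123456890123456890",
  "key": "123456890123456890-123456890123456890123456890123456890",
  "secret": "1234568901234568901234568901234568901234568",
  "app": "everylot",
  "streetview": "1234567890asdfghjklQWERTYUIOP",
  "phrasefile": "/home/example/everylot/data/phrases.csv"
}
```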
Now, you'll need a SQLite database of trees. For @everytreenyc we exported a CSV from New York City's Socrata site. Here are the fields that we use in the bot:
For general tracking:

- tree_id

For composing the tweets:

- address
- health
- spc_common
- spc_latin
- status
- steward
- zip_city

For Google Street View lookups:

- address
- boroname
- latitude
- longitude
- state
- zip_city

We will need to add a column so that we don't tweet the same tree over and over again:

- tweeted

We will also need an index, unless you have a very small database or lots of spare time.
Convert that CSV to SQLite using the SQLite command line:
> sqlite3 trees.db
sqlite> .mode csv
sqlite> .import trees.csv trees
sqlite> ALTER TABLE trees ADD COLUMN "tweeted" TEXT default "0";
sqlite> CREATE INDEX i ON trees (tree_id, tweeted);
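To confirm the import and the new column worked, you can run a quick sanity check. This is a sketch using Python's built-in sqlite3 module; the table and column names match the import commands above:

```python
import sqlite3

def untweeted_count(db_path):
    """Return how many trees have not been tweeted yet."""
    conn = sqlite3.connect(db_path)
    try:
        (count,) = conn.execute(
            "SELECT COUNT(*) FROM trees WHERE tweeted = '0'"
        ).fetchone()
        return count
    finally:
        conn.close()
```

Right after the import, this should report the full row count of your CSV.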
Install this repository:
> git clone https://github.com/grvsmth/everytreebot.git
> cd everytreebot
> python setup.py install
While you're testing, make your new Twitter account private.
You'll now have a command available called everylot. It works like this:
everylot SCREEN_NAME DATABASE.db --config bots.yaml
This will look in DATABASE.db for a table called trees, then grab an untweeted row at random.
It will check where Google thinks this address is, and make sure it's close to the coordinates in the table. Then it will use the address (or the coordinates, if they seem more reliable) to find a Street View image, then post a tweet with this image to SCREEN_NAME's timeline. It will need the authorization keys in bots.yaml to do all this.
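The selection step amounts to something like the following. This is a simplified sketch, not the bot's actual code; the function names are assumptions, and only the table and column names come from the setup above:

```python
import sqlite3

def pick_random_untweeted(conn):
    """Pick one tree that has not been tweeted yet, at random."""
    return conn.execute(
        "SELECT * FROM trees WHERE tweeted = '0' ORDER BY RANDOM() LIMIT 1"
    ).fetchone()

def mark_tweeted(conn, tree_id):
    """Record the tree so it is never picked again."""
    conn.execute(
        "UPDATE trees SET tweeted = '1' WHERE tree_id = ?", (tree_id,)
    )
    conn.commit()
```

The (tree_id, tweeted) index created earlier is what keeps the WHERE clause fast on a database with hundreds of thousands of rows.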
By default, everylot will try to use the address, city and state fields from the database to search Google, then post just the address field to Twitter.
You can customize this based on the layout of your database and the results you want. everylot has two options just for this:
- --search-format controls how the address will be generated when searching Google
- --print-format controls how the address will be printed in the tweet
The default search-format is {address}, {city}, {state}, and the default print-format is {address}.
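These behave like ordinary Python format strings: each {name} placeholder is filled from the matching column in the row. A quick illustration (the sample row values here are made up):

```python
# A row from the database, as a dict of column name -> value
row = {"address": "100 Broadway", "city": "Kalamazoo", "state": "MI"}

search_format = "{address}, {city}, {state}"  # the default search format
print_format = "{address}"                    # the default print format

search_text = search_format.format(**row)
tweet_text = print_format.format(**row)

print(search_text)  # 100 Broadway, Kalamazoo, MI
print(tweet_text)   # 100 Broadway
```

Any column in your table can be used as a placeholder, which is what makes the multi-field examples below work.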
Search Google using the address field and the knowledge that all our data is in Kalamazoo, Michigan:
everylot everylotkalamazoo ./kalamazoo.db --config ./bots.yaml --search-format '{address}, Kalamazoo, MI'
Search Google using an address broken-up into several fields:
everylot everylotwallawalla walla2.db --config bots.yaml \
--search-format '{address_number} {street_direction} {street_name} {street_suffix}, Walla Walla, WA'
In practice, you want to do the same thing when posting to Twitter, but you leave off the city and state because that's obvious to your followers:
everylot everylotwallawalla walla2.db --config bots.yaml \
--search-format '{address_number} {street_direction} {street_name} {street_suffix}, Walla Walla, WA' \
--print-format '{address_number} {street_direction} {street_name} {street_suffix}'
Include the property ID in the tweet, in brackets:
everylot everylotkalamazoo kalamazoo.db --config bots.yaml --search-format '{address}, Kalamazoo, MI' \
--print-format '{address} [{id}]'
While you're testing, it might be helpful to use the --verbose and --dry-run options. Also, use the --id option to force everylot to post a particular property.
everylot everylotpoughkeepsie pkpse.db --config bots.json --verbose --dry-run --id 12345
Now, you just need a place for the bot to live. This needs to be a computer that's always connected to the internet, and that you can set up to run tasks for you. You could use a virtual server hosted at a vendor like Amazon AWS, Linode or DigitalOcean, or space on a web server.
Put the bots.yaml file and your database in the same folder on your server, then download this repository and install it as above.
Next, you want to set up the bot to tweet regularly. If this is a Linux machine, you can do this with crontab:
crontab -e
1,31 * * * * $HOME/.local/bin/everylot screen_name $HOME/path/to/trees.db --config $HOME/path/to/bots.yaml
Some schedulers (like the one provided by pythonanywhere.com) only allow scheduling every hour, or every day. The badtime() function allows you to specify two parameters in the bots.yaml file:
- hoursbetween indicates how many hours the script should wait in between tweets, e.g. a value of 3 will tweet every three hours
- quiethours is a list indicating a range of hours when tweeting shouldn't happen, e.g. a value of [23, 7] will not tweet between the hours of 11pm and 7am
(The function is not sophisticated enough to deal with a quiethours range where both hours are on the same side of midnight.)
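As a rough sketch of the logic described above (this is not the actual implementation; the signature and the modulo scheduling are assumptions):

```python
from datetime import datetime

def badtime(now, hoursbetween=3, quiethours=(23, 7)):
    """Return True if the bot should skip tweeting right now.

    quiethours is a (start, end) pair that wraps past midnight,
    e.g. (23, 7) means no tweets between 11pm and 7am.
    """
    start, end = quiethours
    if start > end:  # range wraps midnight, e.g. 23 -> 7
        if now.hour >= start or now.hour < end:
            return True
    # Only tweet on hours that fall on a multiple of hoursbetween,
    # so an hourly scheduler effectively runs every N hours.
    return now.hour % hoursbetween != 0
```

Note that, as stated above, the start > end check only covers a quiet range that crosses midnight.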
(Note that you can omit the bots.yaml config file argument if it's located in the home directory.)