Welcome to Twitter AWS

Twitter AWS is a Rails application that helps you to:

Collect tweets from the Twitter garden hose for a given keyword
Create a set of classification criteria, e.g. "Is this tweet funny?" or "Is it about a product?"
Send those tweets to Amazon Mechanical Turk and let the crowd answer the questions
Review the answers

Getting Started

You will need an AWS account with some funds on it
You will need the aws gem: http://ruby-aws.rubyforge.org/ruby-aws/
You will need the tweetstream gem: gem install intridea-tweetstream

Projects

For each set of questions and keywords you can create a project.
Once you have set those up, there are two daemons that need to be started:

The Twitter keyword collection daemon
The Amazon MTurk daemon

Daemons

You can start the daemons from the administration console once the webserver is running.

Twitter Daemon

The Twitter collection daemon watches the firehose for each project's keywords and adds matching tweets to the corresponding project.
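The matching step can be sketched roughly as follows. This is a minimal illustration, not the code in collect_daemon.rb: the Project struct, the method names, and the case-insensitive substring match are all assumptions, and the streaming connection itself is left out.

```ruby
# Hypothetical in-memory stand-in for a project record; the real
# application presumably keeps projects in its database.
Project = Struct.new(:name, :keywords, :tweets)

# Returns the projects whose keywords appear in the tweet text.
# A case-insensitive substring match is assumed for illustration.
def matching_projects(projects, text)
  lowered = text.downcase
  projects.select do |project|
    project.keywords.any? { |keyword| lowered.include?(keyword.downcase) }
  end
end

# Adds the tweet text to every project it matches and returns those projects.
def route_tweet(projects, text)
  matching_projects(projects, text).each { |project| project.tweets << text }
end
```

A tweet that matches no keyword is dropped, and a tweet that matches keywords from several projects is stored once per project.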

MTurk Daemon

The MTurk daemon periodically looks through the database for jobs that have been marked for crowdsourcing.
If a job was marked for crowdsourcing, the daemon sends it to MTurk for tagging
and then periodically checks whether the job has been processed yet.
Processed jobs are reviewed and downloaded.
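One pass of that loop might look like the sketch below. It is an illustration under stated assumptions rather than the contents of mturk_daemon.rb: the Job struct, its state names, and the client methods submit, processed? and results are hypothetical and are not the ruby-aws API. The FakeClient exists only so the loop can be tried without contacting MTurk, and the review step is left out.

```ruby
# Hypothetical job record; the :state values are illustrative only.
Job = Struct.new(:id, :state, :answers)

# Runs one pass over the given jobs.
# `client` is any object responding to submit(job), processed?(job) and
# results(job); these method names are assumptions, not the ruby-aws API.
def run_cycle(jobs, client)
  jobs.each do |job|
    case job.state
    when :marked
      # Job was flagged for crowdsourcing: send it out.
      client.submit(job)
      job.state = :submitted
    when :submitted
      # Job is already out: fetch results only once it has been processed.
      next unless client.processed?(job)
      job.answers = client.results(job)
      job.state = :downloaded
    end
  end
end

# Fake client for trying the loop offline: a job counts as processed
# once it has been polled `polls_needed` times.
class FakeClient
  def initialize(polls_needed)
    @polls_needed = polls_needed
    @polls = Hash.new(0)
  end

  def submit(job); end

  def processed?(job)
    (@polls[job.id] += 1) >= @polls_needed
  end

  def results(job)
    ["answer for job #{job.id}"]
  end
end
```

A daemon built this way would call run_cycle repeatedly with a pause between passes; jobs in any other state are left untouched.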

Quality Considerations

When using Amazon MTurk, a few settings improve the chances of getting good results:

a) For each project you can set a reward per job
b) For each project you can set a minimum average rating that a worker must have achieved on past work
c) We reject workers who submit the exact same answer more than 5 times and put them on a blacklist
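The blacklist rule can be sketched as a simple count per worker and answer. This is a minimal illustration: the [worker_id, answer] pair shape and the method name are assumptions, and only the threshold of 5 comes from the rule above.

```ruby
# Threshold from the rule above: more than 5 identical answers.
MAX_IDENTICAL_ANSWERS = 5

# submissions: array of [worker_id, answer] pairs (a hypothetical shape).
# Returns the ids of workers who gave the exact same answer more than
# MAX_IDENTICAL_ANSWERS times.
def workers_to_blacklist(submissions)
  counts = Hash.new(0)
  submissions.each { |worker_id, answer| counts[[worker_id, answer]] += 1 }
  counts.select { |_, count| count > MAX_IDENTICAL_ANSWERS }
        .keys.map(&:first).uniq
end
```

A worker who gives the same answer exactly 5 times is kept, and the sixth identical answer triggers the blacklist.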

Reading up on MTurk

You can read more about how MTurk works at: http://aws.amazon.com/mturk/
