Tutorial on Big Data and Online Crowd-Sourcing for Vision Research

Creating Big DataSets

Vision Sciences Society Satellite Event, Friday, May 18, 2018.
Organizer: Wilma A. Bainbridge, NIMH
Speakers: Tim Brady (UCSD), Gijsbert Stoet (Leeds Beckett University), Wilma Bainbridge (NIMH), Dwight Kravitz (George Washington University)

Online experiments and Big Data are becoming big topics in the field of vision science, but can be hard to access for people not familiar with web development and coding. This tutorial will teach attendees the basics of creating online crowd-sourced experiments, and how to think about collecting and analyzing Big Data related to vision research. Four experts in the field will discuss how they use and collect Big Data, and give hands-on practice to tutorial attendees. We will discuss Amazon Mechanical Turk, its strengths and weaknesses, and how to leverage it in creative ways to collect powerful, large-scale data. We will then discuss Psytoolkit, an online experimental platform for coding timed behavioral and psychophysical tasks, that can integrate with Amazon Mechanical Turk. We will then discuss how to create Big Datasets using various ways of “scraping” large-scale data from the internet. Finally, we will discuss other sources of useful crowd-sourced data, such as performance on mobile games, and methods for scaling down and analyzing these large data sets.

Here, I have made available material related to the course available.

Tim Brady - Introduction to Amazon Mechanical Turk
8:33am - 9:18am
The slides for the current tutorial are not yet available. In the meantime, you can view his previous tutorials on Amazon Mechanical Turk here.

Gijsbert Stoet - Introduction to Psytoolkit
9:21am - 10:06am
You can watch his tutorial on Psytoolkit here, and learn specifically about how you can integrate Psytoolkit with Amazon Mechanical Turk here. Learn more about Psytoolkit and access their website here.

Wilma Bainbridge - Creating Big Datasets and Data-scraping
You can view the slides and demonstrations, example code, and tutorial materials .

Dwight Kravitz - Good Scientific Practices of Big Dta
Tutorial information coming soon.

Disclaimer: I collect minor usage statistics (geographic location and time) for downloads of my tutorial materials.