Materials

The following data, code, and documents from our project are made available to the public. We are not responsible for any potential issues incurred in using them. More information about these materials can be found in our Publications.

Contents

  1. A Phishing Study with Real Mechanical Turk Users
  2. Modeling and Simulation of Cognition in Phishing Detection

A Phishing Study with Real Mechanical Turk Users

Description

In Fall 2017 a user study of phishing email detection was deployed and conducted on Amazon Mechanical Turk. Each participant interacted with Roundcube, an open source webmail system, to classify a set of 40 emails into either legitimate or phishing categories. Detailed behavioral information was collected of these participants, including every action taken on the web user interface in completing requested tasks.

In a multitasking context, a subset of these participants was asked to solve problems being presented concurrently with these emails, by an embedded Qualtrics iframe.

Moreover, as an incentive mechanism, some of these participants would receive additional tiered monetary rewards if they achieved accuracies as required. Information on design and implementation of this user study and all the logged user behavior data are available below for download.

User Study Overview

To start, you can review a paper on the overall user study and some preliminary results. [link]

User Study Platform Design and Implementation

Design and implementation of this user study platform can be found at this document. [link]

Emails Used

Description of the 40 emails used in this user study can be found here. [link]

Qualtrics Questions

The set of Qualtrics questions used for multi-tasking experiments can be found here. [link]

User Study Data

The log files of 177 participants who have completed the user study are in this zip file. [link]

Data Dictionary

Captured information is explained here. [link]

Modeling and Simulation of Cognition in Phishing Detection

Description

A simulation has been developed to model participants classifying emails controlled by a cognitive chunk activation mechanism similar to that in the ACT-R architecture. You are encouraged to first review a paper on this simulation study.

Old Simulation Code

The code in python used for this simulation study can be downloaded here. [link]

Email and Cue Coding Used in Simulation

The information cues and the associated 40 emails used for simulation is available here. [link]


Note:
Any opinions, findings and conclusions or recommendations expressed on this web site are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF).

Last updated on July 11, 2019