University of Leeds Twitter Spark API Programming Project
1 Introduction
This assignment tests your ability to use various Spark APIs to implement given workloads.
It also tests your ability to analyze and tune the performance of Spark applications.
The assignment data set contains tweet objects downloaded from Twitter using Twitter's
standard search API. The downloaded tweet objects are stored in a single JSON file. A
tweet object may refer to a general tweet, a retweet or a reply to a tweet. A general tweet is
“a message posted to Twitter containing text, photos, a GIF, and/or video”