1)HR analytics

Our example concerns a big company that wants to understand why some of their best and most experienced employees are leaving prematurely. The company also wishes to predict which valuable employees will leave next. At this stage we want to understand the data that compose our Analytical Base Table (ABT) and assess where the quality of it might suffer.

2)Rental Listing NY Map

In this competition, you will predict how popular an apartment rental listing is based on the listing content like text description, photos, number of bedrooms, price, etc. The data comes from renthop.com, an apartment listing website. These apartments are located in New York City.

3)Personalised Medicine - EDA with tidy R

This is an early Exploratory Data Analysis for the Personalized Medicine, Redefining Cancer Treatment challenge. I will be using ggplot2 and the tidyverse tools to study and visualise the structures in the data. We have been challenged to automatically classify genetic mutations that contribute to cancer tumor growth (socalled drivers) in the presence of mutations that are dont affect the tumors (passengers).

4)Countries the most mentioned by Hillary

About this Dataset Throughout 2015, Hillary Clinton has been embroiled in controversy over the use of personal email accounts on non-government servers during her time as the United States Secretary of State. Some political experts and opponents maintain that Clinton use of personal email accounts to conduct Secretary of State affairs is in violation of protocols and federal laws that ensure appropriate recordkeeping of government activity. Hillary campaign has provided their own four sentence summary of her email use here.

5)Exploratory Analysis - NYC Taxi Trip

This is an Exploratory Data Analysis for the NYC Taxi Ride Duration competition. In this competition, NYC Taxi and Limousine Commission is challenging you to build a model that predicts the total ride duration of taxi trips in New York City. The primary dataset includes pickup time, geocoordinates, number of passengers, and several other variables.