This page is progress journal for R Junkies group.
You can find the latest version of our group project below:
R_Junkies Group Project
Reviewing Group Name: Rjunkies Reviewed Group Name: NYC
Project Description: Find the bus lines with the highest and lowest deviation from the regular schedule in New York City
Dataset: Dataset is from the NYC MTA buses data stream service. Dataset contain 24 for variable;In roughly 10 minute increments the bus location, route, bus stop and more is included in each row. Data for the entire month of June 2017 is included.
Appropriateness: Dataset is completely appropriate with project aim. Dataset includes expected and scheduled arrival times of buses. Also dataset is fairly large, reproducible and not confidental.
Proposed Project Flow: They are not proposed project flow yet.
Analysis Suggestion: We suggest that;
We’ve decided to add new datasets to our project. They will help answering various questions.
OSYM Data is updated to v2. With new data, We see that General_quota type is changed from integer to character. You can find the third analysis here.
We learned how to pass variable to ggplot.
Update:Due to UTF-8 problem. Github doesn’t allow to update proposal page.Problem is solved. [Updated:31/10/2017]
Our dataset is Airplane Crash from 1908 to 2017. Here you can find our project proposal.
We made major mistakes in wording and visualizations during class time. We’ve fixed them and also solved the UTF-8 Turkish Character problem. You can find the second analysis here.
Now we call ourselves “R Junkies”. We’ve started working on OSYM dataset. You can find the initial analysis here.
Today we’ve built our project group. We haven’t decided on a name and our dataset yet. The group members are below.