Week 7 (Dec 26, 2018)
Week 6 (Dec 12, 2018)
- Machine Learning Part II (html | pdf)
- Machine Learning Parts Exercises (ps. not assignments)
Week 5 (Nov 28, 2018)
- Guest lecture by Mustafa Onan, CFA from Sabancı Holding.
- Machine Learning Part I (html | pdf)
- Bonus: Introduction to Cloud Computing. Make sure you get the GitHub Student Developer Pack. Participation is completely voluntary.
- Project Guidelines html | pdf
Week 4 (Nov 14, 2018)
- (Nov 20, 2018) All ODD car sales data can be found in here. Deadline extended to Nov 25.
- Guest lecture by Mehmet Memecan, Co-Founder and CEO of Tarfin, an fintech company specialized in agricultural inputs financing.
- This week we are going to learn about
reticulatepackages. Remember reticulate requires Python installed. Make sure these packages are working on your computers.
Week 3 (Oct 31, 2018)
- Guest lecture by Hayrettin Konuk from Borsa Istanbul Takasbank (file)
- ggplot2 (html | pdf)
- Lubridate (Link)
- EGM Example (data) (source)
- ODD Assignment Example Rmd file (click)
ODD Assignment Part 2
- Submit a merge request to this file with the code piece to add your rds file to the main file. Replicate the example with your working code.
- Perform a brief but striking exploratory analysis with the full car sales data. Your main analysis should include only a single angle and a single plot. You can perform more analyses after the first analysis.
Group Projects Initialization
- Email the instructor with your intentions for a project (Proposal Phase). Your intentions should include the topic, a description of the data and tentatively planned (3 to 5 items) work.
- You may ask the instructor about creative project ideas.
- Create group progress journal (gpj18) with the link provided by the instructor.
- All members of the group should join to the repository. Remember your group should consist of either 4 or 5 people.
- On your gpj page write the name of the group, group member name surnames, brief description of your intended project and data.
- Perform an initial exploratory analysis on the data and post the outcome html on your gpj.
- Email the instructor the link of your gpj18 webpage.
Week 2 (Oct 17, 2018)
- Some base R exercises (Solutions!) (html)
- Tidyverse Introduction: dplyr (html |pdf)
- dplyr join operations 1, 2
- Bonus Material (Program coordinator’s recommendation): EdX - The Analytics Edge
- ODD Assignment Example (html)
This is a very easy one. The homework is about standardizing and merging data. I did most of the work in the example on course webpage. I expect something very similar. Here are the steps. Please try to do it as quick as possible and cooperate with your classmates if you are having any trouble.
- Go to the Google Drive Excel sheet that I sent the invites to you.
- Choose a month (preferably most recent that is not taken) and write your name and email on it.
- Go to ODD site and download the data. Rename and add it to your github repository.
- Follow the example and write the RMarkdown document that will download the raw excel from your repository, manipulate it in the desired format (check the example) and save it in an RDS file.
- Add some analysis to the RMarkdown document using your dplyr skills.
Week 1 (Oct 3, 2018)
- Cheat Sheet Heaven
- Introduction to R (html | pdf)
- Some base R exercises (html)
- Datacamp Homework (Don’t forget to accept your invitations)
- RMarkdown Homework: Prepare an RMarkdown document. Introduce yourself in one paragraph (Your name surname, your work, your data interests and how you (plan to) use data science skills in your current/future work). Watch some UseR-2018 videos (Link) and write one of them down on your RMarkdown document. Find 3 R posts relevant to your work and describe them. Get the html output and put it in your progress journal repository. Provide link from your Progress Journal page. (Click for example)
- Form teams of 2-3 and prepare for major projects (we will discuss in week 2).
Week 0 (Sep 24, 2018)
- Tutorial: Initializing Your Progress Journal (download)
- Some light reading about the previous year. (Read on Blog)
- Some light reading about instructor’s view on R. (Read on Blog)
This course benefits from DataCamp for the Classroom program. See details here.
Data Sets for Prospective Projects
- Merkez Bankası - CBRT
- Emeklilik Gözetim Merkezi
- TURKSTAT - TUIK
For audiovisual learners, some webinars here.
- Official dplyr tutorial
- dplyr join functions
- dplyr join functions official tutorial
- dplyr Cheat Sheet
- Introduction to RMarkdown - Official
- R4DS Book - Communication
- DataCamp - Authoring R Markdown Reports Free Part
- RMarkdown Cheat Sheet
External Good Resources About R and Data Science
- Introduction to Statistical Learning
- R for Data Science
- R’a Hızlı Giriş (Türkçe)
- The Elements of Statistical Learning
- Advanced R
- Bookdown Compilation
- Akademik Bilişim 2017 - R ile Veri Analizi Dersi
- BOUN-FE 522
- Learn X in Y Minutes - R
- dplyr vignettes
- ggplot2 workshop
- RStudio Cheat Sheets (Base R, dplyr, ggplot2, RMarkdown etc.)
- R Reference Cards
- data.table Cheat Sheet