Course Essentials
- Syllabus (download)
- Progress Journals
Week 7 (Dec 26, 2018)
- Presentations!
Week 6 (Dec 12, 2018)
- Guest lecture by Uğur Kemal Solak from Accenture Turkey.
- Machine Learning Part II (html | pdf)
- Machine Learning Parts Exercises (ps. not assignments)
Week 5 (Nov 28, 2018)
- Guest lecture by Mustafa Onan, CFA from Sabancı Holding.
- Machine Learning Part I (html | pdf)
- Bonus: Introduction to Cloud Computing. Make sure you get the GitHub Student Developer Pack. Participation is completely voluntary.
- Create an account in Digital Ocean
- Check this old tutorial
- Magic command:
docker run -e USER=rstudio -e PASSWORD=rstudio2 -d -p 8787:8787 rocker/tidyverse
- Project Guidelines html | pdf
Week 4 (Nov 14, 2018)
- (Nov 20, 2018) All ODD car sales data can be found in here. Deadline extended to Nov 25.
- Guest lecture by Mehmet Memecan, Co-Founder and CEO of Tarfin, an fintech company specialized in agricultural inputs financing.
- This week we are going to learn about
shiny
andreticulate
packages. Remember reticulate requires Python installed. Make sure these packages are working on your computers.- Package installations link
shiny
to create interactive dashboards on web browsers (official tutorial). Also see the Shiny Cheatsheet.reticulate
to seamlessly integrate python scripts (official tutorial)
Week 3 (Oct 31, 2018)
- Guest lecture by Hayrettin Konuk from Borsa Istanbul Takasbank (file)
- ggplot2 (html | pdf)
- Lubridate (Link)
- EGM Example (data) (source)
- ODD Assignment Example Rmd file (click)
ODD Assignment Part 2
- Submit a merge request to this file with the code piece to add your rds file to the main file. Replicate the example with your working code.
- Perform a brief but striking exploratory analysis with the full car sales data. Your main analysis should include only a single angle and a single plot. You can perform more analyses after the first analysis.
Group Projects Initialization
- Email the instructor with your intentions for a project (Proposal Phase). Your intentions should include the topic, a description of the data and tentatively planned (3 to 5 items) work.
- You may ask the instructor about creative project ideas.
- Create group progress journal (gpj18) with the link provided by the instructor.
- All members of the group should join to the repository. Remember your group should consist of either 4 or 5 people.
- On your gpj page write the name of the group, group member name surnames, brief description of your intended project and data.
- Perform an initial exploratory analysis on the data and post the outcome html on your gpj.
- Email the instructor the link of your gpj18 webpage.
Week 2 (Oct 17, 2018)
- Some base R exercises (Solutions!) (html)
- Tidyverse Introduction: dplyr (html |pdf)
- dplyr join operations 1, 2
- Bonus Material (Program coordinator’s recommendation): EdX - The Analytics Edge
ODD Assignment
- ODD Assignment Example (html)
This is a very easy one. The homework is about standardizing and merging data. I did most of the work in the example on course webpage. I expect something very similar. Here are the steps. Please try to do it as quick as possible and cooperate with your classmates if you are having any trouble.
- Go to the Google Drive Excel sheet that I sent the invites to you.
- Choose a month (preferably most recent that is not taken) and write your name and email on it.
- Go to ODD site and download the data. Rename and add it to your github repository.
- Follow the example and write the RMarkdown document that will download the raw excel from your repository, manipulate it in the desired format (check the example) and save it in an RDS file.
- Add some analysis to the RMarkdown document using your dplyr skills.
Week 1 (Oct 3, 2018)
- Cheat Sheet Heaven
- Introduction to R (html | pdf)
- Some base R exercises (html)
- RMarkdown
- Datacamp Homework (Don’t forget to accept your invitations)
- RMarkdown Homework: Prepare an RMarkdown document. Introduce yourself in one paragraph (Your name surname, your work, your data interests and how you (plan to) use data science skills in your current/future work). Watch some UseR-2018 videos (Link) and write one of them down on your RMarkdown document. Find 3 R posts relevant to your work and describe them. Get the html output and put it in your progress journal repository. Provide link from your Progress Journal page. (Click for example)
- Form teams of 2-3 and prepare for major projects (we will discuss in week 2).
Week 0 (Sep 24, 2018)
- Tutorial: Initializing Your Progress Journal (download)
- Some light reading about the previous year. (Read on Blog)
- Some light reading about instructor’s view on R. (Read on Blog)
This course benefits from DataCamp for the Classroom program. See details here.
Miscellaneous
Data Sets for Prospective Projects
- YÖK
https://yokatlas.yok.gov.tr/ https://istatistik.yok.gov.tr/
- ÖSYS
http://www.osym.gov.tr/TR,6552/sureli-yayinlar.html
- EPİAŞ
https://seffaflik.epias.com.tr/transparency/
- SPK
- Merkez Bankası - CBRT
- Emeklilik Gözetim Merkezi
http://www.egm.org.tr/?pid=351
- TURKSTAT - TUIK
http://www.tuik.gov.tr/Start.do
http://www.tuik.gov.tr/takvim/tkvim.zul?submenuheader=0#tb1
Extra Materials
For audiovisual learners, some webinars here.
dplyr
- Official dplyr tutorial
- dplyr join functions
- dplyr join functions official tutorial
- dplyr Cheat Sheet
ggplot2
RMarkdown
- Introduction to RMarkdown - Official
- R4DS Book - Communication
- DataCamp - Authoring R Markdown Reports Free Part
- RMarkdown Cheat Sheet
Shiny
RStudio
External Good Resources About R and Data Science
- Introduction to Statistical Learning
- R for Data Science
- R’a Hızlı Giriş (Türkçe)
- The Elements of Statistical Learning
- Advanced R
- Bookdown Compilation
- Akademik Bilişim 2017 - R ile Veri Analizi Dersi
- BOUN-FE 522
- Learn X in Y Minutes - R
- dplyr vignettes
- ggplot2 workshop
- RStudio Cheat Sheets (Base R, dplyr, ggplot2, RMarkdown etc.)
- R Reference Cards
- data.table Cheat Sheet