- We analyzed cuts at the power plants in Turkey between 2012-2018.
- We had in total 73313 observations with 8 variables
- We mutated new observations from the existing ones: Plant.Type, Duration of Cut, Capacity Ratio at the cut and reason of the cut.
- We tidied the raw data using regular expressions and stringr package.
- We used tidy text mining to analyze count of words and which word is following which word.
- We divided cuts into two category, Malfunctions and Planned Activities and looked for their distributions.
- We looked at differences between malfunctions and planned activities in terms of duration of the cut.
- We looked at malfunction types, malfunction reasons and durations according to plant type.