Topics: classification and clustering methods. Unsupervised techniques. Clustering: k-means, k-nearest neighbours, hierarchical clustering. Supervised techniques: regression, tree, random forests. Training, testing, predicting. Performance measures: Dunn’s index, ROC, AUC, confusion matrix.
Code: R / Tool: RStudio
Machine learning methods
Classification, regression, k-nearest neighbour, clustering methods, and performance measures.
Consult the series of mini-cases in a new tab (Show/Hide All Code in the case upper-right corner) and the notes in a new tab.

Water pumps machine learning challenge
Classification with random forests. A challenge by DrivenData.
Consult the case in a new tab (Show/Hide All Code in the case upper-right corner).


Despite being full of promise, machine learning depends on humans…
The Turk, also known as the Mechanical Turk or Automaton Chess Player, was a fake chess-playing machine constructed in the late 18th century. The Turk was in fact a mechanical illusion that allowed a human chess master hiding inside to operate the machine.
