Topics: analyze texts (emails) with algorithms. Differentiate spam and nonspam. Custom methods, tree-based methods, and Support Vector Machine. Train, test, and evaluate the methods.
Code: R / Tool: RStudio
1 – Using custom functions and tree-based algorithms
Tree-based methods is a vast subject. Projects ‘Titanic: Getting the Picture’ and ‘Modeling Credit Risk’ extensively use tree-based methods.
Consult the case in a new tab (Show/Hide All Code in the case upper-right corner).

2 – Using Support Vector Machine algorithms
The case is rather simple for such a vast subject. There are many SVM packages to cover all the angles (see the notes in the case).
Consult the case in a new tab – a continuation of the case above (Show/Hide All Code in the case upper-right corner).

An algorithm in one picture.
