ugofolio

Algorithms and Spam

Topics: analyze texts (emails) with algorithms. Differentiate spam and nonspam. Custom methods, tree-based methods, and Support Vector Machine. Train, test, and evaluate the methods.

Code: R / Tool: RStudio


1 – Using custom functions and tree-based algorithms

Tree-based methods is a vast subject. Projects ‘Titanic: Getting the Picture’ and ‘Modeling Credit Risk’ extensively use tree-based methods.

Consult the case in a new tab (Show/Hide All Code in the case upper-right corner).


2 – Using Support Vector Machine algorithms

The case is rather simple for such a vast subject. There are many SVM packages to cover all the angles (see the notes in the case).

Consult the case in a new tab – a continuation of the case above (Show/Hide All Code in the case upper-right corner).


An algorithm in one picture.