In this blog post, Juhana describes how he came up with the idea for a machine-learning-based text document classifier and how he implemented it as part of his master’s thesis. The application uses natural language processing (NLP) tools to label documents with matching keywords, which reduces the need for manual work and makes content-based document search possible.
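
This summary does not say which NLP techniques the thesis actually used. As a rough illustration of what keyword labelling and content-based search can look like, the sketch below scores words with TF-IDF, a common baseline that is swapped in here purely as an assumption and is not a trained classifier. The function names `tokenize`, `tfidf_keywords` and `search`, as well as the sample documents, are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_keywords(documents, top_k=2):
    """Return the top_k highest-scoring TF-IDF terms for each document."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: the number of documents each term appears in.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    labels = []
    for tokens in tokenized:
        counts = Counter(tokens)
        # Term frequency times inverse document frequency; a word that
        # occurs in every document gets log(1) = 0 and drops to the bottom.
        scores = {
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
        # Sort by descending score, then alphabetically for a stable order.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        labels.append(ranked[:top_k])
    return labels


def search(labels, keyword):
    """Return the indices of documents labelled with the given keyword."""
    return [i for i, keywords in enumerate(labels) if keyword in keywords]


# Hypothetical sample documents, invented for this illustration.
documents = [
    "the invoice and the payment for the invoice",
    "the contract and the signature for the contract",
    "the meeting and the agenda for the meeting",
]
labels = tfidf_keywords(documents, top_k=2)
print(search(labels, "contract"))  # → [1]
```

Words shared by every document ("the", "and", "for") score zero, so each document ends up labelled with the terms that set it apart, and the search only has to compare a query against those labels. A real classifier like the one in the thesis would presumably learn its labels from training data instead.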