ABSTRACT

Machine learning (ML) is progressively reshaping the fields of quantitative finance and algorithmic trading. ML tools are increasingly adopted by hedge funds and asset managers, notably for alpha signal generation and stocks selection. The technicality of the subject can make it hard for non-specialists to join the bandwagon, as the jargon and coding requirements may seem out of reach. Machine Learning for Factor Investing: R Version bridges this gap. It provides a comprehensive tour of modern ML-based investment strategies that rely on firm characteristics.

The book covers a wide array of subjects which range from economic rationales to rigorous portfolio back-testing and encompass both data processing and model interpretability. Common supervised learning algorithms such as tree models and neural networks are explained in the context of style investing and the reader can also dig into more complex techniques like autoencoder asset returns, Bayesian additive trees, and causal models.

All topics are illustrated with self-contained R code samples and snippets that are applied to a large public dataset that contains over 90 predictors. The material, along with the content of the book, is available online so that readers can reproduce and enhance the examples at their convenience. If you have even a basic knowledge of quantitative finance, this combination of theoretical concepts and practical illustrations will help you learn quickly and deepen your financial and technical expertise.

part I|54 pages

Introduction

chapter 1|5 pages

Notations and data

chapter 2|4 pages

Introduction

chapter 4|20 pages

Data preprocessing

part II|88 pages

Common supervised algorithms

chapter 6|22 pages

Tree-based methods

chapter 7|32 pages

Neural networks

chapter |6 pages

Support vector machines

chapter 9|14 pages

Bayesian methods

part III|54 pages

From predictions to portfolios

chapter 10|20 pages

Validating and tuning

chapter 11|12 pages

Ensemble models

chapter 12|20 pages

Portfolio backtesting

part IV|64 pages

Further important topics

chapter 13|16 pages

Interpretability

chapter 15|13 pages

Unsupervised learning

chapter 16|14 pages

Reinforcement learning

part V|27 pages

Appendix

chapter 17|3 pages

Data description

chapter 18|21 pages

Solutions to exercises