ABSTRACT

Research Software Engineering: A Guide to the Open Source Ecosystem strives to give a big-picture overview and an understanding of the opportunities of programming as an approach to analytics and statistics. The book argues that a solid "programming" skill level is not only well within reach for many but also worth pursuing for researchers and business analysts. The ability to write a program leverages field-specific expertise and fosters interdisciplinary collaboration as source code continues to become an important communication channel. Given the pace of the development in data science, many senior researchers and mentors, alongside non-computer science curricula lack a basic software engineering component. This book fills the gap by providing a dedicated programming-with-data resource to both academic scholars and practitioners.

Key Features

  • overview: breakdown of complex data science software stacks into core components
  • applied: source code of figures, tables and examples available and reproducible solely with license cost-free, open source software
  • reader guidance: different entry points and rich references to deepen the understanding of selected aspects

chapter 1|8 pages

Introduction

chapter 2|13 pages

Stack: A Developer's Toolkit

chapter 3|23 pages

Programming 101

chapter 4|11 pages

Interaction Environment

chapter 5|13 pages

Git Version Control

chapter 6|24 pages

Data Management

chapter 7|13 pages

Infrastructure

chapter 8|9 pages

Automation

chapter 9|5 pages

Community

chapter 10|10 pages

Publishing and Reporting

chapter 11|43 pages

Case Studies