Performance


Table of contents

  1. Boosting Performance

Boosting Performance

The programs herein include performance aiding aspects, e.g.,

  • Reading data files via their schemata, e.g., schemaOfCompanies.json
  • Selectively, judiciously, persisting in-memory.
  • Using persisted temporary views; if a Dataset or DataFrame can be [feasibly] persisted, temporary views are created via such persisted objects.

Parallelism is implicit & explicit throughout; wherever applicable and effective.