Performance
Table of contents
Boosting Performance
The programs herein include performance aiding aspects, e.g.,
- Reading data files via their schemata, e.g., schemaOfCompanies.json
- Selectively, judiciously, persisting in-memory.
- Using persisted temporary views; if a Dataset or DataFrame can be [feasibly] persisted, temporary views are created via such persisted objects.
Parallelism is implicit & explicit throughout; wherever applicable and effective.