Chapter 1

Data Acquisition and Reduction

In our first quest of data engineering, we will consider the task of acquiring data into our local environment, before it can be further processed.

graph LR;
  classDef highlight stroke:#000,stroke-width:4px
  direction LR
  Generation:::highlight
  subgraph S [ ]
    Ingestion:::highlight
    Transformation
    Serving
    Ingestion --> Transformation
    Transformation --> Serving
    Storage@{ shape: lin-cyl }
  end
  Generation --> Ingestion
  Serving --> Analytics
  Serving --> ML(Machine Learning)
  Serving --> RETL(Reverse ETL)
graph LR;
  classDef UC fill:#fff,stroke:#555
  classDef highlight stroke:#000,stroke-width:4px,fill:#fff,stroke:#555
  subgraph Undercurrents
    direction TB
    Security:::UC
    DM(Data management):::highlight
    DataOps:::UC
    DA(Data Architecture):::UC
    Orchestration:::UC
    SE(Software Engineering):::UC
    style Undercurrents fill:#eee,stroke:#000
  end

Data will come from a multitude of sources, covered by textbook chapter 5. Here we will discuss Chapter 7 on Data Ingestion.