Résumé de section

  • We will first, finalize the discussion of massively parallel query processing on top of MapReduce (based on the slides from the previous session). 

    Then, we will discuss a family of cloud data service architectures, provided by major companies nowadays.

    Finally, we will delve into the algorithms used to integrate heterogeneous data sources, which will be the the basis for the last session (lab).