DP-900 PART 14

Azure Stream Analytics

It is a service for complex event processing and analysis of streaming data.

A Stream Analytics job works in three steps:

  1. Ingest data from an input
  2. Process the data using a query
  3. Write the results to an output

The data is filtered after it is captured, and the results are sent to a downstream process.
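
The ingest, query, output flow can be sketched in plain Python. This is only an illustration of the idea and does not use Azure Stream Analytics or its query language; the event fields, the 70-degree threshold, and the 10-second window are all made-up assumptions.

```python
from collections import defaultdict

# Hypothetical sensor events; a real job would receive these from an input.
events = [
    {"device": "a", "second": 1, "temp": 71.0},
    {"device": "a", "second": 4, "temp": 75.0},
    {"device": "b", "second": 7, "temp": 64.0},
    {"device": "a", "second": 12, "temp": 80.0},
    {"device": "b", "second": 15, "temp": 90.0},
]

WINDOW_SECONDS = 10  # assumed window size for the aggregation


def ingest(source):
    """Input step: hand over events one at a time, as a stream would."""
    for event in source:
        yield event


def process(stream, min_temp):
    """Query step: keep readings at or above min_temp, then average
    them per device and per fixed-size time window."""
    readings = defaultdict(list)
    for event in stream:
        if event["temp"] >= min_temp:
            window_start = event["second"] // WINDOW_SECONDS * WINDOW_SECONDS
            readings[(event["device"], window_start)].append(event["temp"])
    return {key: sum(temps) / len(temps) for key, temps in readings.items()}


def write(results, sink):
    """Output step: append one row per (device, window) to the sink."""
    for (device, window_start), avg_temp in sorted(results.items()):
        sink.append(
            {"device": device, "window_start": window_start, "avg_temp": avg_temp}
        )


output = []
write(process(ingest(events), min_temp=70.0), output)
for row in output:
    print(row)
```

The reading of 64.0 from device b is dropped by the filter, so the output has three rows: device a in the window starting at 0, and devices a and b in the window starting at 10.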

Apache Spark on Azure

Spark can be used on Azure through these services:

  1. Azure Synapse Analytics
  2. Azure Databricks
  3. Azure HDInsight

Spark Structured Streaming

It is used to process streaming data on Azure.
The Spark Structured Streaming library provides an API for ingesting, processing, and outputting results from streams of data. It is built on the Spark dataframe. The API reads data from a real-time data source, such as a Kafka hub or a file store.
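
The idea of a stream feeding an ever-growing dataframe can be modelled in plain Python. This sketch is not the Spark API (in PySpark the real entry points are `spark.readStream` and `DataFrame.writeStream`); the word source, batch size, and batch count are hypothetical, chosen only to show data arriving in small batches and a result being updated incrementally.

```python
from collections import Counter
from itertools import islice


def source():
    """Stand-in for a real-time source: an endless stream of words."""
    words = ["spark", "kafka", "spark", "delta"]
    while True:
        yield from words


def micro_batches(stream, batch_size, max_batches):
    """Cut the endless stream into a limited number of small batches."""
    for _ in range(max_batches):
        yield list(islice(stream, batch_size))


unbounded_table = []        # grows as each batch is appended
running_counts = Counter()  # query result, updated after every batch
snapshots = []              # copy of the result after each batch

for batch in micro_batches(source(), batch_size=3, max_batches=2):
    unbounded_table.extend(batch)
    running_counts.update(batch)
    snapshots.append(dict(running_counts))

for snapshot in snapshots:
    print(snapshot)
```

Each snapshot is the word count over everything received so far, which is roughly how a streaming aggregation keeps its result current as new rows arrive.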

Delta Lake

An open-source storage layer that adds support for transactional consistency and schema enforcement.
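
What these two properties mean can be shown with a toy model in plain Python. This is not the Delta Lake API; the schema, the `append` helper, and the `SchemaError` class are hypothetical names used only for illustration.

```python
# Hypothetical table schema: column name -> required Python type.
SCHEMA = {"id": int, "name": str}


class SchemaError(ValueError):
    """Raised when a row does not match the table schema."""


def validate(row):
    """Schema enforcement: reject rows with wrong columns or types."""
    if set(row) != set(SCHEMA):
        raise SchemaError(f"unexpected columns: {sorted(row)}")
    for column, expected_type in SCHEMA.items():
        if not isinstance(row[column], expected_type):
            raise SchemaError(f"column {column!r} must be {expected_type.__name__}")


def append(table, rows):
    """Transactional append: check every row first, then add all or none."""
    for row in rows:
        validate(row)
    table.extend(rows)


table = []
append(table, [{"id": 1, "name": "a"}])

try:
    # The second row has a string id, so the whole batch is rejected.
    append(table, [{"id": 2, "name": "b"}, {"id": "3", "name": "c"}])
    rejected = False
except SchemaError:
    rejected = True

print(rejected, table)
```

Because validation happens before anything is written, the bad batch leaves the table exactly as it was, with only the first row in it.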

