Azure Databricks - Build data engineering and AI/ML pipeline

Laser 16 Dec 2021 09:04 LEARNING » e-learning - Tutorial

MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz, 2 Ch
Genre: eLearning | Language: English + srt | Duration: 45 lectures (3h 25m) | Size: 1.29 GB

What is Anomaly detection

How to apply unsupervised learning algorithms (Isolation Forest, KNN, and clustering-based approaches) to detect anomalies
Step-by-step guide to performing ETL operations using Azure Databricks
Understand data lakehouse architecture
Build a data pipeline using the Azure tech stack
Make machine learning models interpretable using Shapley values
Spark Structured Streaming with Kafka
Spark Structured Streaming with Azure Event Hubs
Use MLflow for managing the end-to-end machine learning lifecycle
Anomaly detection on time series data
Building a CI/CD pipeline using Azure DevOps
Building a data pipeline using Azure Data Factory
Productionizing a model using Azure Functions and Docker
Basic knowledge of the Python programming language
Basic understanding of big data ecosystems
Basic understanding of PySpark
This course is designed to help you develop the skills necessary to perform ETL operations in Databricks, build unsupervised anomaly detection models, learn MLOps, perform CI/CD operations in Databricks, and deploy machine learning models into production.
Big Data Engineering:
Big data engineers interact with massive data processing systems and databases in large-scale computing environments. Big data engineers provide organizations with analyses that help them assess their performance, identify market demographics, and predict upcoming changes and market trends.
Azure Databricks:
Azure Databricks is a data analytics platform optimized for the Microsoft Azure cloud services platform. Azure Databricks offers three environments for developing data-intensive applications: Databricks SQL, Databricks Data Science & Engineering, and Databricks Machine Learning.
Anomaly detection:
Anomaly detection (aka outlier analysis) is a step in data mining that identifies data points, events, and/or observations that deviate from a dataset's normal behavior. Anomalous data can indicate critical incidents, such as a technical glitch, or potential opportunities, for instance a change in consumer behavior. Machine learning is progressively being used to automate anomaly detection.
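As a rough illustration of the KNN-based approach named in the course outline, the sketch below scores each point by its mean distance to its nearest neighbours and flags points whose score is far above the median. It uses only the Python standard library; the function names, the sample `readings`, and the `threshold` multiplier are assumptions made for this example, not material from the course.

```python
import math

def knn_anomaly_scores(points, k=2):
    """Score each point by its mean distance to its k nearest neighbours."""
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, smallest first.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(sum(dists[:k]) / k)
    return scores

def flag_anomalies(points, k=2, threshold=3.0):
    """Flag points whose score exceeds `threshold` times the median score."""
    scores = knn_anomaly_scores(points, k)
    median = sorted(scores)[len(scores) // 2]
    return [s > threshold * median for s in scores]

# Hypothetical 2-D readings: a tight cluster plus one far-away point.
readings = [(1.0, 1.0), (1.1, 0.9), (0.9, 1.1), (1.0, 1.2), (8.0, 8.0)]
scores = knn_anomaly_scores(readings)
flags = flag_anomalies(readings)
print(flags)
```

Only the last reading is flagged here, because its neighbours are all far away. On real data the choice of `k` and the threshold usually needs tuning, and a library implementation would scale better than this quadratic loop.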
Data Lake House:
A data lakehouse is a data solution concept that combines elements of the data warehouse with those of the data lake. Data lakehouses implement data warehouses' data structures and management features for data lakes, which are typically more cost-effective for data storage.
Explainable AI:
Explainable AI is artificial intelligence in which the results of the solution can be understood by humans. It contrasts with the concept of the "black box" in machine learning where even its designers cannot explain why an AI arrived at a specific decision.
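The course outline mentions Shapley values as one way to make a model interpretable. A minimal sketch of the idea, assuming a tiny hypothetical model and an all-zeros baseline: each feature's Shapley value is its marginal contribution to the prediction, averaged over every order in which features can be switched from the baseline to their actual values. The names `shapley_values` and `toy_model` are invented for this example; production libraries approximate this computation because the exact version grows factorially with the number of features.

```python
from itertools import permutations

def shapley_values(predict, instance, baseline):
    """Exact Shapley values: average marginal contribution over all feature orderings."""
    n = len(instance)
    totals = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)      # start with every feature at its baseline value
        previous = predict(current)
        for feature in order:
            current[feature] = instance[feature]   # switch this feature on
            value = predict(current)
            totals[feature] += value - previous    # marginal contribution
            previous = value
    return [t / len(orderings) for t in totals]

# Hypothetical model with one interaction term between features 0 and 2.
def toy_model(x):
    return 2.0 * x[0] + 3.0 * x[1] + x[0] * x[2]

instance = [1.0, 2.0, 4.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(toy_model, instance, baseline)
print(phi)
```

The contributions add up to the difference between the prediction for `instance` and the prediction for `baseline`, and the interaction term is split evenly between the two features involved in it.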
Spark structured streaming:
Structured Streaming is a scalable and fault-tolerant stream processing engine built on the Spark SQL engine. In short, Structured Streaming provides fast, scalable, fault-tolerant, end-to-end exactly-once stream processing without the user having to reason about streaming.
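To give a feel for what "exactly-once" means in practice, here is a toy model in plain Python. It is not the Spark API. It only imitates the general recipe of a replayable source, a progress checkpoint, and an idempotent sink, so that a batch replayed after a failure does not produce duplicate output. The class `ToyStream` and all of its fields are hypothetical names invented for this sketch.

```python
class ToyStream:
    """Toy micro-batch runner: replayable source, offset checkpoint, idempotent sink."""

    def __init__(self, source, batch_size=2):
        self.source = source          # replayable: any offset can be read again
        self.batch_size = batch_size
        self.checkpoint = 0           # offset of the next unprocessed record
        self.sink = {}                # batch start offset -> result (idempotent by key)

    def run(self, crash_before_commit_at=None):
        while self.checkpoint < len(self.source):
            start = self.checkpoint
            batch = self.source[start:start + self.batch_size]
            # Writing under the same key twice overwrites instead of duplicating.
            self.sink[start] = sum(batch)
            if start == crash_before_commit_at:
                raise RuntimeError("simulated failure before checkpoint commit")
            self.checkpoint = start + len(batch)   # commit progress

events = [1, 2, 3, 4, 5]
stream = ToyStream(events)
try:
    stream.run(crash_before_commit_at=2)   # fails after writing the batch [3, 4]
except RuntimeError:
    pass
stream.run()                               # restart resumes from the checkpoint
total = sum(stream.sink.values())
print(total)
```

The batch that was in flight during the simulated failure is processed twice, yet the final total still equals the sum of the events, because the second write replaces the first instead of adding to it.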
CI/CD Operation:
CI and CD stand for continuous integration and continuous delivery/continuous deployment. In very simple terms, CI is a modern software development practice in which incremental code changes are made frequently and reliably.
Data Engineers, Data Architects, ETL Developers, Data Scientists, Big Data Developers



