€ 378,72

Azure Databricks Deep Dive

Evenementdetails

Deel dit evenement

Datum en tijd

Locatie

Locatie

TBD (Utrecht area)

Netherlands

Kaart bekijken

Beleid voor refunds

Beleid voor refunds

Restituties tot 7 dagen voor evenement

Beschrijving van het evenement

Beschrijving

Have you looked at Azure DataBricks yet? No! Then you need to. Knowing how to use Apache Spark will earn you more money. It is that simple. Data Engineers and Data Scientists who know Apace Spark are in-demand! This workshop is designed to introduce you to the skills required to do both.

In the morning we will introduce Azure DataBricks then discuss how to develop in-memory elastic scale data engineering pipelines. We will talk about shaping and cleaning data, the languages, notebooks, ways of working, design patterns and how to get the best performance. You will build an engineering pipeline with Python, as well as the other options available to you. The Engineering element will be delivered by UK MVP Simon Whiteley. Simon has been deploying engineering projects with Azure DataBricks since it was announced. He has real world experience in multiple environments.

Then we will shift gears, we will take the data we moved and cleansed and apply distributed machine learning at scale. We will train a model and productionise it. We will then enrich our data with our newly predicted values. The Data Science element will be led by UK MVP Terry McCann. Terry holds an MSc in Data Science and has been working with Apache Spark for the last 5 years. He is dedicated to applying engineering practices to data science to make model development, training and scoring as easy an as automated as possible

By the end of the day, you will understand how Azure Databricks supports both data engineering and data science, levering Apace Spark to deliver blisteringly fast data pipelines and distributed machine learning models. Bring your laptop as this will be hands on.

Pre-requisites

An understanding of ETL processing either ETL or ELT on either on-premises or in a big data environment. A basic level of Machine Learning would also be beneficial, but not critical.

Laptop Required:Yes

  • Software: In the session we will be using Azure Databricks. We will have labs and demos that you can follow if you want to. If you do want to then you will need the following: – An Azure Subscription – Money on the Azure Subscription – Enough access on the subscription to make service principals. – Azure Storage explorer- PowerShell
  • Subscriptions: Azure

Biography Simon

As a seasoned analytics consultant, Simon has been working with the Microsoft BI Stack for over a decade, initially designing kimball warehouses with SSIS and SSAS. For the past few years he has specialised in using Microsoft Azure to challenge and revolutionise data engineering. Whether it’s automating file ingestion into Azure Data Lakes, applying massive compute via SQLDW & Azure Databricks, or simply shifting your current SSIS packages into the cloud with Data Factory, he can help.

Simon is a strong advocate of the use of modern development practices in the SQL World (we actually are developers, whether we like it or not) and helps run both Surrey Data Platform Group & the Microsoft Data London PASS Chapter.

Biography Terry

Microsoft MVP. Principal Consultant and Owner of Advancing Analytics Limited, an Advanced Analytics consultancy in the UK. Terry helps businesses advance their analytical capabilities, drawing upon a deep expertise in Data Science, Data Engineering, DataOps and applied AI. Terry holds a Master’s degree in Data Science – with a focus on DataOps for Machine Learning. Organiser of the Data Science Exeter user group, frequent speaker at conferences across the world and the host of the Data Science in Production Podcast.

Delen met vrienden

Datum en tijd

Locatie

TBD (Utrecht area)

Netherlands

Kaart bekijken

Beleid voor refunds

Restituties tot 7 dagen voor evenement

Sla dit evenement op

Evenement opgeslagen