Databricks is one of the most in-demand big data tools around. More than 9,000 organizations worldwide, including over 40% of the Fortune 500, rely on the Databricks Lakehouse Platform.
We will be focussing specifically on the Databricks SQL Platform.
Databricks SQL is a powerful tool used for querying and analyzing large datasets, making it highly relevant in today’s data-driven world. Learning this skill can enhance your employability and career prospects.
This course can be taken by experienced data analysts who are interested in learning about Databricks, or even by aspiring data analysts with no prior experience. I will teach you everything you need to know, including how to code in SQL!
It can also be taken as a guide for students who are aiming to achieve the Databricks Data Analyst Certification.
The course is packed with lectures and hands-on development. This should be more than enough to keep you engaged and learning!
The course is aimed at teaching you Data Analysis on Databricks, Unity Catalog and the Databricks Lakehouse Architecture.
The curriculum is extensive and will cover a variety of areas including:
- Set Up and Overview
- Databricks Queries
- Storing and Managing Data with Databricks
- External and Managed Tables
- Data Analysis with SQL
- Data Lakehouse Architecture
- Delta File Format
- Data Visualization and Dashboards
- Access Control and Data Governance
- Unity Catalog
Set Up and Overview of Databricks
Course Overview
Introduction to Big Data (Optional)
Apache Spark Ecosystem (Optional)
Overview of Databricks
Azure Account Set Up
Azure Portal Overview
Cost Management and Billing
Creating a Databricks Premium Workspace
Databricks Workspace User Interface
Unity Catalog Overview
Enabling Unity Catalog – Overview
ADLS Overview and Storage Creation
Access Connector for Databricks
Enabling Unity Catalog
Creating a SQL Warehouse
Introduction to Queries
Your First Query
Switching Catalogs and Schemas
Scheduling Queries
Adding Comments to Queries
Course Resources – SQL Code Download
Catalogs, Schemas, Tables and Views
Creating Catalogs
Creating Schemas
Data Types in Databricks SQL
Overview of Tables in Databricks
Creating Managed Tables with SQL
Creating Managed Tables in Hive Metastore
Creating Managed Tables using the Data Explorer
Creating an External Storage Location
Creating External Tables
Overriding Unity Catalog's Default Managed Table Storage Location
Truncate Table
Alter Table
Drop Tables, Schemas and Catalogs
Data Analysis with SQL
Select Statement Recap
Select Distinct
Note on the JC_BIKE_DATA_22 Table
Filtering Records with the WHERE Clause
Filtering Records Based on Multiple Conditions
Filtering Records with the IN and LIKE Operators
Deleting Records
Databricks SQL Built In Functions Overview
String Functions
Numerical Functions
Date and Timestamp Functions
Converting Strings to Dates/Timestamps
Conditional Functions
Aggregate Functions
Group By Clause
Filtering Aggregated Tables with the Having Clause
Joining Tables Overview
Joining Tables Demo
Order By and Limit Clauses
SQL Order of Execution
Subqueries
Views
Set Operators
SQL Challenge 1
SQL Challenge 2
SQL Challenge 3
SQL Challenge 4
SQL Challenge 5
Schema Clean Up
Delta Lake
Medallion Architecture and Last Mile ETL
Medallion Architecture Demo
Benefits of Delta File Format
Upsert / Merge Into
Table Audit History and Time Travel
Query Alerts and Monitoring
Query History and Profile
Query Caching in Databricks SQL
Query Alerts
Visualizations and Dashboards in Databricks SQL
Visualizations and Dashboards Overview
Our First Chart in Databricks SQL
Line and Area Charts
Combo Chart
Pie Chart
Scatter and Bubble Plots
Histograms
Box Plots
Heatmaps
Sankey Charts
Tables
Pivot Tables
Counters
Additional Guidance on Charts in Databricks SQL
Exploratory Data Analysis Challenge
Adding Missing Data to the JC_BIKE_DATA_22 Table
Creating a View to Simplify Upcoming Demos
Query Filters
Query Parameters
Query Parameters (Dates)
Introduction to Dashboards
Adding Parameters to Dashboards
Adding Filters to Dashboards
Trip Duration Analysis Challenge
Rider Type Analysis Challenge
Access Control, Data Governance and Unity Catalog
Administrative Roles in Databricks
Adding a New User to our Azure Account
Adding a New User to our Databricks Environment
Workspace Admin Settings
Workspace Object Access Control
SQL Warehouse Access Control
Folder Access Control
Query Access Control
Dashboard Access Control
Workspace Object Access Control – Summary
Unity Catalog Securable Objects and Privileges
Granting and Revoking Privileges with SQL (Unity Catalog)
Granting and Revoking Privileges via the Data Explorer (Unity Catalog)
Redacting Data with Dynamic Views (PII)
Data Discovery
Data Lineage
Delta Sharing Overview
Databricks to Databricks Delta Sharing
Open Delta Sharing
Congratulations on completing the course!
Congratulations
Bonus Lecture