Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Updated Feb 8, 2025 - Python
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Code examples and resources for DBRX, a large language model developed by Databricks
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best practices out of the box.
🧱 Databricks CLI eXtensions (aka dbx) is a CLI tool for development and advanced management of Databricks workflows.
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can be used to generate large simulated or synthetic data sets for tests, POCs, and other uses in Databricks environments, including Delta Live Tables pipelines.
Testing framework for Databricks notebooks
Automated migrations to Unity Catalog
The Lakehouse Engine is a configuration-driven Spark framework, written in Python, that serves as a scalable and distributed engine for several lakehouse algorithms, data flows, and utilities for Data Products.
Manage your Databricks deployments and CI with code.
Databricks framework to validate the data quality of PySpark DataFrames
Demo of using Nutter to test Databricks notebooks in a CI/CD pipeline
This repository helps you learn Databricks concepts through examples. It covers the important topics a data engineer needs in real-world work, using PySpark and Spark SQL for development. The course ends with a few case studies.
ML Ops Accelerator: Databricks & Azure Machine Learning Unification
Python Testing for Databricks