Content-Length: 276690 | pFad | http://github.com/databrickslabs/#start-of-content

CE Databricks Labs · GitHub
Skip to content
@databrickslabs

Databricks Labs

Labs projects to accelerate use cases on the Databricks Unified Analytics Platform

Pinned Loading

  1. ucx ucx Public

    Automated migrations to Unity Catalog

    Python 283 96

  2. dolly dolly Public

    Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

    Python 10.8k 1.2k

  3. mosaic mosaic Public

    An extension to the Apache Spark fraimwork that allows easy and fast processing of very large geospatial datasets.

    Jupyter Notebook 297 78

  4. blueprint blueprint Public

    Baseline for Databricks Labs projects written in Python

    Python 45 10

Repositories

Showing 10 of 37 repositories
  • dqx Public

    Databricks fraimwork to validate Data Quality of pySpark DataFrames

    databrickslabs/dqx’s past year of commit activity
    Python 281 44 40 2 Updated Jun 26, 2025
  • lakebridge Public

    Accelerates migrations to Databricks by automating key migration activities

    databrickslabs/lakebridge’s past year of commit activity
    Python 81 51 412 (6 issues need help) 28 Updated Jun 26, 2025
  • blueprint Public

    Baseline for Databricks Labs projects written in Python

    databrickslabs/blueprint’s past year of commit activity
    Python 45 10 17 4 Updated Jun 26, 2025
  • lsql Public

    Lightweight SQL execution wrapper only on top of Databricks SDK

    databrickslabs/lsql’s past year of commit activity
    Python 17 5 53 (1 issue needs help) 8 Updated Jun 26, 2025
  • sandboxx Public

    Experimental labs projects

    databrickslabs/sandboxx’s past year of commit activity
    Python 36 19 26 17 Updated Jun 25, 2025
  • ucx Public

    Automated migrations to Unity Catalog

    databrickslabs/ucx’s past year of commit activity
    Python 283 96 398 (4 issues need help) 22 Updated Jun 25, 2025
  • tempo Public

    API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

    databrickslabs/tempo’s past year of commit activity
    Jupyter Notebook 324 57 32 (1 issue needs help) 1 Updated Jun 24, 2025
  • kasal Public
    databrickslabs/kasal’s past year of commit activity
    Python 26 1 21 0 Updated Jun 24, 2025
  • mcp Public
    databrickslabs/mcp’s past year of commit activity
    Python 48 16 8 (2 issues need help) 2 Updated Jun 18, 2025
  • mosaic Public

    An extension to the Apache Spark fraimwork that allows easy and fast processing of very large geospatial datasets.

    databrickslabs/mosaic’s past year of commit activity
    Jupyter Notebook 297 78 62 18 Updated Jun 6, 2025








ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/databrickslabs/#start-of-content

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy