Lazaros Panitsidis
08/13/2025, 11:31 AM
visch
08/13/2025, 1:14 PM
visch
08/13/2025, 1:15 PM
visch
08/13/2025, 1:16 PM
> Has anyone packaged a Meltano project as a Python wheel and installed it on Databricks clusters? How did you handle dependencies/plugins?
Normally for orchestrators we tend to package Meltano projects as containers. A wheel has a number of issues (platform-dependent, would have to be pretty large, etc.). It could of course work, since at the end of the day it's a package manager, but I wouldn't do it personally.
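On the platform-dependence point above, a minimal sketch of how one might check whether a given wheel is tied to a platform, by reading the tags in its filename. This assumes the standard wheel naming scheme (name, version, optional build tag, Python tag, ABI tag, platform tag); the filenames in the usage note are hypothetical, not taken from any real Meltano project.

```python
def wheel_tags(filename: str) -> tuple[str, str, str]:
    """Return (python_tag, abi_tag, platform_tag) parsed from a wheel filename."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    # name-version-python-abi-platform, with an optional build tag after version
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected wheel filename layout: {filename!r}")
    python_tag, abi_tag, platform_tag = parts[-3:]
    return python_tag, abi_tag, platform_tag


def is_platform_independent(filename: str) -> bool:
    """True when the wheel declares no ABI and no platform requirement."""
    _, abi_tag, platform_tag = wheel_tags(filename)
    return abi_tag == "none" and platform_tag == "any"
```

For example, `is_platform_independent("my_meltano_project-0.1.0-py3-none-any.whl")` is true, while a wheel tagged `cp311-cp311-manylinux_2_17_x86_64` is not. A project wheel can itself be pure Python and still pull in plugin dependencies that ship platform-specific wheels, which is where the portability concern comes from.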
Edgar Ramírez (Arch.dev)
08/13/2025, 2:59 PM
> Has anyone packaged a Meltano project as a Python wheel and installed it on Databricks clusters? How did you handle dependencies/plugins?
I can imagine it's doable, but not straightforward. Is that a requirement of the Databricks platform?
Lazaros Panitsidis
08/13/2025, 4:17 PM
.whl, wheelhouse.zip, or requirements.txt from the workspace.
• Volumes File Path: Use a .whl, .jar, or requirements.txt stored in Volumes.
• File Path / S3: Accepts JAR files (.jar, .zip, .tar) or Python packages (.whl, .zip, .tar, .tar.gz).
• PyPI: Install packages with exact versions using == to avoid regressions; optional custom index URL.
• Maven: Install via Maven coordinates (e.g., com.databricks:spark-csv_2.10:1.0.0) with optional repository and exclusions for dependencies.
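The library sources listed above could be expressed as a job's library list roughly like this. It is only a sketch: the `whl` and `pypi` keys (with `package` and `repo`) follow the shape of the Databricks Libraries API as I understand it and should be checked against the current docs. The helper names, the Volumes path, the index URL, and the version number are all hypothetical placeholders.

```python
import json
from typing import Optional


def pinned(requirement: str) -> str:
    """Return the requirement unchanged if it uses an exact '==' pin, else raise."""
    name, sep, version = requirement.partition("==")
    if not sep or not name.strip() or not version.strip():
        raise ValueError(f"requirement is not pinned with '==': {requirement!r}")
    return requirement


def build_libraries(
    wheel_path: str,
    pypi_requirements: list[str],
    index_url: Optional[str] = None,
) -> list[dict]:
    """Build a library list: one wheel from a Volumes path plus pinned PyPI packages."""
    libraries: list[dict] = [{"whl": wheel_path}]
    for requirement in pypi_requirements:
        spec = {"package": pinned(requirement)}
        if index_url:
            spec["repo"] = index_url  # optional custom index URL
        libraries.append({"pypi": spec})
    return libraries


if __name__ == "__main__":
    libs = build_libraries(
        # hypothetical Volumes path and project wheel name
        "/Volumes/main/default/libs/my_meltano_project-0.1.0-py3-none-any.whl",
        ["meltano==3.0.0"],  # placeholder version, pin whatever you actually test
        index_url="https://example.com/simple",  # hypothetical private index
    )
    print(json.dumps(libs, indent=2))
```

Rejecting unpinned requirements up front is one way to enforce the "exact versions using ==" advice before anything reaches the cluster.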
Feedback on deploying Meltano with asset bundles and Databricks Jobs, especially regarding stability, maintainability, and managing custom plugins, would be appreciated.
visch
08/13/2025, 5:27 PM
> .whl, wheelhouse.zip, or requirements.txt from the workspace.
> • Volumes File Path: Use a .whl, .jar, or requirements.txt stored in Volumes.
> • File Path / S3: Accepts JAR files (.jar, .zip, .tar) or Python packages (.whl, .zip, .tar, .tar.gz).
> • PyPI: Install packages with exact versions using == to avoid regressions; optional custom index URL.
> • Maven: Install via Maven coordinates (e.g., com.databricks:spark-csv_2.10:1.0.0) with optional repository and exclusions for dependencies.
Doesn't "requires significant configuration and ongoing maintenance"
I'd go with the container service
visch
08/13/2025, 5:28 PM
Lazaros Panitsidis
08/14/2025, 7:43 AM