emcp
11/17/2021, 1:41 AM
emcp
11/17/2021, 1:44 AM
stephen_bailey
11/17/2021, 2:59 PM
db.salesforce.* tables in Snowflake that I would want to document heavily, tag with lineage, purpose restrictions, etc. There has been discussion around building a singer-catalog-to-dbt-documentation utility, and now that taps can have descriptions in them, it might be good to revisit that
emcp
11/17/2021, 6:31 PM
emcp
11/17/2021, 6:32 PM
aaronsteers
11/17/2021, 6:57 PM
meltano select <TAP_ID> --list --all
to create a more grokable/auditable artifact for code reviews and drift detection.
I think something like this could make a nice git artifact 🙂:
Enabled patterns:
tags.*
commits.id
commits.project_id
commits.created_at
commits.author_name
commits.message
!*.*_url
Selected attributes:
[selected ] commits.author_name
[selected ] commits.created_at
[automatic] commits.id
[selected ] commits.message
[selected ] commits.project_id
[automatic] tags.commit_id
[selected ] tags.message
[automatic] tags.name
[automatic] tags.project_id
[selected ] tags.target
wdyt?
aaronsteers
11/17/2021, 7:02 PM
...building the community in an enterprise around datasets...
I do think this is important in the industry. When I was at Slalom, we treated our data project as an open source project that everyone in the company had read/fork access to, and anyone could submit PRs. We published something very much like the above so everyone in the company could see which fields and tables were available, which we currently imported into our project and which were "left on the table" (pun intended!) 😅
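For the drift-detection idea, a minimal sketch of how a committed copy of the listing could be compared with a fresh one. It assumes the "[status] stream.attribute" line format shown in the listing above; the function names and the two sample snapshots are hypothetical, and the real CLI output may include extra lines or statuses that this pattern does not anticipate.

```python
import re
from typing import Dict, Set, Tuple

# Matches lines such as "[selected ] commits.author_name".
ATTRIBUTE_LINE = re.compile(r"^\s*\[\s*(\w+)\s*\]\s+(\S+)\s*$")


def parse_selected_attributes(listing: str) -> Dict[str, str]:
    """Return {attribute: status} for every "[status] name" line."""
    attributes: Dict[str, str] = {}
    for line in listing.splitlines():
        match = ATTRIBUTE_LINE.match(line)
        if match:
            status, name = match.groups()
            attributes[name] = status
    return attributes


def diff_selection(
    old: Dict[str, str], new: Dict[str, str]
) -> Tuple[Set[str], Set[str]]:
    """Return (added, removed) attribute names between two snapshots."""
    return set(new) - set(old), set(old) - set(new)


# Hypothetical snapshots: the file committed to git vs. a fresh listing.
committed = """\
Selected attributes:
[selected ] commits.author_name
[automatic] commits.id
"""
current = """\
Selected attributes:
[selected ] commits.author_name
[automatic] commits.id
[selected ] commits.message
"""

added, removed = diff_selection(
    parse_selected_attributes(committed),
    parse_selected_attributes(current),
)
print("added:", sorted(added))
print("removed:", sorted(removed))
```

A check like this could run in CI and fail the build whenever the two sets differ, so that selection changes always show up in a code review.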
aaronsteers
11/17/2021, 7:03 PM
emcp
11/19/2021, 9:58 PM
aaronsteers
11/19/2021, 10:19 PM
aaronsteers
11/19/2021, 10:21 PM
aaronsteers
11/19/2021, 10:29 PM
emcp
11/21/2021, 4:14 PM
stephen_bailey
11/22/2021, 12:22 PM
stephen_bailey
11/22/2021, 12:25 PM
row access policy and masking policies and basically will create / apply these things based on catalog metadata. But the challenge of getting quality, up-to-date metadata about all your data sources can't be overstated. That's why I'm such a big advocate of having that quality information live as far upstream (i.e. in Meltano taps) as possible
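As a rough illustration of applying policies from catalog metadata, the sketch below turns a small metadata dict into Snowflake ALTER TABLE statements. The metadata keys (`columns`, `pii`, `tenant_column`), the policy names (`mask_pii`, `tenant_rows`), and the sample table are all assumptions made up for this example, not anything described in the thread; the policies themselves are assumed to already exist in Snowflake.

```python
from typing import Any, Dict, List


def policy_statements(table: str, metadata: Dict[str, Any]) -> List[str]:
    """Build ALTER TABLE statements from (hypothetical) catalog metadata."""
    statements: List[str] = []
    # Attach the assumed "mask_pii" masking policy to every column flagged as PII.
    for column, tags in sorted(metadata.get("columns", {}).items()):
        if tags.get("pii"):
            statements.append(
                f"ALTER TABLE {table} MODIFY COLUMN {column} "
                "SET MASKING POLICY mask_pii;"
            )
    # Attach the assumed "tenant_rows" row access policy if a tenant column is named.
    tenant_column = metadata.get("tenant_column")
    if tenant_column:
        statements.append(
            f"ALTER TABLE {table} ADD ROW ACCESS POLICY tenant_rows "
            f"ON ({tenant_column});"
        )
    return statements


# Hypothetical metadata for one table; identifiers are interpolated as-is,
# so this should only ever be fed trusted catalog content.
sample_metadata = {
    "columns": {"id": {}, "email": {"pii": True}},
    "tenant_column": "account_id",
}
statements = policy_statements("db.salesforce.contact", sample_metadata)
for statement in statements:
    print(statement)
```

The closer to the tap these flags are recorded, the less of this mapping has to be maintained by hand downstream.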