steven_wang
07/18/2024, 4:51 PM- name: tap-s3
variant: airbyte
capabilities:
- state
- stream-maps
config:
flattening_enabled: true
flattening_max_depth: 2
I'm not seeing objects flattened in the target which is duckdb. I have data_flattening_max_level
in the duckdb target set to 2 as well:
- name: target-duckdb
variant: jwills
config:
add_metadata_columns: true
batch_size_rows: 50000
data_flattening_max_level: 2
path: test.duckdb
Edgar Ramírez (Arch.dev)
07/18/2024, 9:14 PMdata_flattening_max_level
🤔. Do you define the schema manually for tap-s3?steven_wang
07/18/2024, 10:01 PMsteven_wang
07/18/2024, 10:04 PM- name: tap-hubspot
variant: spacecowboy
capabilities:
- state
- catalog
- discover
- about
- stream-maps
config:
flattening_enabled: true
flattening_max_depth: 1
Edgar Ramírez (Arch.dev)
07/19/2024, 4:49 AMflattening drops the json fields entirelywhat's the json type of the field? I'll try to come up with a reproducer for this, if it turns out to be a 🐛
steven_wang
07/19/2024, 4:02 PM{'properties': {}, 'type': ['object', 'null']}
I'll try to find a better example later today toosteven_wang
07/19/2024, 4:04 PMKeyError: 'properties__about_us'
Probably because some of the kv pairs aren't present in all the recordssteven_wang
07/30/2024, 5:47 PMproperties
):
'properties': {'type': ['null', 'object']}
When it should look something like this:
{'properties': {<kv_pairs>}, 'type': ['object', 'null']}
For hubspot I was looking at the tap code and the catalog is specified in the tap but doesn't breakdown all the k-v pairs in the nested fields so the schema doesn't know what to flatten: https://github.com/YouCruit/tap-hubspot/blob/148ac26fbbd43a3aa68c6c8f1fbad34828763df0/tap_hubspot/streams/companies.py#L25
I'm thinking if I can feed in a custom catalog file with all the fields specified in it, then flattening might workEdgar Ramírez (Arch.dev)
07/30/2024, 6:35 PMsteven_wang
07/30/2024, 7:37 PMEdgar Ramírez (Arch.dev)
07/30/2024, 7:38 PM