thalia_elie
11/13/2020, 5:42 PM
douwe_maan
11/13/2020, 5:43 PM meltano config tap-csv or meltano config tap-spreadsheets-anywhere?
thalia_elie
11/13/2020, 5:44 PM
thalia_elie
11/13/2020, 5:44 PM
thalia_elie
11/13/2020, 5:45 PM
thalia_elie
11/13/2020, 5:45 PM
douwe_maan
11/13/2020, 5:46 PM entity, file and keys under https://meltano.com/plugins/extractors/csv.html#files?
douwe_maan
11/13/2020, 5:46 PM
thalia_elie
11/13/2020, 5:56 PM
douwe_maan
11/13/2020, 5:57 PM entity is the name you'd like to give to the contents of this CSV file, which would be used as the table name in the DB you'll load the data into.
douwe_maan
11/13/2020, 5:57 PM keys is a list of column names that together uniquely identify each row
douwe_maan
11/13/2020, 5:57 PM
thalia_elie
11/13/2020, 6:01 PM
douwe_maan
11/13/2020, 6:02 PM
douwe_maan
11/13/2020, 6:02 PM meltano command
thalia_elie
11/13/2020, 6:03 PM
thalia_elie
11/13/2020, 6:03 PM
thalia_elie
11/13/2020, 6:14 PM
thalia_elie
11/13/2020, 6:14 PM entity, file, and keys keys
csv_files_definition [env: TAP_CSV_FILES_DEFINITION, TAP_CSV_CSV_FILES_DEFINITION] current value: None (from default)
CSV Files Definition: Project-relative path to JSON file holding array of objects with entity, file, and keys keys
douwe_maan
11/13/2020, 6:15 PM files and csv_files_definition are the two supported settings, and the docs describe in more detail how either can be configured
douwe_maan
11/13/2020, 6:15 PM
thalia_elie
11/13/2020, 6:19 PM
douwe_maan
11/13/2020, 6:22 PM meltano.yml? You can configure the files setting for tap-csv and its entity, file, and keys subproperties using a config entry, like in the example here: https://meltano.com/plugins/extractors/csv.html#how-to-use
douwe_maan
11/13/2020, 6:22 PM
plugins:
  extractors:
  - name: tap-csv
    variant: meltano
    pip_url: git+https://gitlab.com/meltano/tap-csv.git
    config:
      files:
      - entity: <entity>
        file: <path>
        keys: [<key>]
      # ...
thalia_elie
11/13/2020, 6:22 PM meltano config <plugin> set <setting> <value>
and an example below. but these variables do not match what I'm gaining from our slack. there aren't file definitions, entities or keys here. i would love to know if the extractor config for tap-csv or tap-all-spreadsheets-everywhere specifies where these values are entered? in the yaml? within a select command? that's really what I'm asking. where do I put the values needed to configure the extractor?
thalia_elie
11/13/2020, 6:22 PM
thalia_elie
11/13/2020, 6:23 PM
douwe_maan
11/13/2020, 6:23 PM
thalia_elie
11/13/2020, 7:10 PM
douwe_maan
11/13/2020, 7:11 PM meltano config tap-csv list look like now?
thalia_elie
11/13/2020, 7:11 PM meltano config <plugin> list
douwe_maan
11/13/2020, 7:11 PM csv_files_definition?
thalia_elie
11/13/2020, 7:11 PM
douwe_maan
11/13/2020, 7:12 PM meltano.yml look like?
thalia_elie
11/13/2020, 7:12 PM
douwe_maan
11/13/2020, 7:12 PM
thalia_elie
11/13/2020, 7:12 PM
thalia_elie
11/13/2020, 7:13 PM
douwe_maan
11/13/2020, 7:13 PM
douwe_maan
11/13/2020, 7:14 PM files: ... needs to be nested under config:, so it needs one more level of indentation (2 more spaces ahead of each line)
douwe_maan
11/13/2020, 7:14 PM meltano config tap-csv list show?
douwe_maan
11/13/2020, 7:15 PM csv_files_definition: path/to/file.json under config in meltano.yml, instead of including the files config there directly
douwe_maan
11/13/2020, 7:16 PM files in meltano.yml as you have now, without the separate file
douwe_maan
11/13/2020, 7:17 PM // ..., which is not actually valid JSON, but is included in the docs example to illustrate that more JSON objects with their own entity/file/keys could follow the one in the example
douwe_maan
11/13/2020, 7:17 PM
thalia_elie
11/13/2020, 7:18 PM
thalia_elie
11/13/2020, 7:18 PM
thalia_elie
11/13/2020, 7:18 PM
douwe_maan
11/13/2020, 7:19 PM file: and keys: lines below entity: should have file and keys start at the same indentation level as entity itself
douwe_maan
11/13/2020, 7:19 PM
thalia_elie
11/13/2020, 7:21 PM
thalia_elie
11/13/2020, 7:21 PM meltano.yml)
Array of objects with entity, file, and keys keys
csv_files_definition [env: TAP_CSV_FILES_DEFINITION, TAP_CSV_CSV_FILES_DEFINITION] current value: None (from default)
CSV Files Definition: Project-relative path to JSON file holding array of objects with entity, file, and keys keys
To learn more about extractor 'tap-csv' and its settings, visit https://meltano.com/plugins/extractors/csv.html
douwe_maan
11/13/2020, 7:24 PM file value should have a .csv suffix to match the full path, and keys should only contain the columns that uniquely identify the row. If the name in each row is expected to be unique, that'd be a good candidate
douwe_maan
11/13/2020, 7:25 PM
thalia_elie
11/13/2020, 7:28 PM
douwe_maan
11/13/2020, 7:29 PM name is sufficient if there's no ID
thalia_elie
11/13/2020, 7:29 PM
douwe_maan
11/13/2020, 7:31 PM
douwe_maan
11/13/2020, 7:31 PM
thalia_elie
11/13/2020, 7:45 PM
thalia_elie
11/13/2020, 7:45 PM
thalia_elie
11/13/2020, 7:46 PM
thalia_elie
11/13/2020, 7:46 PM
thalia_elie
11/13/2020, 7:46 PM
thalia_elie
11/13/2020, 7:57 PM
thalia_elie
11/13/2020, 8:00 PM
thalia_elie
11/13/2020, 8:09 PM
thalia_elie
11/13/2020, 8:10 PM
thalia_elie
11/13/2020, 8:26 PM
thalia_elie
11/13/2020, 8:28 PM
thalia_elie
11/13/2020, 8:28 PM
douwe_maan
11/13/2020, 9:13 PM
> the top is me forgetting to make my destination folder a directory but i think i got it..yes?
Correct, this looks as expected!
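[Editor's sketch, not part of the original thread.] The destination-folder slip mentioned above can be guarded against before a run. This is a minimal sketch that assumes the loader writes into a directory that has to exist beforehand; the helper name `ensure_destination_dir` and the `output` path are hypothetical and are not taken from the thread or from Meltano itself.

```python
from pathlib import Path


def ensure_destination_dir(path):
    """Create the destination directory if needed; fail if a file is in the way."""
    dest = Path(path)
    if dest.exists() and not dest.is_dir():
        # Something that is not a directory already occupies this path.
        raise NotADirectoryError(f"{dest} exists but is not a directory")
    # parents=True creates missing parent folders; exist_ok=True makes reruns safe.
    dest.mkdir(parents=True, exist_ok=True)
    return dest


if __name__ == "__main__":
    # "output" is a placeholder; use whatever folder the loader is configured with.
    print(ensure_destination_dir("output"))
```

Running the check before the pipeline turns a confusing loader error into an early, explicit one.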
douwe_maan
11/13/2020, 9:14 PM
> im guessing transformation skipped because its a csv file in the correct format, yes?
Correct
douwe_maan
11/13/2020, 9:14 PM
> im researching and i THINK this doc is relevant: https://meltano.com/plugins/loaders/csv.html#getting-started to my next step..am i on the right path?
Yes, that's right! You can find that link on the bottom of the meltano config target-csv list output as well: https://meltano.slack.com/archives/C013EKWA2Q1/p1605296726134600?thread_ts=1605289339.120700&cid=C013EKWA2Q1
douwe_maan
11/13/2020, 9:16 PM
> i see directions: configure the settings below..but then it moves to the next step with nothing listed below in that section.
The idea is that you jump from that "Configure the settings below using meltano config." line to the Settings section using the link, where the doc then describes which settings are required and how they should be configured, with an example minimal configuration. But maybe that could be more clear?
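
[Editor's sketch, not part of the original thread.] The tap-csv advice from earlier in the thread can be turned into a quick pre-flight check: the files definition must be valid JSON (so the docs' `// ...` placeholder has to go), each object needs entity, file, and keys, the file value is expected to end in .csv, and keys should uniquely identify each row. The sketch below uses only the Python standard library. The function names, the sample entity `people`, the path `extract/people.csv`, and the sample rows are all made up for illustration, and the .csv suffix check is an assumption that fits this thread rather than a documented rule.

```python
import csv
import io
import json
from collections import Counter

REQUIRED = ("entity", "file", "keys")


def check_files_definition(text):
    """Parse a files definition and return a list of problems (empty if none)."""
    try:
        entries = json.loads(text)
    except json.JSONDecodeError as err:
        # A leftover "// ..." placeholder ends up here: JSON has no comments.
        return [f"not valid JSON: {err.msg}"]
    if not isinstance(entries, list):
        return ["expected a JSON array of objects"]
    problems = []
    for i, entry in enumerate(entries):
        if not isinstance(entry, dict):
            problems.append(f"entry {i}: expected an object")
            continue
        for name in REQUIRED:
            if name not in entry:
                problems.append(f"entry {i}: missing '{name}'")
        file_value = entry.get("file")
        # Assumption for this sketch: every source file ends in .csv.
        if isinstance(file_value, str) and not file_value.endswith(".csv"):
            problems.append(f"entry {i}: file should end in .csv")
    return problems


def duplicate_keys(csv_text, keys):
    """Return the key tuples that appear in more than one row of the CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter(tuple(row[k] for k in keys) for row in reader)
    return [key for key, n in counts.items() if n > 1]


# Hypothetical inputs, for illustration only.
GOOD = '[{"entity": "people", "file": "extract/people.csv", "keys": ["name"]}]'
PLACEHOLDER = '[{"entity": "people", "file": "extract/people.csv", "keys": ["name"]},\n// ...\n]'
NO_SUFFIX = '[{"entity": "people", "file": "extract/people", "keys": ["name"]}]'
ROWS = "name,city\nAda,London\nGrace,New York\nAda,Paris\n"
```

With these sample inputs, `check_files_definition(GOOD)` returns an empty list, while `PLACEHOLDER` and `NO_SUFFIX` each yield one problem. `duplicate_keys(ROWS, ["name"])` reports `("Ada",)` as repeated, so `name` alone would not be a safe key for that data, whereas `["name", "city"]` has no duplicates.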