thalia_elie
11/13/2020, 5:42 PMdouwe_maan
11/13/2020, 5:43 PMmeltano config tap-csv
or meltano config tap-spreadsheets-anywhere
?thalia_elie
11/13/2020, 5:44 PMthalia_elie
11/13/2020, 5:44 PMthalia_elie
11/13/2020, 5:45 PMthalia_elie
11/13/2020, 5:45 PMdouwe_maan
11/13/2020, 5:46 PMentity
, file
and keys
under https://meltano.com/plugins/extractors/csv.html#files?douwe_maan
11/13/2020, 5:46 PMthalia_elie
11/13/2020, 5:56 PMdouwe_maan
11/13/2020, 5:57 PMentity
is the name you'd like to give to the contents of this CSV file, which would be used for as the table name in the DB you'll load the data into.douwe_maan
11/13/2020, 5:57 PMkeys
is a list of column names that together uniquely identify each rowdouwe_maan
11/13/2020, 5:57 PMthalia_elie
11/13/2020, 6:01 PMdouwe_maan
11/13/2020, 6:02 PMdouwe_maan
11/13/2020, 6:02 PMmeltano
commandthalia_elie
11/13/2020, 6:03 PMthalia_elie
11/13/2020, 6:03 PMthalia_elie
11/13/2020, 6:14 PMthalia_elie
11/13/2020, 6:14 PMentity
, file
, and keys
keys
csv_files_definition [env: TAP_CSV_FILES_DEFINITION, TAP_CSV_CSV_FILES_DEFINITION] current value: None (from default)
CSV Files Definition: Project-relative path to JSON file holding array of objects with entity
, file
, and keys
keysdouwe_maan
11/13/2020, 6:15 PMfiles
and csv_files_definition
are the two supported settings, and the docs describe in more detail how either can be configureddouwe_maan
11/13/2020, 6:15 PMthalia_elie
11/13/2020, 6:19 PMdouwe_maan
11/13/2020, 6:22 PMmeltano.yml
? You can configure the files
setting for tap-csv
and its entity
, file
, and keys
subproperties using a config
entry, like in the example here: https://meltano.com/plugins/extractors/csv.html#how-to-usedouwe_maan
11/13/2020, 6:22 PMplugins:
extractors:
- name: tap-csv
variant: meltano
pip_url: git+<https://gitlab.com/meltano/tap-csv.git>
config:
files:
- entity: <entity>
file: <path>
keys: [<key>]
# ...
thalia_elie
11/13/2020, 6:22 PMmeltano config <plugin> set <setting> <value>
and an example below. but these variables do not match what I'm gaining from our slack. there arent file definitions, entities or keys here. i would love to know if the extractor config for tap-csv or tap-all-spreadsheets-everywhere specifies where these values are entered? in the yaml? within a select command? that's really what I'm asking. where do I put the values needed to configure the extractor?thalia_elie
11/13/2020, 6:22 PMthalia_elie
11/13/2020, 6:23 PMdouwe_maan
11/13/2020, 6:23 PMthalia_elie
11/13/2020, 7:10 PMdouwe_maan
11/13/2020, 7:11 PMmeltano config tap-csv list
look like now?thalia_elie
11/13/2020, 7:11 PMmeltano config <plugin> list
douwe_maan
11/13/2020, 7:11 PMcsv_files_definition
?thalia_elie
11/13/2020, 7:11 PMdouwe_maan
11/13/2020, 7:12 PMmeltano.yml
look like?thalia_elie
11/13/2020, 7:12 PMdouwe_maan
11/13/2020, 7:12 PMthalia_elie
11/13/2020, 7:12 PMthalia_elie
11/13/2020, 7:13 PMdouwe_maan
11/13/2020, 7:13 PMdouwe_maan
11/13/2020, 7:14 PMfiles: ...
needs to be nested under config:
, so it needs one more level of indentation (2 more spaces ahead of each line)douwe_maan
11/13/2020, 7:14 PMmeltano config tap-csv list
show?douwe_maan
11/13/2020, 7:15 PMcsv_files_definition: path/to/file.json
under config
in meltano.yml
, instead of including the files
config there directlydouwe_maan
11/13/2020, 7:16 PMfiles
in meltano.yml
as you have now, without the separate filedouwe_maan
11/13/2020, 7:17 PM// ...
, which is not actually valid JSON, but is included in the docs example to illustrate that more JSON objects with their own entity/file/keys could follow the one in the exampledouwe_maan
11/13/2020, 7:17 PMthalia_elie
11/13/2020, 7:18 PMthalia_elie
11/13/2020, 7:18 PMthalia_elie
11/13/2020, 7:18 PMdouwe_maan
11/13/2020, 7:19 PMfile:
and keys:
lines below entity:
should have file
and keys
start at the same indentation level as entity
itselfdouwe_maan
11/13/2020, 7:19 PMthalia_elie
11/13/2020, 7:21 PMthalia_elie
11/13/2020, 7:21 PMmeltano.yml
)
Array of objects with entity
, file
, and keys
keys
csv_files_definition [env: TAP_CSV_FILES_DEFINITION, TAP_CSV_CSV_FILES_DEFINITION] current value: None (from default)
CSV Files Definition: Project-relative path to JSON file holding array of objects with entity
, file
, and keys
keys
To learn more about extractor 'tap-csv' and its settings, visit https://meltano.com/plugins/extractors/csv.htmldouwe_maan
11/13/2020, 7:24 PMfile
value should have a .csv
suffix to match the full path, and keys
should only contain the columns that uniquely identify the row. If the name
in each row is expected to be unique, that'd be a good candidatedouwe_maan
11/13/2020, 7:25 PMthalia_elie
11/13/2020, 7:28 PMdouwe_maan
11/13/2020, 7:29 PMname
is sufficient if there's no IDthalia_elie
11/13/2020, 7:29 PMdouwe_maan
11/13/2020, 7:31 PMdouwe_maan
11/13/2020, 7:31 PMthalia_elie
11/13/2020, 7:45 PMthalia_elie
11/13/2020, 7:45 PMthalia_elie
11/13/2020, 7:46 PMthalia_elie
11/13/2020, 7:46 PMthalia_elie
11/13/2020, 7:46 PMthalia_elie
11/13/2020, 7:57 PMthalia_elie
11/13/2020, 8:00 PMthalia_elie
11/13/2020, 8:09 PMthalia_elie
11/13/2020, 8:10 PMthalia_elie
11/13/2020, 8:26 PMthalia_elie
11/13/2020, 8:28 PMthalia_elie
11/13/2020, 8:28 PMdouwe_maan
11/13/2020, 9:13 PMthe top is me forgetting to make my destination folder a directory but i think i got it..yes?Correct, this looks as expected!
douwe_maan
11/13/2020, 9:14 PMim guessing transformation skipped because its a csv file in the correct format, yes?Correct
douwe_maan
11/13/2020, 9:14 PMim researching and i THINK this doc is relevant: https://meltano.com/plugins/loaders/csv.html#getting-started to my next step..am i on the right path?Yes, that's right! You can find that link on the bottom of the
meltano config target-csv list
output as well: https://meltano.slack.com/archives/C013EKWA2Q1/p1605296726134600?thread_ts=1605289339.120700&cid=C013EKWA2Q1douwe_maan
11/13/2020, 9:16 PMi see directions : configure the settings below..but then it moves to the next step with nothing listed below in that section.The idea is that you jump from that "Configure the settings below using
meltano config
." line to the Settings section using the link, where the doc then describes which settings are required and how they should be configured, with an example minimal configuration. But maybe that could be more clear?