Daniel Luo
10/01/2024, 12:51 PM
visch
10/01/2024, 1:01 PM
batch messages so the target can handle them. Meaning you're OK with the speed of the tap right now, but the target is your issue.
Daniel Luo
10/01/2024, 1:03 PM
Daniel Luo
10/01/2024, 1:06 PM
batch_config.batch_size in addition to batch_size_rows in order for the batch size to actually change. And batch_config.storage.root seems to be working as well.
visch
10/01/2024, 1:10 PM
Daniel Luo
10/01/2024, 1:12 PM
visch
10/01/2024, 1:12 PM
Daniel Luo
10/01/2024, 1:13 PM
Daniel Luo
10/01/2024, 1:13 PM
visch
10/01/2024, 1:13 PM
visch
10/01/2024, 1:14 PM
time meltano invoke tap-name should really narrow this down!
Daniel Luo
10/01/2024, 1:14 PM
visch
10/01/2024, 1:14 PM
visch
10/01/2024, 1:16 PM
Daniel Luo
10/01/2024, 1:24 PM
visch
10/01/2024, 1:35 PM
time meltano invoke tap-name > out
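The timing check suggested above can also be scripted. This is a minimal sketch, assuming you only want wall-clock time for the tap on its own; `time_command` is a hypothetical helper, and it discards stdout rather than redirecting it to a file named `out`. The stand-in command is there only so the sketch runs anywhere.

```python
import subprocess
import sys
import time


def time_command(argv):
    """Run a command, discard its stdout, and return (elapsed_seconds, return_code)."""
    start = time.perf_counter()
    proc = subprocess.run(argv, stdout=subprocess.DEVNULL, check=False)
    return time.perf_counter() - start, proc.returncode


# Stand-in command (assumption): replace with ["meltano", "invoke", "tap-name"]
# to time the tap without the target attached.
elapsed, code = time_command([sys.executable, "-c", "print('record')"])
print(f"exit={code} elapsed={elapsed:.2f}s")
```

If the tap alone finishes quickly, the remaining time in a full run is most likely spent in the target.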
Daniel Luo
10/01/2024, 2:09 PM
visch
10/01/2024, 2:10 PM
visch
10/01/2024, 2:10 PM
visch
10/01/2024, 2:11 PM
validate_records to False in the target and see what the new results are
Daniel Luo
10/01/2024, 2:11 PM
visch
10/01/2024, 2:11 PM
Daniel Luo
10/01/2024, 2:12 PM
Daniel Luo
10/01/2024, 2:12 PM
visch
10/01/2024, 2:12 PM
visch
10/01/2024, 2:13 PM
Daniel Luo
10/01/2024, 2:13 PM
visch
10/01/2024, 2:13 PM
Daniel Luo
10/01/2024, 2:14 PM
loaders:
- name: target-snowflake
  variant: meltanolabs
  pip_url: meltanolabs-target-snowflake
Daniel Luo
10/01/2024, 2:14 PM
Daniel Luo
10/01/2024, 2:14 PM
Daniel Luo
10/01/2024, 2:14 PM
Daniel Luo
10/01/2024, 2:15 PM
loaders:
- name: target-snowflake
  variant: meltanolabs
  pip_url: meltanolabs-target-snowflake
  config:
    validate_records: false
Daniel Luo
10/01/2024, 2:19 PM
visch
10/01/2024, 2:21 PM
Daniel Luo
10/01/2024, 2:21 PM
visch
10/01/2024, 2:23 PM
Daniel Luo
10/01/2024, 2:24 PM
Daniel Luo
10/01/2024, 2:24 PM
visch
10/01/2024, 2:25 PM
visch
10/01/2024, 2:25 PM
visch
10/01/2024, 2:26 PM
Daniel Luo
10/01/2024, 2:26 PM
> meltano invoke target-snowflake --input map-output.json
2024-10-01T14:15:19.837133Z [info ] Environment 'default' is active
/home/dluo/git/dagster-hybrid/src/elt_projects/meltano/.meltano/loaders/target-snowflake/venv/lib/python3.12/site-packages/snowflake/sqlalchemy/base.py:1068: SAWarning: The GenericFunction 'flatten' is already registered and is going to be overridden.
functions.register_function("flatten", flatten)
2024-10-01 14:15:21,135 | INFO | target-snowflake | Target 'target-snowflake' is listening for input from tap.
2024-10-01 14:15:21,136 | INFO | target-snowflake | Initializing 'target-snowflake' target sink...
2024-10-01 14:15:21,136 | INFO | target-snowflake.client-fh_test | Initializing target sink for stream 'client-fh_test'...
2024-10-01 14:15:21,150 | INFO | snowflake.connector.connection | Snowflake Connector for Python Version: 3.12.2, Python Version: 3.12.5, Platform: Linux-5.15.153.1-microsoft-standard-WSL2-x86_64-with-glibc2.35
2024-10-01 14:15:21,151 | INFO | snowflake.connector.connection | Connecting to GLOBAL Snowflake domain
2024-10-01 14:15:21,151 | INFO | snowflake.connector.connection | This connection is in OCSP Fail Open Mode. TLS Certificates would be checked for validity and revocation status. Any other Certificate Revocation related exceptions or OCSP Responder failures would be disregarded in favor of connectivity.
2024-10-01 14:15:22,656 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:23,693 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:23,965 | INFO | snowflake.connector.cursor | Number of results in first chunk: 6
2024-10-01 14:15:24,027 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:24,256 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:24,490 | INFO | snowflake.connector.cursor | Number of results in first chunk: 6
2024-10-01 14:15:24,568 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:24,732 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:25,162 | INFO | snowflake.connector.cursor | Number of results in first chunk: 96
2024-10-01 14:15:25,439 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:29,001 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:29,087 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:39,193 | INFO | snowflake.connector.cursor | Number of results in first chunk: 52
2024-10-01 14:15:41,657 | INFO | snowflake.connector.cursor | Number of results in first chunk: 468
2024-10-01 14:15:42,274 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:42,615 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:42,751 | INFO | snowflake.connector.cursor | Number of results in first chunk: 96
2024-10-01 14:15:42,831 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:48,131 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:48,220 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:49,538 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:15:52,195 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:15:53,116 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:53,808 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:55,794 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:55,848 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:56,082 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:04,851 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:12,625 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:12,697 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:12,858 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,207 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,331 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,393 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,438 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,460 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 35.92187786102295, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:16:25,499 | INFO | singer_sdk.metrics | METRIC: {"type": "counter", "metric": "record_count", "value": 10001, "tags": {"stream": "client-fh_test", "pid": 19626}}
2024-10-01 14:16:26,639 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:16:29,276 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:16:33,903 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:33,962 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:34,154 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:34,323 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:34,361 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:40,420 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,029 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,106 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,208 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,392 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,782 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:44,188 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:44,291 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:44,310 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 17.670918226242065, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:16:45,455 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:16:48,019 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:16:48,805 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:49,530 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:49,680 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:49,818 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:50,003 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:57,767 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:57,832 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,086 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,142 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,488 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,783 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,843 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,961 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:59,095 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 13.639158725738525, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:17:00,716 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:17:03,330 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:17:04,217 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,071 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,141 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,299 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,342 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:23,118 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:23,235 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:25,084 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:25,137 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:26,151 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,540 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,791 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,862 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,899 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 27.182664394378662, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:17:27,946 | INFO | singer_sdk.metrics | METRIC: {"type": "counter", "metric": "record_count", "value": 30000, "tags": {"stream": "client-fh_test", "pid": 19626}}
2024-10-01 14:17:29,024 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:17:31,604 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:17:31,910 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:32,123 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:32,200 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:32,421 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
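To compare runs after changing a setting, the `batch_processing_time` metrics in a log like the one above can be pulled out and turned into a rough throughput figure. This is a minimal sketch, assuming each drained batch holds 10,000 rows (the size reported in the "Draining..." lines); `batch_times` and `rows_per_second` are hypothetical helper names, and the two sample lines are copied from the log.

```python
import json


def batch_times(log_lines):
    """Collect batch_processing_time values (seconds) from METRIC log lines."""
    times = []
    for line in log_lines:
        _, sep, payload = line.partition("METRIC: ")
        if not sep:
            continue  # not a metrics line
        try:
            metric = json.loads(payload)
        except json.JSONDecodeError:
            continue  # truncated or malformed payload
        if metric.get("metric") == "batch_processing_time":
            times.append(float(metric["value"]))
    return times


def rows_per_second(times, batch_rows=10_000):
    """Rough throughput, assuming every batch contained batch_rows rows."""
    total = sum(times)
    return len(times) * batch_rows / total if total else 0.0


sample = [
    '2024-10-01 14:16:25,460 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 35.92187786102295, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}',
    '2024-10-01 14:16:44,310 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 17.670918226242065, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}',
]
times = batch_times(sample)
print(times, round(rows_per_second(times)))
```

The figure only covers time spent inside batch processing, so it leaves out the gaps between batches where the target is reading and buffering records.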
visch
10/01/2024, 6:33 PM
visch
10/01/2024, 6:35 PM
visch
10/01/2024, 6:37 PM
Daniel Luo
10/01/2024, 8:31 PM
process_lines. Is it waiting on something?
Daniel Luo
10/01/2024, 9:12 PM
haleemur_ali
10/02/2024, 1:42 PM
Edgar Ramírez (Arch.dev)
10/02/2024, 3:55 PM
aaron_phethean
10/02/2024, 5:39 PM
Daniel Luo
10/02/2024, 5:49 PM
haleemur_ali
10/02/2024, 5:53 PM
Daniel Luo
10/02/2024, 5:55 PM
haleemur_ali
10/02/2024, 5:56 PM
"I suppose if you already have the files there, but if you're uploading a file to staging"
Would you mind clarifying what is meant by "there" and which "staging" you are referring to?
aaron_phethean
10/02/2024, 5:56 PM
Daniel Luo
10/02/2024, 5:59 PM
"would you mind clarifying what is meant by there and which staging you are referring to."
I was referring to having files in external, and staging in general, as in you have a local file that you need to upload somewhere, whether that be SF or S3.
Daniel Luo
10/02/2024, 6:00 PM
"From what I saw in the target the non batch implementation uses inserts? That’s very slow compared to the copy staging a file."
Based on the query history that I see in SF, it first does a PUT to put the results in staging, then does a merge from staging into the table.
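The PUT-then-merge pattern described above can be sketched as a pair of SQL strings. This is an illustration only, not the statements target-snowflake actually issues: the helper name, stage, table, and column names are all hypothetical, and it assumes the stage has a JSON file format so that `$1:col` can read each field.

```python
def put_then_merge_sql(local_path, stage, table, key, columns):
    """Build an illustrative PUT statement and MERGE statement as plain strings.

    Identifiers are interpolated directly, so this is for trusted,
    hard-coded names only.
    """
    # Step 1: upload the local batch file to the stage.
    put_sql = f"PUT file://{local_path} @{stage}"

    # Step 2: upsert from the staged file into the table, matching on the key.
    select_cols = ", ".join(f"$1:{c} AS {c}" for c in columns)
    updates = ", ".join(f"t.{c} = s.{c}" for c in columns if c != key)
    insert_cols = ", ".join(columns)
    insert_vals = ", ".join(f"s.{c}" for c in columns)
    merge_sql = (
        f"MERGE INTO {table} AS t "
        f"USING (SELECT {select_cols} FROM @{stage}) AS s "
        f"ON t.{key} = s.{key} "
        f"WHEN MATCHED THEN UPDATE SET {updates} "
        f"WHEN NOT MATCHED THEN INSERT ({insert_cols}) VALUES ({insert_vals})"
    )
    return put_sql, merge_sql


put_sql, merge_sql = put_then_merge_sql(
    "/tmp/batch.json.gz", "my_stage", "my_table", "id", ["id", "name"]
)
print(put_sql)
print(merge_sql)
```

Each batch costs one upload plus one merge regardless of row count, which is why a larger batch size tends to reduce the number of round trips to Snowflake.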
Daniel Luo
10/02/2024, 6:03 PM
Daniel Luo
10/02/2024, 6:04 PM