Daniel Luo
10/01/2024, 12:51 PMvisch
10/01/2024, 1:01 PMbatch messages so the target can handle them. Meaning you're ok with the speed of the tap right now but the target is your issue.Daniel Luo
10/01/2024, 1:03 PMDaniel Luo
10/01/2024, 1:06 PMbatch_config.batch_size in addition to batch_size_rows in order for the batch size to actually change. And batch_config.storage.root seems to be working as well.visch
10/01/2024, 1:10 PMDaniel Luo
10/01/2024, 1:12 PMvisch
10/01/2024, 1:12 PMDaniel Luo
10/01/2024, 1:13 PMDaniel Luo
10/01/2024, 1:13 PMvisch
10/01/2024, 1:13 PMvisch
10/01/2024, 1:14 PMtime meltano invoke tap-name should really narrow this down!Daniel Luo
10/01/2024, 1:14 PMvisch
10/01/2024, 1:14 PMvisch
10/01/2024, 1:16 PMDaniel Luo
10/01/2024, 1:24 PMvisch
10/01/2024, 1:35 PMtime meltano invoke tap-name > outDaniel Luo
10/01/2024, 2:09 PMvisch
10/01/2024, 2:10 PMvisch
10/01/2024, 2:10 PMvisch
10/01/2024, 2:11 PMvalidate_records to False in the target and see what the new results areDaniel Luo
10/01/2024, 2:11 PMvisch
10/01/2024, 2:11 PMDaniel Luo
10/01/2024, 2:12 PMDaniel Luo
10/01/2024, 2:12 PMvisch
10/01/2024, 2:12 PMvisch
10/01/2024, 2:13 PMDaniel Luo
10/01/2024, 2:13 PMvisch
10/01/2024, 2:13 PMDaniel Luo
10/01/2024, 2:14 PMloaders:
- name: target-snowflake
variant: meltanolabs
pip_url: meltanolabs-target-snowflakeDaniel Luo
10/01/2024, 2:14 PMDaniel Luo
10/01/2024, 2:14 PMDaniel Luo
10/01/2024, 2:14 PMDaniel Luo
10/01/2024, 2:15 PMloaders:
- name: target-snowflake
variant: meltanolabs
pip_url: meltanolabs-target-snowflake
config:
validate_records: falseDaniel Luo
10/01/2024, 2:19 PMvisch
10/01/2024, 2:21 PMDaniel Luo
10/01/2024, 2:21 PMvisch
10/01/2024, 2:23 PMDaniel Luo
10/01/2024, 2:24 PMDaniel Luo
10/01/2024, 2:24 PMvisch
10/01/2024, 2:25 PMvisch
10/01/2024, 2:25 PMvisch
10/01/2024, 2:26 PMDaniel Luo
10/01/2024, 2:26 PM> meltano invoke target-snowflake --input map-output.json
2024-10-01T14:15:19.837133Z [info ] Environment 'default' is active
/home/dluo/git/dagster-hybrid/src/elt_projects/meltano/.meltano/loaders/target-snowflake/venv/lib/python3.12/site-packages/snowflake/sqlalchemy/base.py:1068: SAWarning: The GenericFunction 'flatten' is already registered and is going to be overridden.
functions.register_function("flatten", flatten)
2024-10-01 14:15:21,135 | INFO | target-snowflake | Target 'target-snowflake' is listening for input from tap.
2024-10-01 14:15:21,136 | INFO | target-snowflake | Initializing 'target-snowflake' target sink...
2024-10-01 14:15:21,136 | INFO | target-snowflake.client-fh_test | Initializing target sink for stream 'client-fh_test'...
2024-10-01 14:15:21,150 | INFO | snowflake.connector.connection | Snowflake Connector for Python Version: 3.12.2, Python Version: 3.12.5, Platform: Linux-5.15.153.1-microsoft-standard-WSL2-x86_64-with-glibc2.35
2024-10-01 14:15:21,151 | INFO | snowflake.connector.connection | Connecting to GLOBAL Snowflake domain
2024-10-01 14:15:21,151 | INFO | snowflake.connector.connection | This connection is in OCSP Fail Open Mode. TLS Certificates would be checked for validity and revocation status. Any other Certificate Revocation related exceptions or OCSP Responder failures would be disregarded in favor of connectivity.
2024-10-01 14:15:22,656 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:23,693 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:23,965 | INFO | snowflake.connector.cursor | Number of results in first chunk: 6
2024-10-01 14:15:24,027 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:24,256 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:24,490 | INFO | snowflake.connector.cursor | Number of results in first chunk: 6
2024-10-01 14:15:24,568 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:24,732 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:25,162 | INFO | snowflake.connector.cursor | Number of results in first chunk: 96
2024-10-01 14:15:25,439 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:29,001 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:29,087 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:39,193 | INFO | snowflake.connector.cursor | Number of results in first chunk: 52
2024-10-01 14:15:41,657 | INFO | snowflake.connector.cursor | Number of results in first chunk: 468
2024-10-01 14:15:42,274 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:42,615 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:42,751 | INFO | snowflake.connector.cursor | Number of results in first chunk: 96
2024-10-01 14:15:42,831 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:48,131 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:48,220 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:49,538 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:15:52,195 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:15:53,116 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:53,808 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:55,794 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:55,848 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:15:56,082 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:04,851 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:12,625 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:12,697 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:12,858 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,207 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,331 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,393 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,438 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:25,460 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 35.92187786102295, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:16:25,499 | INFO | singer_sdk.metrics | METRIC: {"type": "counter", "metric": "record_count", "value": 10001, "tags": {"stream": "client-fh_test", "pid": 19626}}
2024-10-01 14:16:26,639 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:16:29,276 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:16:33,903 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:33,962 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:34,154 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:34,323 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:34,361 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:40,420 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,029 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,106 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,208 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,392 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:43,782 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:44,188 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:44,291 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:44,310 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 17.670918226242065, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:16:45,455 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:16:48,019 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:16:48,805 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:49,530 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:49,680 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:49,818 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:50,003 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:57,767 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:57,832 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,086 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,142 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,488 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,783 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,843 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:58,961 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:16:59,095 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 13.639158725738525, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:17:00,716 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:17:03,330 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:17:04,217 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,071 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,141 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,299 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:05,342 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:23,118 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:23,235 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:25,084 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:25,137 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:26,151 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,540 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,791 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,862 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:27,899 | INFO | singer_sdk.metrics | METRIC: {"type": "timer", "metric": "batch_processing_time", "value": 27.182664394378662, "tags": {"stream": "client-fh_test", "pid": 19626, "status": "succeeded"}}
2024-10-01 14:17:27,946 | INFO | singer_sdk.metrics | METRIC: {"type": "counter", "metric": "record_count", "value": 30000, "tags": {"stream": "client-fh_test", "pid": 19626}}
2024-10-01 14:17:29,024 | INFO | target-snowflake | Target sink for 'client-fh_test' is full. Current size is '10000'. Draining...
2024-10-01 14:17:31,604 | INFO | target-snowflake.client-fh_test | Processing batch of files.
2024-10-01 14:17:31,910 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:32,123 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:32,200 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1
2024-10-01 14:17:32,421 | INFO | snowflake.connector.cursor | Number of results in first chunk: 1visch
10/01/2024, 6:33 PMvisch
10/01/2024, 6:35 PMvisch
10/01/2024, 6:37 PMDaniel Luo
10/01/2024, 8:31 PMprocess_lines . Is it waiting on something?Daniel Luo
10/01/2024, 9:12 PMhaleemur_ali
10/02/2024, 1:42 PMEdgar Ramírez (Arch.dev)
10/02/2024, 3:55 PMaaron_phethean
10/02/2024, 5:39 PMDaniel Luo
10/02/2024, 5:49 PMhaleemur_ali
10/02/2024, 5:53 PMDaniel Luo
10/02/2024, 5:55 PMhaleemur_ali
10/02/2024, 5:56 PMI suppose if you already have the files there, but if you're uploading a file to stagingwould you mind clarifying what is meant by
there and which staging you are referring to.aaron_phethean
10/02/2024, 5:56 PMDaniel Luo
10/02/2024, 5:59 PMwould you mind clarifying what is meant byI was referring to having files in external, and staging in general, as in you have a local file that you need to upload somewhere, whether that be SF or s3and whichthereyou are referring to.staging
Daniel Luo
10/02/2024, 6:00 PMFrom what I saw in the target the non batch implementation uses inserts? That’s very slow compared to the copy staging a file.Based on the query history that I see in SF, what it does is first does a PUT to put the results in staging, then does a merge from staging into the table
Daniel Luo
10/02/2024, 6:03 PMDaniel Luo
10/02/2024, 6:04 PM