jan_soubusta
03/03/2023, 2:12 PM
2023-03-03T13:56:04.857208Z [info ] smart_open.s3.MultipartWriter('meltano', 'meltano_state/dev_local:tap-salesforce-to-target-postgres-sfdc/lock'): uploading part_num: 1, 17 bytes (total 0.000GB)
2023-03-03T13:59:45.719671Z [info ] INFO Starting sync cmd_type=elb consumer=False name=tap-salesforce producer=True stdio=stderr string_id=tap-salesforce
jan_soubusta
03/03/2023, 2:43 PM
environments:
  - name: dev_local
    # config:
    #   plugins:
    #     extractors:
    #       - name: tap-salesforce
    #         config:
    #           api_type: "BULK"
    #           select_fields_by_default: true
    #           start_date: "2023-01-01T00:00:00Z"
    #           username: integration.internal@gooddata.com.full
    #           is_sandbox: true
.....
plugins:
  extractors:
    - name: tap-salesforce
      variant: meltanolabs
      pip_url: git+https://github.com/meltanolabs/tap-salesforce.git
      config:
        api_type: "BULK"
        select_fields_by_default: true
        start_date: "2023-01-01T00:00:00Z"
        username: integration.internal@gooddata.com.full
        is_sandbox: true
jan_soubusta
03/03/2023, 2:46 PM
lead table.
jan_soubusta
03/03/2023, 2:48 PM
start_date is not applied to `lead` table?
The lag in the beginning is really strange. Is my config in the environments section correct?
jan_soubusta
03/03/2023, 4:00 PM
INFO METRIC: {"type": "timer", "metric": "http_request_duration", "value": 0.1849195957183838, "tags": {"endpoint": "ConnectedApplication", "status": "succeeded"}}
INFO Making GET request to https://gooddata--full.sandbox.my.salesforce.com/services/data/v53.0/sobjects/UserProvisioningRequestShare/describe with params: None
INFO Used 7149 of 282600 daily REST API quota
INFO METRIC: {"type": "timer", "metric": "http_request_duration", "value": 0.3235626220703125, "tags": {"endpoint": "UserProvisioningRequestShare", "status": "succeeded"}}
INFO Making GET request to https://gooddata--full.sandbox.my.salesforce.com/services/data/v53.0/sobjects/DOZISF__ZoomInfo_Scoop__Tag/describe with params: None
INFO Used 7139 of 282600 daily REST API quota
INFO METRIC: {"type": "timer", "metric": "http_request_duration", "value": 0.11390542984008789, "tags": {"endpoint": "DOZISF__ZoomInfo_Scoop__Tag", "status": "succeeded"}}
INFO Making GET request to https://gooddata--full.sandbox.my.salesforce.com/services/data/v53.0/sobjects/IndividualShare/describe with params: None
INFO Used 7152 of 282600 daily REST API quota
jan_soubusta
03/03/2023, 4:04 PM
select_fields_by_default: true
alexander_butler
03/03/2023, 4:11 PM
start_date is applied to SystemModstamp, which is used as the replication key most of the time.
jan_soubusta
03/03/2023, 7:12 PM
select limiting the list of entities and fields, it still freezes in the beginning.
When I turn on debug, a huge number of REST GET requests are issued against the Salesforce instance, as if the select were ignored.
alexander_butler
03/03/2023, 7:26 PM
properties instead of catalog
jan_soubusta
03/03/2023, 7:38 PM
select: section.
The discovery is very slow; it takes 3-4 minutes.
Anytime I change the select: section, the cache is invalidated and the expensive discovery is executed again.
Is the state of the discovery stored only locally, or is it stored in the state backend as well (AWS S3/MinIO in my case)?