Deji Emiabata
06/18/2024, 8:44 PMtap-salesforce (meltanolabs variant)
. We recently added a new field (Owner_Team__c
) on the salesforce side but keep getting this error when we run our meltano EL jobs for the salesforce pipeline. See Error below `WARNING Removed paths list: ['Owner_Team__c\\']`We think it might be an issue with discovery but one thing we notice is that when we dump the catalog.json
into a local .json file we see the new field. Followed steps in this link
2024-06-18T20:32:11.296315Z [info ] Owner_Team__c\ cmd_type=extractor name=tap-salesforce run_id=965dedeb-d6c1-4347-9c3c-4372848192d4 state_id=2024-06-18T202910--tap-salesforce--target-duckdb stdio=stderr
2024-06-18T20:32:11.296491Z [info ] WARNING Removed paths list: ['Owner_Team__c\\'] cmd_type=extractor name=tap-salesforce run_id=965dedeb-d6c1-4347-9c3c-4372848192d4 state_id=2024-06-18T202910--tap-salesforce--target-duckdb stdio=stderr
2024-06-18T20:32:11.297983Z [info ] WARNING Removed 1 paths during transforms: cmd_type=extractor name=tap-salesforce run_id=965dedeb-d6c1-4347-9c3c-4372848192d4 state_id=2024-06-18T202910--tap-salesforce--target-duckdb stdio=stderr
2024-06-18T20:32:11.298118Z [info ] Owner_Team__c\ cmd_type=extractor name=tap-salesforce run_id=965dedeb-d6c1-4347-9c3c-4372848192d4 state_id=2024-06-18T202910--tap-salesforce--target-duckdb stdio=stderr
2024-06-18T20:32:11.298240Z [info ] WARNING Removed paths list: ['Owner_Team__c\\'] cmd_type=extractor name=tap-salesforce run_id=965dedeb-d6c1-4347-9c3c-4372848192d4 state_id=2024-06-18T202910--tap-salesforce--target-duckdb stdio=stderr
2024-06-18T20:32:11.299635Z [info ] WARNING Removed 1 paths during transforms: cmd_type=extractor name=tap-salesforce run_id=965dedeb-d6c1-4347-9c3c-4372848192d4 state_id=2024-06-18T202910--tap-salesforce--target-duckdb stdio=stderr
Edgar RamĂrez (Arch.dev)
06/19/2024, 11:18 AMmeltano install --clean
?Deji Emiabata
06/19/2024, 12:21 PMDeji Emiabata
06/19/2024, 4:57 PMEdgar RamĂrez (Arch.dev)
06/19/2024, 8:15 PMmeltano.yml
look like?Deji Emiabata
06/19/2024, 8:32 PMversion: 1
send_anonymous_usage_stats: false
project_id: 7d46dd40-62fc-40ce-9425-900b7dc1970a
include_paths:
- ./config/**/*.yml
plugins:
loaders:
- name: target-duckdb
namespace: target_duckdb
pip_url: target-duckdb
executable: target-duckdb
utilities:
- name: dbt-redshift
variant: dbt-labs
pip_url: dbt-core==1.7.13 dbt-redshift==1.7.7 pytz==2021.1 git+<https://github.com/meltano/dbt-ext.git@main>
settings:
- name: target_schema
label: Target Schema
env: DBT_TARGET_REDSHIFT_SCHEMA
value: ds
- name: target_schema_prefix
label: Target Schema PREFIX
env: DBT_TARGET_SCHEMA_PREFIX
value: ${USER_PREFIX}
config:
target: redshift
- name: airflow
variant: apache
pip_url: git+<https://github.com/meltano/airflow-ext.git@f763fd788b2d10c57f25132adc635583a85a7c05> apache-airflow==2.6.3 --constraint <https://raw.githubusercontent.com/apache/airflow/constraints-2.6.3/constraints-${MELTANO__PYTHON_VERSION}.txt> apache-airflow-providers-postgres apache-airflow-providers-amazon apache-airflow-providers-slack pandas boto3 requests
settings:
- name: core.executor
label: Core Executor
value: LocalExecutor
env: AIRFLOW__CORE__EXECUTOR
- name: core.dags_folder
label: Dags Folder
value: $MELTANO_PROJECT_ROOT/orchestrate/dags
env: AIRFLOW__CORE__DAGS_FOLDER
- name: database.sql_alchemy_conn
label: Database SQL Alchemy Connection
value: ${AIRFLOW__DATABASE__SQL_ALCHEMY_CONN}
env: AIRFLOW__DATABASE__SQL_ALCHEMY_CONN
- name: smtp_host
label: SMTP Host
value: ${AIRFLOW__SMTP__SMTP_HOST}
env: AIRFLOW__SMTP__SMTP_HOST
- name: smtp_starttls
label: SMTP Starttls
value: ${AIRFLOW__SMTP__SMTP_STARTTLS}
env: AIRFLOW__SMTP__SMTP_STARTTLS
- name: smtp_ssl
label: SMTP SSL
value: ${AIRFLOW__SMTP__SMTP_SSL}
env: AIRFLOW__SMTP__SMTP_SSL
- name: smtp_user
label: SMTP User
value: ${AIRFLOW__SMTP__SMTP_USER}
env: AIRFLOW__SMTP__SMTP_USER
- name: smtp_password
label: SMTP Password
value: ${AIRFLOW__SMTP__SMTP_PASSWORD}
env: AIRFLOW__SMTP__SMTP_PASSWORD
- name: smtp_port
label: SMTP Port
value: ${AIRFLOW__SMTP__SMTP_PORT}
env: AIRFLOW__SMTP__SMTP_PORT
- name: smtp_mail_from
label: SMTP Mail From
value: ${AIRFLOW__SMTP__SMTP_MAIL_FROM}
env: AIRFLOW__SMTP__SMTP_MAIL_FROM
- name: webserver.base_url
label: Base URL of Website
value: ${AIRFLOW__WEBSERVER__BASE_URL}
env: AIRFLOW__WEBSERVER__BASE_URL
- name: sqlfluff
variant: sqlfluff
pip_url: sqlfluff==2.3.5 sqlfluff-templater-dbt==2.3.5 dbt-core==1.7.0 dbt-redshift==1.7.7
settings:
- name: target_schema_prefix
label: Target Schema PREFIX
env: DBT_TARGET_SCHEMA_PREFIX
value: ${USER_PREFIX}
- name: target_schema
label: Target Schema
env: DBT_TARGET_REDSHIFT_SCHEMA
value: ds
- name: user
env: DBT_REDSHIFT_USER
value: ${DBT_REDSHIFT_USER}
- name: host
env: TARGET_REDSHIFT_HOST
- name: port
env: TARGET_REDSHIFT_PORT
- name: password
env: DBT_REDSHIFT_PASSWORD
value: ${DBT_REDSHIFT_PASSWORD}
- name: dbname
env: TARGET_REDSHIFT_DBNAME
schedules:
- name: postgres-salesforce-to-redshift-portal
extractor: tap-salesforce
loader: target-redshift--salesforce
transform: skip
interval: '@hourly'
start_date: 2021-09-24 17:39:28.048203
Deji Emiabata
06/20/2024, 1:26 AMsalesforce.yml
is this
plugins:
extractors:
- name: tap-salesforce
pip_url: -e extract/tap-salesforce
config:
api_type: BULK
start_date: '2019-01-01T00:00:00Z'
capabilities:
- properties
- discover
- state
settings:
- name: client_id
env: TAP_SALESFORCE_CLIENT_ID
- name: client_secret
env: TAP_SALESFORCE_CLIENT_SECRET
kind: password
- name: refresh_token
env: TAP_SALESFORCE_REFRESH_TOKEN
kind: password
select:
- Account.*
- AccountHistory.*
we forked the meltano labs projectEdgar RamĂrez (Arch.dev)
06/20/2024, 9:14 AMselect:
- Account.*
- Account.Owner_Team__c
- AccountHistory.*
Deji Emiabata
06/20/2024, 1:19 PMDeji Emiabata
06/20/2024, 1:38 PMmeltano install --clean
⢠updated salesforce.yml
with Account.Owner_Team__c
⢠ran my meltano elt command but still encountering issues
⢠warning - WARNING Removed paths list: ['Owner_Team__c\\']
; WARNING Removed 1 paths during transforms: cmd_type=extractor name=tap-salesforce run_id=224a0b3a-1df0-4530-86f3-9a7864d1dbb4 state_id=2024-06-20T132724--tap-salesforce--target-duckdb stdio=stderr
⢠2024-06-20T13:29:55.142462Z [info ] Owner_Team__c\
Edgar RamĂrez (Arch.dev)
06/20/2024, 5:30 PMmeltano invoke tap-salesforce > catalog.json
Deji Emiabata
06/20/2024, 5:35 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 5:45 PMmeltano invoke --dump=catalog tap-salesforce > catalog.json
Deji Emiabata
06/20/2024, 6:13 PMDeji Emiabata
06/20/2024, 6:15 PMDeji Emiabata
06/20/2024, 6:16 PMDeji Emiabata
06/20/2024, 6:16 PM.json
work ?Edgar RamĂrez (Arch.dev)
06/20/2024, 6:22 PMOwner_Team__c
field does seem to be present there but it's the Opportunity
stream? đ¤Deji Emiabata
06/20/2024, 6:22 PMDeji Emiabata
06/20/2024, 6:23 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 6:23 PMselect:
- Account.*
- AccountHistory.*
- Opportunity.*
Deji Emiabata
06/20/2024, 6:23 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 6:23 PMDeji Emiabata
06/20/2024, 6:24 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 6:25 PMplugins:
extractors:
- name: tap-salesforce
schema:
Opportunity:
Owner_Team__c:
type: ["string", "null"]
Deji Emiabata
06/20/2024, 6:25 PMDeji Emiabata
06/20/2024, 6:29 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 6:32 PMDeji Emiabata
06/20/2024, 6:32 PMDeji Emiabata
06/20/2024, 6:33 PMDeji Emiabata
06/20/2024, 6:42 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 6:51 PM.meltano/run/tap-salesforce
directory?Deji Emiabata
06/20/2024, 6:52 PMEdgar RamĂrez (Arch.dev)
06/20/2024, 6:57 PMDeji Emiabata
06/20/2024, 6:58 PM