# best-practices
Hello, I need some advice. I created a new pipeline from MySQL to S3. The very first run does a FULL EXPORT of the table and the next ones should be incremental. Meltano is running in Docker, and during the run the container dies due to lack of storage. What is the right approach to avoid this kind of problem? For example, during the first run have Meltano copy the data in batches or so. Error message:
[Errno 105] No buffer space available
what is your loader?
you may need to tweak the batch sizing
e.g. for Snowflake, adjusting batch_size_rows: 350000
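For reference, here is roughly what that looks like in meltano.yml (a sketch only; the exact setting name and default differ between loader variants, so check your loader's page on MeltanoHub):

loaders:
  - name: target-snowflake
    config:
      # smaller batches mean less data buffered per flush before it is written out
      batch_size_rows: 350000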
I am extracting from MySQL and saving the files to S3 for Athena to consume. The meltano.yml:
- name: tap-mysql-billing
  inherit_from: tap-mysql
  config:
    host: billing-db
    port: 3306
    database: billing
    user: datalake
    password: $TAP_MYSQL_PASSWORD_BILLING
  select:
    - account_type_prices.account_type_id
    - account_type_prices.id
    - account_type_prices.price_id
    - account_types.id
    - account_types.name
  metadata:
    "account_type_prices*":
      replication-method: INCREMENTAL
      replication-key: id
I assume the first time it is doing a full extract.
Where can I adjust the batch size?
I don't see a batch size setting in the tap-mysql extractor: https://hub.meltano.com/extractors/tap-mysql
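(Batch-size knobs generally live on the loader rather than on tap-mysql, so the loader's settings are the place to look. A sketch of what that could look like in meltano.yml, assuming the S3 loader variant in use exposes a row-count batching setting; the real setting name, if any, is on the loader's MeltanoHub page or in the output of meltano config <loader> list:)

loaders:
  - name: target-s3-csv   # hypothetical name; use whichever S3 loader is actually installed
    config:
      # assumption: the loader exposes a batch_size_rows-style setting
      batch_size_rows: 100000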
Sorry for the delay - which loader are you using though?
Yes, the first run does a full extract, which can be intensive; it took some trial and error for me not to run out of memory too.
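One pattern that can keep that first run smaller, since the extractor already uses inherit_from and select: define one inherited tap per table (or small group of tables) and run them one at a time, so each full extract pushes less data through the container in a single go. A sketch reusing the names from the config above (assuming your Meltano version allows inheriting from an already-inherited plugin; otherwise inherit from tap-mysql directly and repeat the config):

extractors:
  - name: tap-mysql-billing-account-types
    inherit_from: tap-mysql-billing
    select:
      - account_types.*
  - name: tap-mysql-billing-account-type-prices
    inherit_from: tap-mysql-billing
    select:
      - account_type_prices.*
# then run each one separately, e.g. meltano run tap-mysql-billing-account-types <your-s3-loader>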