Hello, I am using tap-mysql and target-snowflake. ...
# troubleshooting
s
Hello, I am using tap-mysql and target-snowflake. I deployed the project on an AWS EC2 instance and have 35 DAGs (log-based replication) which run every 5 mins. I use PostgreSQL as the database for both Airflow and Meltano. EC2 instance CPU utilisation is almost 100%, and Postgres DB utilisation is around 30% but increases up to 80% during low-traffic. Any suggestions on how to reduce the CPU usage? Do Meltano and Airflow keep their connections to the Postgres DB open for as long as they run?
a
@sumit_singh - Just checking in. Did you have any progress here?
have 35 DAGs (log-based replication) which run every 5 mins
This sounds like a scenario where DAG runs could easily overflow their 5-minute time slot, so that the backlog grows indefinitely as more workers fall behind. Are you able to gauge whether the DAGs finish within their allotted time slot (or not)?
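One rough way to check this is to compare each run's duration against the 5-minute schedule interval. The sketch below is only an illustration: the `overrun_report` helper, the DAG names, and the sample timestamps are all made up, and it assumes you have already exported each run's start and end time (for example from the `start_date` and `end_date` columns of the `dag_run` table in the Airflow metadata database).

```python
from datetime import datetime, timedelta


def overrun_report(runs, interval=timedelta(minutes=5)):
    """Summarise, per DAG, how many runs took longer than the schedule interval.

    `runs` is an iterable of (dag_id, start, end) tuples. This helper is
    hypothetical; it is not part of Airflow or Meltano.
    """
    report = {}
    for dag_id, start, end in runs:
        stats = report.setdefault(
            dag_id, {"runs": 0, "overruns": 0, "max_seconds": 0.0}
        )
        duration = end - start
        stats["runs"] += 1
        if duration > interval:
            # This run was still going when the next one was due to start.
            stats["overruns"] += 1
        stats["max_seconds"] = max(stats["max_seconds"], duration.total_seconds())
    return report


# Made-up sample data standing in for rows exported from the metadata DB.
t0 = datetime(2024, 1, 1, 0, 0)
sample_runs = [
    ("orders_sync", t0, t0 + timedelta(minutes=3)),
    ("orders_sync", t0 + timedelta(minutes=5), t0 + timedelta(minutes=12)),
    ("users_sync", t0, t0 + timedelta(minutes=2)),
]

report = overrun_report(sample_runs)
for dag_id, stats in sorted(report.items()):
    print(dag_id, stats)
```

If a meaningful share of runs shows up as overruns, the runs are probably stacking up on each other, which would fit the CPU being pinned near 100%.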