Still struggling with a few aspects of a SDK version of `tap Meltano #singer-tap-development

Still struggling with a few aspects of a SDK versi...

Andy Carter

04/11/2023, 8:57 AM

Still struggling with a few aspects of a SDK version of

tap-freshdesk

Is there any way I can access the values of a stream I have already run, and reuse, without using a parent child relationship? It seems each stream is run in isolation from each other, probably deliberately. Details/limitations in the thread 👇

Andy Carter

04/11/2023, 9:04 AM

The restrictions I have: 1. A

tickets

endpoint, which returns a summary of ticket info, 100 tickets at a time. I need to paginate through these quickly and get a list of ticket `id`s, otherwise tickets get missed in the pagination. 2. A ticket detail endpoint

ticket/{iid}

that gives the full detail, plus access to child conversations endpoint

ticket/{iid}/conversations

3. A very restrictive 429 error, which means I am allotted about '100 call credits' a minute, each ticket and related conversation eats up around '5 calls' I would like to 1. Run

tickets

as a normal stream, with state, to get me all the

id

s that have changes. Lets call this

TicketSummary

stream. 2. Access the IDs generated in a run of

TicketSummary

stream from an instance of

TicketDetail

stream, which accesses

ticket/{iid}

and

conversations

in turn at a much slower rate. Why can't I do it in the normal way? • The low head room on calls per minute means I can only get a few tickets at a time before being forced to wait 50s ish. • This gives lots of time for tickets to be updated by agents 'during' a run. • Which causes a messup in pagination, and some tickets which should have been included to get missed in the sync.

Andy Carter

04/11/2023, 9:38 AM

I have tried to use a

set

in the parent Tap to collect the relevant `id`s, but it looks like

get_records

is being called / the generator set up for each stream set up first, and then being iterated later.

Andy Carter

04/11/2023, 10:59 AM

OK made some progress here, needed to realise that the order that streams are defined on the tap don't dictate the execution order. I can force one stream to run before another, by making one a child of the other. Then sharing a set works, the parent stream runs and fills the set with Ids, and the child stream comes in and reads them running queries.

Andy Carter

04/11/2023, 11:57 AM

@Denis I. any idea why I'm getting this error

meltano select --list --all tap-freshdesk

Copy code

Cannot list the selected attributes: Could not find catalog. Verify that the tap supports discovery mode and advertises the `discover` capability as well as either `catalog` or `properties`

I can't get any

select:

filters to work either. What am I missing? https://github.com/acarter24/tap-freshdesk

Denis I.

04/11/2023, 1:01 PM

@Andy Carter how does the

meltano

configuration for

tap-freshdesk

look?

Andy Carter

04/11/2023, 1:03 PM

Copy code

- name: tap-freshdesk
    pip_url: git+<https://github.com/acarter24/tap-freshdesk.git>
    config:
      domain: ${TAP_FRESHDESK_DOMAIN}
      start_date: 2023-03-01
      api_key: ${TAP_FRESHDESK_API_KEY}
    select:
      - agents.*
      - '!.*'

Denis I.

04/11/2023, 1:10 PM

what do you see when execute

meltano invoke tap-freshdesk --discover

? • upd: run → invoke

Andy Carter

04/11/2023, 1:19 PM

2.5k json file!

disc.json

Denis I.

04/11/2023, 1:24 PM

hm, does adding

--log-level=debug

show anything useful for

select

command?

meltano --log-level=debug select --list --all tap-freshdesk

Andy Carter

04/11/2023, 1:34 PM

OK there is clearly something up!

RecursionError: maximum recursion depth exceeded in comparison

Andy Carter

04/11/2023, 1:34 PM

I will investigate some more, thanks for the pointer

Andy Carter

04/11/2023, 8:49 PM

Looks like if I add these capabilities to

meltano.yml

that command gives a proper response.

Copy code

- name: tap-freshdesk
    pip_url: git+<https://github.com/acarter24/tap-freshdesk.git>
    capabilities:
      - discover
      - properties
      - state

Why would that help though? Is it something to do with a version of

tap-freshdesk

already existing on the hub?

Andy Carter

04/11/2023, 8:56 PM

EDIT: Adding this block:

Copy code

capabilities:
      - discover
      - state
      - catalog
      - about
      - stream-maps

Made everything work much better

Denis I.

04/11/2023, 9:41 PM

Honestly, I didn’t get it either. For all of my custom extractors I use

pip_url: -e %localpath%

and define

capabilities

meltano.yml

. I can’t find where I read about it in docs, but probably just copied this example https://docs.meltano.com/tutorials/custom-extractor#2-edit-your-existing-meltanoyml-file from custom extractor docs.

Andy Carter

04/12/2023, 7:07 AM

Thankyou! I probably started working from that page but drifted away when I started implementing the various streams.

Ruben Vereecken

03/07/2024, 2:51 PM

Long time ago but this was useful! Needed streams to access each other

🙌 1

Ruben Vereecken

03/07/2024, 2:55 PM

Actually my use case is a bit more complicated perhaps: need to access a few streams that will be used to paginate other streams with: countries, cities, categories,… and I don’t think streams can have multiple parents. But this was a great start

2 Views

Open in Slack

Previous Next