Hi everyone, I'm seeking advice on associating con...
# singer-targets
t
Hi everyone, I'm seeking advice on associating contacts with their primary companies during data ingestion into HubSpot using Meltano. Specifically, I'm utilizing the csv-tap (Im giving the company domain and firstname,lastname,email of the contact person) and the hubspot-target. My goal is to link contacts to their respective primary companies as the data is loaded. Could anyone share best practices or conventional methods to achieve this within this setup? Any guidance would be greatly appreciated!"
v
"Entity Matching" is the search term you're after. There's a lot of methods that you can use, none of them are a clear "do it this way every time" it's very situational to what the business needs are and the context of the integration
1
Basically, it's a hard problem there's a lot of ways to get to 80-90% "easily" but the last 10% depends on how accurate it needs to be for the business.
t
Thanks for your reply @visch, when I lookup 'entity matching Meltano' there is not really any kind of documentation. I am curious to how this would be done, maybe you know where I can find some information? For now I am just looking for an 'easy' test to see how it would be done. If you know anything that could help me further, I would be very grateful!
v
Entity matching happens in your transformation step. If you want to use acronyms then "EL" is done by Meltano, the T of ELT is after meltano has loaded your data. You can use any tool you want, some folks use DBT, you can use Pandas, Java, you name it. Really not a meltano specific question as Meltano will get the data loaded for you (and we could run an Entity matching tool for you) but 🤷 hope that helps
I do it with dbt today for some of them, and Pandas for others