sonalgoyal's comments

sonalgoyal · on Nov 20, 2022

As part of my data consulting, I struggled with identity resolution and started working on scalable no code identity resolution - https://github.com/zinggAI/zingg/ . It has pushed my limits as a software engineer and product builder, and I had to do a lot of learning to build it. Its cool to see people use Zingg in their workflows and save months of working on custom solutions. Big highlight has been North Carolina Open Campaign Data https://crossroads-cx.medium.com/building-open-access-to-nc-...

sonalgoyal · on Oct 7, 2022

Thanks KrishD!

sonalgoyal · on Oct 7, 2022

Thanks for posting this - I am the author of the post and glad you found my journey worth sharing.

sonalgoyal · on Feb 9, 2022

Thanks for your support. Yes we do ship with some examples and their models which can be run out of the box. We have 3 customer demographic datasets and an ecommerce items matching across Google and Amazon. You can check them here https://github.com/zinggAI/zingg/tree/main/examples

sonalgoyal · on Feb 9, 2022

thanks a lot bencastleton, means a lot!

sonalgoyal · on Feb 9, 2022

Hi rishsriv,

Thanks for liking zingg, super excited to hear this :-) Here are some performance numbers. https://docs.zingg.ai/docs/setup/hardwareSizing.html

We see performance varies by a) Number of attributes to match b) Size of data c) Type of matching and the features we compute for each d) Hardware and cluster size

Although we do not do matching across languages like English with Chinese, we have tested Zingg quite rigorously with Chinese, Japanese, Hindi, German and other languages and it seems to work out of the box. Likely due to the inbuilt Java unicode support and the ML based learning.

You make a great point about continuous variables like lat/long, age etc. Age seems to work, again due to integer differences and the learning. Have not tried lat/long yet. Would you have any dataset you could recommend for testing?

rishsriv · on Feb 9, 2022

Thanks for pointing me to the performance numbers!

No open datasets that I'm aware of for fuzzy geocoordinate matching, unfortunately

sonalgoyal · on Feb 9, 2022

Hmm..guess we will wait along and keep an eye on such datasets.

sonalgoyal · on Feb 9, 2022

Thank you, hope you find it useful! :-)

sonalgoyal · on Oct 18, 2021

This is a great reply !

sonalgoyal · on Sept 27, 2021

Users get better services, personalised offers and no spamming even when they have already bought a product. Hopefully! :)

sonalgoyal · on Nov 17, 2015

I kind of like it that my husband is a programmer. We are able to brainstorm about a lot more things as our backgrounds and day to day challenges are similar. I am not sure how it works out if two people are in completely different professions, but I think work does fill up a lot of our lives, especially as we get older. So its nice to be able to speak the same language (pun intended :-))