Show HN: Syncing data to your customer’s Google Sheets

ricktdotorg · on Jan 28, 2023

> To make this possible without creating a superuser that would have access to _all_ the sheets, we had to programmatically generate a different user for each of our customers. We do this via the GCP IAM API, creating a new service account every time. We then auth into the sheet through this service account.

nice solution! i've worked with updating sheets in real time with data from the Admin SDK using the Sheets API and service accounts, but yeah that domain-wide delegation is fine for our corp stuff but not so good for customers!

great work, will give this a try out for sure! :thumbsup:

ctc24 · on Jan 28, 2023

Exactly! Thanks for the kind words!

char_star_star · on Jan 27, 2023

I wonder how long it will be until you have a customer using this just to share data to their own internal teams just so the data team doesn't have to mess around with Sheets!

Also—I want Prequel for Zendesk and Greenhouse (and Asana and ...) so badly. There are so many more interesting things I want to be doing with my time at work than babysitting pipelines.

ctc24 · on Jan 27, 2023

We're actually starting to get those requests!

Really appreciate the kind words. We'll do our best to those teams offer data warehouse integrations, so you can focus on higher value data engineering work!

stephen · on Jan 27, 2023

Interesting! Any plans for SalesForce support?

We'd love to have a way to easily sync our internal system's data in/out of SFDC.

...and a source of GraphQL? :-)

One of our engineers recently suggested syncing our PG database to Airtable, solely b/c Airtable has out-of-the-box SFDC integration (webhooks/etc), so our SFDC team could get at the data easier than they could from our PG database.

I'm hesitant about "Airtable as our official 3rd-party integration strategy", but it does make me pine for a "protocol" for one-way/two-way real-time/batch syncing between two systems that just want to share "dumb" tables+fields/entities.

I was thinking Zapier might have that, like if we implemented the Zapier-TM backend protocol on top of our custom system, and it would ~insta integrate with everything, but in my ~10 minutes of scanning their docs, it seemed less "protocol" and more "configure all the things via clicking around".

ctc24 · on Jan 28, 2023

No near-term plan for Salesforce support from us, but that's an interesting use-case. Definitely check out the other tools folks mentioned if you haven't already, they should be able to get you most of the way there.

jimmyechan · on Jan 28, 2023

Is your team looking to do a 2-way sync between PG and SF for certain fields? Or is the use case something where data from SF needs to be enriched with data from PG and then pushed back to SF (or vice versa)? I'm curious to know which fields need to be synced that way.

With the approach your engineer recommended, it seems to imply that the sales team might also want to update data manually and those updates should be reflected in both PG and SF - is my understanding correct?

stephen · on Jan 28, 2023

Hi jimmy! Yep, invariably the ask always ends up to be 2-way syncing. :-/

Fwiw realistically I think these things are generally best written as custom software anyway, b/c things like identity (of entities/mapping ids correctly across both systems), the inevitable field mapping, etc. always come up, and with custom software I can write tests for all of them. :-)

jimmyechan · on Jan 30, 2023

Two-way is definitely more convenient. Custom could be a good approach, but could get difficult to maintain due corner cases or as schemas change from either side. I'd be curious to know how you end up solving for this! Good luck!

tayloramurphy · on Jan 28, 2023

Meltano[0] might be of interest to you. Easy way to move data that should be very familiar for software engineers. If a connector doesn't exist our SDK makes it easy to build it.

[0] https://github.com/meltano/meltano

(disclaimer - I work at Meltano)

dyeje · on Jan 28, 2023

Check out https://sequin.io/ I think it’s pretty much exactly what you’re looking for. Awesome team behind it too.

julee04 · on Jan 28, 2023

Hey Stephen,

I would check out http://hightouch.com/. You can easily sync data from a warehouse to SFDC. You can also sync PG -> Airtable as well!

rubenv · on Jan 27, 2023

Not very familiar with the sheets API, but was wondering: how are rate limits applied? Per sheet or in total?

ctc24 · on Jan 27, 2023

They're applied to the caller, so "in total" in this instance. They're pretty high, however -- 300 write requests per minute per GCP project.

rubenv · on Jan 27, 2023

That's 5 per second over all your customers? Not too worried about reaching that quickly?

ctc24 · on Jan 27, 2023

Not particularly. A large portion of our customers who sync data to Google Sheets use a daily frequency, so the theoretical upper limit is close to a half million sheets being written to (per GCP project). We have other projects available that we can start using once this gets close to becoming an issue.

potmat · on Jan 27, 2023

At my company we routinely write to hundreds/thousands of sheets a minute, you can get much higher quotas if you ask.

HyperSane · on Jan 28, 2023

What are they used for?

rubenv · on Jan 27, 2023

That's more reassuring indeed.

Cool project, good luck!

rexreed · on Jan 28, 2023

Would an alternative to this approach be to let your customers copy a spreadsheet of yours that already has an API function connection to your data? This would avoid the need to create users and upserts and all that, no? Or am I missing something?

ctc24 · on Jan 28, 2023

Yup, that would be another valid approach. We felt that the product experience was a lot cleaner / more in line with our general philosophy if we could write directly to the user's sheet, rather than ask them to import data from somewhere else, which is why we went this route.

darkhorse13 · on Jan 28, 2023

When copying a spreadsheet in Google Sheets, are you able to copy the API functions as well?

typingmonkey · on Jan 27, 2023

Is there a way to get a stream of changes out of prequel (via websocket)? That would make it possible to live-sync between the customer's devices.

ctc24 · on Jan 27, 2023

Can you tell us a bit more about the type of use-case you have in mind?

We support S3 as a destination, so you could listen for changes on a given S3 bucket and pipe that to a stream (eg as outlined here [0] or here [1]).

[0]: https://aws.amazon.com/blogs/big-data/streaming-data-from-am... [1]: https://stackoverflow.com/questions/48147123/how-to-read-dat...

typingmonkey · on Jan 27, 2023

The use case is to build a RxDB plugin that backups the users data into the prequel api.

mritchie712 · on Jan 27, 2023

You should get Apollo to buy Prequel. Fivetran doesn't support Apollo and I could really use my Apollo data in Snowflake.

ctc24 · on Jan 27, 2023

Ha, thanks for the suggestion. We'll reach out to them.

ian0 · on Jan 28, 2023

Nice! Is it a two way sync? I noticed the sheets were locked in the demo.

ctc24 · on Jan 28, 2023

It's currently a one-way sync, where data from the database gets written to the sheet. Another way of framing it is that it's a data export.

tlarkworthy · on Jan 27, 2023

What about importing sheets too?

ctc24 · on Jan 28, 2023

Stay tuned!

orlp · on Jan 27, 2023

That's a bit of an unfortunate name clash with PRQL, also pronounced prequel: https://prql-lang.org/

jlund-molfese · on Jan 28, 2023

I could be wrong, but Prequel (2020) seems to predate PRQL (2021)