
In other words, good quality book metadata is expensive. Book recommendations/reviews haven't been profitable enough to justify such investment. If I were a publisher, I'd put my money on the book influencers, not on Goodreads and its alternatives anyway.


It isn't expensive... it doesn't exist. Ingram's API is probably the best and it is messy as hell. I've been playing with it and it will help augment data, but you need humans to really fine-tune things. I just entered 2,500 books manually to launch this on Monday (https://shepherd.com/) and I am still going to need to go back and build relationships between the types of authors (lead, translator, illustrator, etc). It is a hard problem.
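To make the author-type relationships concrete, here is a minimal sketch of one way they could be modelled. It is only an illustration, not how Shepherd or Ingram actually store anything: the `Role`, `Person`, and `Book` names are hypothetical, the role list just mirrors the three examples above, and the sample data is placeholder text.

```python
from dataclasses import dataclass, field
from enum import Enum


class Role(Enum):
    # Role names taken from the comment above; a real catalogue would need more.
    LEAD = "lead"
    TRANSLATOR = "translator"
    ILLUSTRATOR = "illustrator"


@dataclass(frozen=True)
class Person:
    name: str


@dataclass
class Book:
    title: str
    # Each entry links one person to this book in one role.
    contributions: list[tuple[Person, Role]] = field(default_factory=list)

    def add(self, person: Person, role: Role) -> None:
        entry = (person, role)
        if entry not in self.contributions:  # skip duplicate links
            self.contributions.append(entry)

    def people_with(self, role: Role) -> list[Person]:
        return [p for p, r in self.contributions if r is role]


# Hypothetical placeholder data, not real catalogue records.
book = Book("Example Title")
book.add(Person("Author A"), Role.LEAD)
book.add(Person("Translator B"), Role.TRANSLATOR)
book.add(Person("Author A"), Role.LEAD)  # duplicate, ignored
```

Keeping the role on the link rather than on the person means the same person can be lead author on one book and translator on another, which is the part that tends to need manual cleanup.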

Ingram's is like $1k a month, maybe $2k for the full flat files. I can't find anything that is high quality and with a bigger data set.


I like what you're doing with Shepherd.com. I think the idea of curated reading lists is a good one and I've often contemplated it.

The most important things I was given at university were reading lists. I often wondered if getting students from courses all over the world to send in copies of their university course reading lists would be possible/useful. To give a concrete example: on my Eng lit course in the 90s, each year we would get 6 reading lists covering different topics/historical periods, each with a thematic direction, like (made up) Social Mobility in Renaissance Literature. These lists are produced by people with significant expertise and strong opinions, who also have to ensure certain ground is covered. It seems trivial in a sense, but when you come fresh to a topic it's actually hard to get a great list!

Anyway I think what you're doing is excellent. A question and an observation:

- Why not use openlibrary as referenced in the top comment here (https://news.ycombinator.com/item?id=26837057)?

- I like the author-led lists and I get why the page is strongly focused on who the recommendation is coming from / why I should care... but there was also a part of me that wanted to quickly see the general tenor of the recommendations. Not sure how one could possibly satisfy both desires above the fold! But anyway it was a "feeling".


Thanks!

I don't want to get yelled at... but in my testing OpenLibrary data was some of the worst out of all the datasets/APIs I tested. I would love to see that project succeed. I am not sure how they are going to achieve what they are trying to do within the current labor/"business" model.

I hear you on the general tenor of the recommendations. If you have a second, shoot me an email at ben@shepehrd.com as I'd love to get your thoughts on something.


> It isn't expensive... it doesn't exist.

It's the same thing, but I get what you mean. By expensive, I mean too expensive to justify investing in building it, not expensive to buy off-the-shelf.


Ah gotcha. I have a dream of one day combining the data I am putting together with Storygraph or others and trying to do an at-cost API for book data to encourage more reading/book apps. There is so much free time being put into Goodreads that doesn't seem to do anyone any good.


I did some data management a while back for higher education institutions in Brazil and took that opportunity to contribute to Wikidata.

Not a recommendation, just a shoutout to them. Also, OpenRefine helped so much, kudos to all involved.



