Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

OpenAlex has 240M. https://docs.openalex.org/api-entities/works

CORE has 431M. https://core.ac.uk/data

Crossref has 165M. https://www.crossref.org/blog/2025-public-data-file-now-avai...

These datasets are all biased towards work published in the digital age, but it's important to note that work is coming out much faster now than it used to.



So indeed, order 10^9 not 10^8, given the CORE at > sqrt(10)*10^8.


Is that because there is a pressure to publish? As I wouldn't say we make advancements at a rate any different during the last two decades than we have over the 20 years prior to that.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: