These datasets are all biased towards work published in the digital age, but it's important to note that work is coming out much faster now than it used to.
Is that because there is a pressure to publish? As I wouldn't say we make advancements at a rate any different during the last two decades than we have over the 20 years prior to that.
CORE has 431M. https://core.ac.uk/data
Crossref has 165M. https://www.crossref.org/blog/2025-public-data-file-now-avai...
These datasets are all biased towards work published in the digital age, but it's important to note that work is coming out much faster now than it used to.