The backend has also evolved massively, from HBase to cloud-powered data warehouses. We can now ingest and query petabytes with delays of only a second or so. There are also on-demand query engines like Presto/Drill/Dremio, ETL tools like dbt, and the growing space of "data lineage" for seeing how data is connected and how it has evolved over time.
The real issue has always been organizational: in larger teams and companies, data gets split into multiple silos and needs ETL and cleanup before it's useful. The new capabilities have increased complexity and scale, which can create new challenges, but the tools are definitely getting better every day.