We need more tokens, more variety of topics in texts and more complexity.
(That amount is equivalent to 50000 books, which few nationals will have read.)