Interesting thought: is it easier or more difficult to make a synthetic voice undistinguishable from a human one, compared to producing speech copying a real voice?
To me, at least, the voices I heard in Lyrebird's demo actually sounded more 'real' than Microsoft Sam for example.
Of course, the voices produced by Lyrebird sound more "real" than Microsoft Sam, since I would define realness as sounding humanlike. However, I would strongly prefer Microsoft Sam over something generated by this algorithm for general use because this algorithm produces voices that are still in an uncanny valley, because it is almost human but not human enough, whereas Microsoft Sam is obviously not human.
To me, at least, the voices I heard in Lyrebird's demo actually sounded more 'real' than Microsoft Sam for example.