I appreciate the reasoning, but I wonder whether the comparison with voice assistants is computationally fair; Federated learning might address the training problem along with the privacy.
I would have liked on-device processing + private mode like in 3rd party smartphone keyboards with a minimal language model and a cloud feature for better accuracy/sync facilities for those who need it.