“... In some cases, teams use the audio of users’ voice requests as training data—all anonymized, Apple says.
> We leave out identifiers to avoid tying utterances to specific users, so we can do a lot of machine learning and a lot of things in the cloud without having to know that it came from [the user],” Joswiak said. In other words, Siri can learn things about users as a whole without tapping into individuals’ personal data.
> Apple holds on to six months’ worth of the user voice recordings to teach the voice recognition engine to better understand the user
> After that six months, Apple saves another copy of the recordings, sans user ID, for use in improving Siri, and these recordings can be kept for up to two years.
> The training happens on Apple’s servers, but the models only start practicing what they’ve learned when they’ve been deployed to your device.
> Once on the device, the models begin to run computations on things you type or tap into your device, or on things that are seen in the device’s camera, heard through the microphone, or sensed by the device’s sensors. Over time, this creates a massive pile of personal data on the device, as much as 200MB worth.
> We leave out identifiers to avoid tying utterances to specific users, so we can do a lot of machine learning and a lot of things in the cloud without having to know that it came from [the user],” Joswiak said. In other words, Siri can learn things about users as a whole without tapping into individuals’ personal data.
> Apple holds on to six months’ worth of the user voice recordings to teach the voice recognition engine to better understand the user
> After that six months, Apple saves another copy of the recordings, sans user ID, for use in improving Siri, and these recordings can be kept for up to two years.
> The training happens on Apple’s servers, but the models only start practicing what they’ve learned when they’ve been deployed to your device.
> Once on the device, the models begin to run computations on things you type or tap into your device, or on things that are seen in the device’s camera, heard through the microphone, or sensed by the device’s sensors. Over time, this creates a massive pile of personal data on the device, as much as 200MB worth.
https://www.fastcompany.com/40443055/apple-explains-how-its-...