..... must be pulling in enormous amounts of data just from domestic communications. I don't see how they can store it, categorize it, catalog it, decide what to delete, etc. etc. etc.
One word:
metadataJust like if you check you Alex app... it has (in a text form) all your recent commands/requests of your echo device. Or... if you speak your texts on your phone... you words are converted to text.
Using proper software.... metadata can also contain symbolic text that would reference accents, cadence of speech, volume, stress levels,
keywords, ETC.. Each encounter would (just like a simple HTML) only require maybe 60KB (yes KB). Not much of a problem to store. And I am sure there are algorithms that could compress that as well.
The organization of such text/data would be done in the exact same way Amazon separates your data from my data, your shopping preferences from my shopping preferences.... credit card numbers, addresses, movie and book ownership ETC..