What happens now to data in public AI?

01 User submits chats, queries, docs into web based public AI model

Once the data is submitted, it’s visible to the public AI model owner — it must be visible for processing. Data transmitted across the internet isn’t inherently visible, however, it does mean there are more third parties involved and there’s more chance of interception.
02 The public AI infra machine

Most public AI companies use data submitted by users to “improve their services” and, if not explicitly requested, they use for “training purposes”. The public AI infra stack normally includes their: backend OS; LLM; model training; product improvement; and data storage. Note that most public AI stores user data which creates more data breach risk.
03 Third parties process much of users' data

Most public AI companies send user data to many third party processors. E.g., OpenAI uses Microsoft, Cloudflare, CoreWeave, Oracle, Google, Snowflake, Salesforce and many more, and in numerous countries. OpenAI’s dynamic Sub-Processor List is here.
04 The upshot

User and company data, personal and confidential info, and IP are shared with many unknown third parties in different countries for commercial reasons that don’t benefit the user. This increases company costs and may lead to possible harm later from data misuse.

see DATA FLOW WITH ZAI NODE (we HOST)

see DATA FLOW WITH ZAI NODE (YOU HOST)

Public AI’s data concerns

AI is data-breach time bomb

AI is everywhere. Once unleashed AI acts like a hungry Pac-Man, scanning and analyzing all the data it can grab. If AI surfaces critical data where it doesn’t belong, it’s game over.
Bleeping Computer, Varonis, 2025
Samsung bans ChatGPT

Samsung has banned the use of ChatGPT and other AI-powered chatbots by its employees, amid concerns about sensitive internal information being leaked on such platforms.
Forbes, 2024
McDonald’s AI recruiter exposes staff data

McDonald’s AI recruitment system has exposed sensitive staff data to hackers. This incident raises serious questions about corporate responsibility and the protection of data.
The Bridge Chronicle, 2025
Even VIPs want private AI too

Matthew McConaughey wants a language model trained entirely on his own writings, books, journals, and personal collections.
Economic Times, 2025
Report highlights security risks of open source AI

Security in open source AI projects is a major concern, as the report reveals more than half of organizations use open source components in at least half of their AI/ML projects.
Anaconda, 2025
Why Gen AI misuse across borders will increase data breaches

By 2027, more than 40% of AI-related data breaches will be caused by Gen AI misuse across borders, predicts Gartner.
Cyber Magazine, Gartner, 2025
Healthcare turns to AI scribes, so what are the risks?

More hospitals are turning to AI health technology to help lighten the admin load, but experts say there are real risks, including accuracy and data protection.
ABC, 2025
Altman warns there is no confidentiality with ChatGPT

OpenAI says that users have very little control over what happens after they hit send. In fact, OpenAI staff might access your conversations for moderation or training.
Genius Firms, 2025
Amazon AI coding assistant exposes 1 million users

The breach exposed critical flaws in how AI tools are integrated into software development pipelines. It's a moment of reckoning for the developer community.
TechSpot, 2025

01 User submits chats, queries, docs into web based public AI model

02 The public AI infra machine

03 Third parties process much of users' data

04 The upshot

AI is data-breach time bomb

Samsung bans ChatGPT

McDonald’s AI recruiter exposes staff data

Even VIPs want private AI too

Report highlights security risks of open source AI

Why Gen AI misuse across borders will increase data breaches

Healthcare turns to AI scribes, so what are the risks?

Altman warns there is no confidentiality with ChatGPT

Amazon AI coding assistant exposes 1 million users