Morning Overview

A study found ChatGPT trains on your chats by default unless you dig in and switch it off.

Millions of people type personal details, medical questions, and financial concerns into ChatGPT every day, and unless they have found a buried settings toggle, those conversations feed back into OpenAI’s training pipeline. A research preprint analyzing donated chat histories from more than 1,000 users across four countries now quantifies what that default means for privacy: even after names and obvious identifiers are stripped from logs, an off-the-shelf model can still infer a user’s age, gender, and country at high accuracy rates.

How anonymized chat logs still expose personal details

The paper, titled “Inferential Privacy Leakage in Anonymized Conversational AI Logs,” was posted on arXiv as a research preprint. Its dataset consists of complete ChatGPT conversation histories voluntarily donated by more than 1,000 participants spread across four countries. The researchers did not scrape accounts or exploit any data breach. Instead, they worked with logs that users handed over, then anonymized those logs by removing explicit self-identifying information before running inference attacks against them.

The central finding is blunt. An off-the-shelf classification model, not a custom-built system or a state-of-the-art research tool, was able to infer sensitive attributes such as age, gender, and country of residence at high F1 scores. F1 is a standard machine-learning metric that balances precision and recall; a high score means the model correctly identified these attributes with few false positives and few misses. The implication is that the patterns people leave in ordinary conversation-the topics they ask about, the phrasing they use, even the cadence of follow-up questions-carry enough signal to re-identify them even when traditional identifiers have been removed.

This matters because ChatGPT’s default behavior is to retain conversations and use them for model improvement. Users can disable this by navigating through settings menus, but the option is not surfaced during onboarding or in any prominent way during regular use. The practical result is that the vast majority of conversations are likely retained with training enabled, creating a growing corpus of material that, as this research demonstrates, carries inferential risk even after standard anonymization.

What the training toggle actually controls

OpenAI allows users to opt out of having their chats used for training by toggling a setting inside the data controls menu. When that toggle is off, conversations are still stored for abuse monitoring and safety purposes, but they are excluded from the datasets used to improve future models. When it is on, which is the default state for typical consumer accounts, conversations become part of the training pipeline and may influence how future systems respond.

The research preprint does not directly test whether accounts with training enabled face higher re-identification risk than accounts that disabled training before the study period. That distinction is worth watching. If the logs used for training carry the same inferential leakage documented in the paper, then the default setting effectively feeds re-identifiable data into model weights at scale. Accounts that opted out would still have their logs stored temporarily, but those logs would not propagate into training data where inferential patterns could compound over time.

A testable hypothesis follows from the findings: accounts that keep the training toggle on should show measurably higher rates of successful attribute inference in any future analysis of leaked or disclosed logs, compared to accounts that disabled training early. The preprint does not confirm this directly, but the mechanism it documents-high-accuracy inference from anonymized conversation patterns-suggests the risk scales with the volume and diversity of retained data.

Limits of the evidence and open questions

Several gaps in the current evidence deserve attention. The preprint does not include any primary OpenAI policy document or engineering log confirming the exact rollout date of the training toggle or its precise default state across all account types and regions. The researchers relied on donated histories, which means the sample is self-selected. People who volunteer their chat logs may differ systematically from the broader user base in how they use the tool, what topics they discuss, and how much personal information they share.

The study also does not establish whether the analyzed logs originated from accounts that had training enabled or disabled. That distinction would be necessary to draw a direct causal line between the default setting and the privacy leakage the paper documents. Without it, the findings demonstrate that anonymized ChatGPT logs in general carry inferential risk, but they stop short of proving that the training pipeline specifically amplifies that risk.

No data from OpenAI exists in the public record showing how many accounts have the training option enabled versus disabled. That number would help estimate the scale of exposure. The arXiv member information, which describes how research institutions support the repository, underscores that the platform is an independent preprint server rather than an industry transparency portal, so it does not provide additional context on the source of the donated logs or any institutional relationship with OpenAI.

Another open question is how representative the four countries in the dataset are of global ChatGPT use. Linguistic cues, cultural references, and even time-of-day patterns vary widely across regions. A model that performs well at inferring attributes in one set of countries may perform differently elsewhere. The preprint’s results therefore highlight a mechanism of privacy leakage rather than delivering a complete global risk map.

Finally, the paper focuses on a limited set of attributes-age bracket, gender, and country. In practice, similar inference attacks could target occupation, political leanings, health status, or other highly sensitive traits. The fact that a generic classifier already performs well on basic demographics suggests that more specialized models, given access to the same anonymized logs, could push further into intimate territory.

What users should do before their next conversation

For anyone who has never changed the default, the first step is straightforward. Open ChatGPT’s settings, find the data controls section, and turn off the toggle labeled “Improve the model for everyone” or its current equivalent. This does not delete past conversations that were already used for training, but it stops future chats from entering the pipeline. Users who want to go further can delete individual conversations or request full data deletion through OpenAI’s privacy tools.

The broader question is whether anonymization, as currently practiced, provides meaningful protection when conversational patterns themselves are so revealing. The preprint’s findings suggest that simply stripping names, email addresses, and obvious identifiers from logs is not enough. As long as the underlying text remains intact, models can exploit subtle regularities in language and behavior to reconstruct who is speaking and what broad categories they belong to.

That does not mean users are powerless. Being cautious about sharing directly identifying details-full names, addresses, account numbers-still matters, especially because those details can be copied, pasted, or misused outside the training context. But it does mean that even seemingly harmless small talk with a chatbot, repeated over months or years, contributes to a behavioral profile that is hard to unwind once it has been baked into model weights.

On the institutional side, the study adds weight to calls for stricter limits on retention and secondary use of conversational data. One proposal is to keep logs needed for safety monitoring in a segregated system with tight access controls, never mixing them into training datasets. Another is to explore stronger technical protections, such as differential privacy, that deliberately inject noise into training so that individual conversations leave a weaker imprint.

None of those measures will emerge from preprints alone. The infrastructure that underpins repositories like arXiv depends on sustained support from universities and funders, and its operators explicitly invite that backing through donation channels. As more research on conversational AI privacy appears there, policymakers and regulators will have better evidence to weigh trade-offs between innovation and protection.

For now, the lesson for everyday users is uncomfortably clear. Even when platforms promise that chat logs are anonymized, the way we talk-our questions, our habits, our interests-can be enough to give us away. Turning off training where possible, pruning old conversations, and treating AI chats with the same caution as any other online posting are modest steps, but they may be the only tools individuals have until product defaults and legal standards catch up with what the data already reveals.

More from Morning Overview

*This article was researched with the help of AI, with human editors creating the final content.