OpenAI develops internal AI agent to accelerate data-driven decision making

OpenAI has built an internal artificial intelligence agent designed to help employees explore its data platform and generate faster answers to business and product questions.

The company said in a blog post that the agent is for internal use only and integrates with its own data, workflows and permission systems. It uses OpenAI’s Codex, the GPT‑5.2 model, the Evals API and the Embeddings API.

Serving more than 3,500 internal users, the tool spans 70,000 datasets and 600 petabytes of data. It is available through Slack, a web interface, integrated development environments, the Codex command-line interface via MCP, and OpenAI’s internal ChatGPT app through an MCP connector.

According to the company, the agent moves users from question to insight in minutes and includes a memory system that learns continuously from interactions.

Its reasoning draws on multiple context layers, including dataset usage and lineage, human annotations, Codex-derived code enrichment, institutional content from Slack, Google Docs and Notion, persistent memory, and live queries executed at runtime.

A daily offline pipeline aggregates usage data, annotations and code-based enrichment into embeddings, which are used at query time for retrieval-augmented generation. OpenAI said this ensures that the agent can surface the most relevant context in response to user questions.
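The two-stage pattern described above (embed offline, retrieve at query time) can be sketched in a few lines. This is a minimal illustration, not OpenAI's implementation: `toy_embed` is a hypothetical stand-in for a real embedding model, and the dataset names and descriptions are invented for the example.

```python
import hashlib
import math
from collections import Counter

DIM = 64  # size of the toy embedding vector (assumption for this sketch)

# Hypothetical dataset descriptions, standing in for usage data and annotations.
DOCS = {
    "orders_daily": "daily order counts and revenue by region",
    "signup_events": "user signup events with referral source",
    "support_tickets": "customer support tickets and resolution time",
}

def toy_embed(text: str) -> list[float]:
    """Stand-in for an embedding model: hashed bag-of-words, L2-normalised."""
    vec = [0.0] * DIM
    for token, count in Counter(text.lower().split()).items():
        idx = int(hashlib.md5(token.encode()).hexdigest(), 16) % DIM
        vec[idx] += count
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec

def build_index(docs: dict[str, str]) -> dict[str, list[float]]:
    """Offline step: embed every dataset description once, e.g. in a daily job."""
    return {name: toy_embed(text) for name, text in docs.items()}

def retrieve(index: dict[str, list[float]], question: str, k: int = 2):
    """Query-time step: rank datasets by cosine similarity to the question."""
    q = toy_embed(question)
    scored = [
        (name, sum(a * b for a, b in zip(q, vec)))  # vectors are unit length
        for name, vec in index.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]
```

In a retrieval-augmented setup, the top-ranked descriptions would then be placed in the model's prompt alongside the user's question.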

Quality control runs through the Evals API, which pairs natural-language questions with validated SQL examples. The company said it compares the generated SQL and its query output against these known-good results to detect regressions.
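A regression check of this kind can be approximated with a small harness. The sketch below is an assumption-laden illustration and does not use the Evals API: it runs against an in-memory SQLite table, and `fake_generate_sql` is a hypothetical stub standing in for the agent, with one answer made deliberately wrong.

```python
import sqlite3

# Each case pairs a natural-language question with a validated ("golden") query.
EVAL_CASES = [
    {"question": "How many orders were placed?",
     "golden_sql": "SELECT COUNT(*) FROM orders"},
    {"question": "What is total revenue?",
     "golden_sql": "SELECT SUM(amount) FROM orders"},
]

def make_db() -> sqlite3.Connection:
    """Build a tiny fixture database for the evaluation run."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL)")
    conn.executemany("INSERT INTO orders (amount) VALUES (?)",
                     [(10.0,), (25.5,), (4.5,)])
    return conn

def fake_generate_sql(question: str) -> str:
    """Hypothetical stand-in for the agent; the second answer is wrong on purpose."""
    canned = {
        "How many orders were placed?": "SELECT COUNT(id) FROM orders",
        "What is total revenue?": "SELECT MAX(amount) FROM orders",
    }
    return canned[question]

def normalise(sql: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting does not matter."""
    return " ".join(sql.lower().split())

def run_evals(conn, cases, generate_sql):
    """Compare generated SQL and its output against the golden query's output."""
    report = []
    for case in cases:
        generated = generate_sql(case["question"])
        expected = sorted(conn.execute(case["golden_sql"]).fetchall(), key=repr)
        try:
            actual = sorted(conn.execute(generated).fetchall(), key=repr)
        except sqlite3.Error:
            actual = None  # invalid SQL counts as a failure
        report.append({
            "question": case["question"],
            "sql_identical": normalise(generated) == normalise(case["golden_sql"]),
            "passed": actual == expected,
        })
    return report

report = run_evals(make_db(), EVAL_CASES, fake_generate_sql)
```

Comparing result sets rather than SQL text alone lets a differently worded but equivalent query pass, while a query that returns different data is flagged as a regression.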

The agent preserves permission boundaries through pass-through access and displays its reasoning alongside each answer, with links to underlying data sources. OpenAI said it continues to improve the tool’s reliability, validation methods and workflow integration.
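Pass-through access generally means the agent reads data with the requesting user's own grants rather than a privileged service account. The sketch below illustrates that idea only; the `Warehouse` class, the grant table and the user names are hypothetical and are not drawn from OpenAI's system.

```python
from dataclasses import dataclass

class AccessDenied(Exception):
    """Raised when a user has no grant on the requested dataset."""

@dataclass
class Warehouse:
    grants: dict[str, set[str]]      # user -> datasets the user may read
    tables: dict[str, list[dict]]    # dataset -> rows

    def read(self, user: str, dataset: str) -> list[dict]:
        # The check uses the caller's identity, never the agent's.
        if dataset not in self.grants.get(user, set()):
            raise AccessDenied(f"{user} may not read {dataset}")
        return self.tables[dataset]

@dataclass
class Answer:
    text: str
    sources: list[str]  # datasets the answer was derived from

def answer_row_count(warehouse: Warehouse, user: str, dataset: str) -> Answer:
    """Answer a simple question on the user's behalf, citing the source used."""
    try:
        rows = warehouse.read(user, dataset)
    except AccessDenied as exc:
        return Answer(text=f"Cannot answer: {exc}", sources=[])
    return Answer(text=f"{dataset} has {len(rows)} rows", sources=[dataset])

warehouse = Warehouse(
    grants={"alice": {"sales"}},
    tables={"sales": [{"id": 1}, {"id": 2}]},
)
allowed = answer_row_count(warehouse, "alice", "sales")
denied = answer_row_count(warehouse, "bob", "sales")
```

Returning the list of sources with every answer mirrors the article's point that results link back to the underlying data.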

The Recap

OpenAI built an internal agent to explore company data.
The platform covers 70,000 datasets and 600 petabytes of data for more than 3,500 users.
Daily offline pipeline produces embeddings used at query time.
