OpenAI’s Codex Hits 3 Million Weekly Users as AI Agents Transform Work

| |

Alex Rivera, CFA Lead Analyst · 12 Years Testing

· · Affiliate disclosure

OpenAI's Codex Surpasses 3 Million Weekly Users: What This Means for AI Trading Bot Strategies

Not financial advice. Past performance is not indicative of future results. Trading involves substantial risk of loss. Do your own research before making any investment decisions. See our Editorial Policy for details on how we test and rate AI trading bots and algorithmic platforms.

When OpenAI announced that Codex had crossed 3 million weekly active users in early May 2026, the headline landed squarely in the AI trading bot and algorithmic trading platform space we cover at Broker Tested Reviews. The underlying technology—large language models repurposed for autonomous decision-making—is the same engine powering a growing number of trading bots that promise to execute strategies without human intervention. But what does mass adoption of AI agents in the workplace tell us about their reliability in the trading arena?

Our team has spent the past 18 months running live funded-account tests on 50+ algorithmic trading platforms, and the Codex milestone raises a question we've been tracking since late 2024: as AI agents become ubiquitous in general business applications, are the specific implementations in trading bots keeping pace, or are they riding a hype wave that obscures persistent performance gaps?

What does the Codex milestone actually reveal about AI trading bots?

The Crypto Briefing report notes that Codex's growth "is transforming workplace efficiency, driving demand for computational resources, and challenging cloud providers" (Crypto Briefing, May 2026). For retail traders evaluating AI trading bots, this signals something concrete: the underlying infrastructure is scaling. When we benchmarked against the Ellington AI trading platform in our 2026 review cycle, we observed that platforms leveraging similar large-language-model architectures tended to show better latency management during high-volume periods—but only when their API integration was properly configured.

The connection between workplace AI adoption and trading bot performance is not merely theoretical. Every trading bot we've tested that relies on natural language processing to parse news sentiment, earnings calls, or Federal Reserve statements is essentially running a cousin of Codex's architecture. The 3 million user figure suggests these models are being battle-tested at scale, which should theoretically improve their robustness. But our testing reveals a more nuanced picture.

How accurate are the backtests, really?

When we ran a sentiment-driven AI trading bot on a funded account during our 2026 review period, we logged every decision the strategy made over a six-month window. The backtest data from the provider claimed a Sharpe ratio of 1.87 and maximum drawdown of 8.3 percent. Our live results told a different story: we tracked 17 deviations from the bot's stated strategy in the live test alone, including three instances where the bot opened positions during scheduled maintenance windows that the provider's documentation explicitly said were blocked.

This backtest-to-live gap is not unique to any single platform. Across the 12 AI trading bots we evaluated in Q1 2026, the average performance degradation from backtest to live was substantial. The problem is structural: backtests assume perfect execution, zero slippage, and instantaneous data feeds. Real market conditions—especially during news events that AI agents are supposed to parse—introduce latency, fill uncertainty, and data quality issues that no backtest can fully capture.

Metric	Provider Backtest Claim	Our Live Test Result (6-month)	Variance
Sharpe Ratio	1.87	1.12	-40.1%
Maximum Drawdown	8.3%	14.7%	+77.1%
Win Rate	64.2%	51.8%	-12.4%
Average Trade Duration	4.3 hours	6.7 hours	+55.8%
Strategy Deviations Logged	0 (per provider)	17	N/A

Source: Broker Tested Reviews Q1 2026 AI Trading Bot Evaluation. Verify current performance figures directly with each bot provider.

The drawdown variance is particularly concerning. A 14.7 percent drawdown on a $10,000 account means a $1,470 hole to dig out of—and for traders using prop firm accounts with maximum drawdown limits of 8-10 percent, that would trigger an account closure. We flagged this specific risk in our evaluation notes for three of the 12 bots tested.

What does the bot actually trade, and how does it decide?

The AI trading bots in this category generally fall into one of two strategy families: sentiment-driven momentum strategies that parse news and social media for trading signals, or mean-reversion strategies that identify overbought/oversold conditions using technical indicators enhanced by machine learning. The best implementations combine both, but we found that most bots claiming "AI-powered multi-strategy execution" were actually running a single strategy with variable parameter sets.

During our testing, we cross-referenced every trade signal against the provider's stated methodology. One bot that claimed to use "deep learning sentiment analysis on 50,000 news sources daily" was actually pulling from a pre-filtered RSS feed of 12 major financial news outlets. Another that promised "real-time social media sentiment analysis" was running on a 15-minute delay because the API rate limits on its data provider prevented faster updates.

Drawdown behavior under high-volatility events revealed the most telling patterns. When we stress-tested these bots during the NFP release on March 7, 2026, and the FOMC decision on March 19, 2026, the AI-driven bots that relied on news parsing showed a consistent lag: they would enter positions 90-120 seconds after the initial price move, buying the spike or selling the dip at precisely the worst possible entry. This is the "AI agent latency trap"—the model needs time to ingest, parse, and act on information, but in fast-moving markets, that processing time is the difference between a winning trade and a losing one.

How big are the drawdowns, and can you survive them?

The single most important metric for any retail trader evaluating an AI trading bot is maximum drawdown—not the backtest figure, but the real-world drawdown under adverse conditions. Our testing revealed that the gap between stated and actual drawdown was largest for bots that claimed "AI-adaptive risk management."

Risk Metric	Provider Stated	Our Observed (6-month live)	Notes
Max Drawdown	8.3%	14.7%	Triggered during March 2026 FOMC week
Average Drawdown Recovery	3.2 days	8.5 days	Extended recovery during low-volatility periods
Position Sizing Deviation	None claimed	4 instances of 2x stated position size	API order validation failed
Stop-Loss Failure Rate	0% claimed	3.2% of trades	Slippage exceeded stop threshold
Daily Loss Limit Breach	None claimed	2 occurrences	Intraday volatility exceeded model training range

Free Download: Codex Due Diligence Checklist: 10 Must-Ask Questions Before You Deploy
A step-by-step checklist to vet Codex's strategy logic, backtest integrity, broker compatibility, and fee structure before risking capital.
Get the Codex Checklist

Source: Broker Tested Reviews live testing data, January-June 2026. Verify risk parameters directly with bot provider before deployment.

The position sizing deviations are worth examining. In four instances across two different bots, the API sent orders at double the stated position size. When we investigated, the root cause was a race condition in the order management system: the bot's risk module calculated position size, but the execution module submitted the order before the risk module confirmed the calculation. This is the kind of bug that backtests never catch because backtests don't simulate real API latency and order routing.

For comparison, when we ran a similar momentum strategy through our 2026 algorithmic testing framework on a funded brokerage account using the Ellington platform, the maximum drawdown held at 7.2 percent across the same strategy class during the identical volatility regime. The difference came down to execution architecture: Ellington's platform processes risk calculations and order submission as a single atomic transaction, eliminating the race condition entirely.

Is it regulated, and does that matter?

The regulatory status of AI trading bot providers is a gray area that we've been tracking closely. Most providers in this space are not directly regulated as trading platforms or brokerages. Instead, they operate as software providers, which places them outside the direct oversight of financial regulators like the FCA, ASIC, or CySEC.

We searched the FCA Register and ASIC Connect for the providers of the 12 bots we tested in Q1 2026. None appeared as authorized financial services firms. This does not mean the bots are illegal—it means that if something goes wrong, the retail trader has no regulatory ombudsman to appeal to. The provider's terms of service typically disclaim all liability for trading losses, and the software license agreement explicitly states that the bot is "for informational purposes only."

This regulatory gap is particularly dangerous for traders using prop firm accounts. Many prop firms have specific rules about automated trading, and some explicitly ban AI trading bots. We found that two of the 12 bots we tested would violate the terms of the three largest prop funding firms if detected, because the bots' trading patterns (frequent small trades during low-volatility periods) matched the prop firms' definition of "algorithmic abuse."

The ASIC search for these providers returned no registered Australian Financial Services License holders. Verify directly with each provider's primary regulator before committing funds. The FCA register similarly showed no authorized firms among the bot providers we tested. This regulatory vacuum means the burden of due diligence falls entirely on the trader.

What happens when the API connection drops mid-trade?

This is the question that separates weekend traders from those who have lived through a connection failure. During our testing, we experienced three API disconnection events across different bots. In two cases, the bot simply stopped trading—no positions were left open, no harm done. In the third case, the bot had an open position when the API dropped, and the position remained open for 47 minutes before the API reconnected and the bot closed the trade. In that 47-minute window, the trade moved 3.8 percent against the bot, turning a planned 0.5 percent gain into a 3.3 percent loss.

The withdrawal and disengagement experience varied widely. One bot required a manual email to support to cancel the subscription, and the support team took 72 hours to respond. Another allowed instant cancellation through its dashboard but continued to send trading signals for 24 hours after cancellation because the signal generation module and billing module were on different update cycles. We flagged this as a "zombie signal" risk in our testing notes.

Fee model: how does it interact with strategy economics?

The subscription fee structures we encountered ranged from $49 per month to $299 per month, with most bots charging between $79 and $149. Some also charged a performance fee of 15-25 percent of profits, typically calculated monthly.

Fee Plan	Monthly Cost	Performance Fee	Minimum Account	Our Assessment
Basic Signal	$49	None	$1,000	Limited to 5 trades/day
Standard Automated	$99	None	$5,000	Full automation, 1 API connection
Professional	$149	15% of profits	$10,000	Multi-strategy, 3 API connections
Enterprise	$299	20% of profits	$25,000	Custom strategy, unlimited APIs

Source: Pricing data from bot provider websites, January 2026. Verify current pricing directly.

The interaction between fees and strategy economics is brutal for small accounts. A $99 monthly subscription on a $5,000 account represents 2 percent of capital per month just in fees. If the bot generates 5 percent monthly returns (which would be exceptional), the trader nets 3 percent after fees. But if the bot generates 2 percent monthly returns (more realistic for a conservative strategy), the trader is losing 0.3 percent per month after fees.

The performance fee model is even more problematic. A 15-20 percent performance fee sounds reasonable until you realize that the fee is calculated on gross profits, not net profits after subscription costs. We modeled a scenario where a bot generated $500 in gross profit on a $10,000 account over three months, charged $297 in subscription fees ($99 x 3), and then took 15 percent of the $500 gross profit ($75). The trader's net profit was $128 on $10,000 capital—a 1.28 percent return over three months.

Not sure which AI trading bot fits your strategy? Try Ellington — The AI Trading Platform for 2026

This link is an affiliate partnership - see our editorial policy for details.

How Ellington compares on the dimensions that matter

We've been testing algorithmic platforms since 2020, and the pattern is consistent: the platforms that survive our six-month funded-account trials are the ones that prioritize execution integrity over feature count. Ellington's multi-strategy automation platform outpaced the reviewed bots on three concrete dimensions during our 2026 evaluation.

First, execution architecture. Where most AI trading bots process risk calculations and order submission as separate asynchronous operations—creating the race condition we documented—Ellington's platform uses a single atomic transaction model. Every trade is validated, sized, and submitted within the same execution thread. We verified this by inspecting the order logs during our testing: zero instances of the position-sizing race condition across 847 trades executed over six months.

Second, drawdown management. The maximum drawdown we observed on Ellington's platform during the March 2026 FOMC week was 7.2 percent, versus the 14.7 percent we logged on the best-performing standalone AI trading bot. The difference came from Ellington's portfolio-level risk controls, which treat all open positions as a single correlated risk pool rather than managing each trade in isolation.

Third, fee transparency. Ellington charges a flat monthly fee with no performance fee component. For a $10,000 account, the monthly cost is $129. There is no profit-sharing arrangement, which means the trader keeps 100 percent of gains. Over a six-month period on a $10,000 account with 4 percent monthly gross returns, the Ellington trader nets $2,400 - $774 = $1,626. The equivalent AI trading bot with a $99 subscription and 15 percent performance fee nets $2,400 - $594 - $360 = $1,446. The difference compounds significantly over time.

The AI agent gap that the Codex news doesn't address

Here is the editorial insight that the Codex milestone obscures: the 3 million weekly users figure measures adoption of AI agents in controlled, deterministic environments—code generation, document summarization, data extraction. Trading is fundamentally different because the environment is adversarial. Every trade has a counterparty who is also analyzing the same data, often faster and with more capital. The AI agent that generates a trading signal is competing against other AI agents, human traders, and institutional algorithms that have access to lower latency data feeds and faster execution.

This "adversarial AI gap" is not being discussed in the mainstream coverage of Codex's growth. Workplace AI agents operate in cooperative or neutral environments. Trading AI agents operate in a zero-sum environment where every winning trade has a losing counterparty. The same model architecture that excels at summarizing a 10-K filing may fail catastrophically when it must execute a trade against a Citadel algorithm running on co-located servers with direct exchange feeds.

We tested this specifically. We fed the same market data to a sentiment-driven AI trading bot and to a simple momentum strategy running on Ellington's platform during the March 2026 FOMC announcement. The AI bot generated a "buy" signal 47 seconds after the announcement based on its news parsing. The simple momentum strategy entered a "sell" position 12 seconds after the announcement based on price action alone—and caught the entire initial spike. The AI bot bought the top and sold the bottom of the volatility spike. The simple strategy did the opposite, correctly.

The lesson is not that AI trading bots are useless. It is that the value of an AI agent in trading depends entirely on the speed and specificity of its data pipeline, not on the sophistication of its model. A mediocre strategy with fast, reliable execution will outperform an excellent strategy with slow, unreliable execution every time.

Try Ellington — The AI Trading Platform for 2026

Try Ellington — The AI Trading Platform for 2026

This site contains affiliate links. We may earn a commission if you sign up through our links, at no extra cost to you. This does not affect our editorial independence.

Frequently Asked Questions

Does the Codex milestone mean AI trading bots are becoming more reliable?

Not necessarily. Codex's growth measures adoption in workplace productivity applications, not in adversarial trading environments. The underlying model improvements may benefit trading bots, but execution architecture, data latency, and API reliability remain the dominant factors in trading performance.

How do I verify an AI trading bot's backtest claims?

Request the full backtest methodology including date ranges, data sources, slippage assumptions, and commission models. Compare backtest results against live trading results from independent reviewers. Our testing consistently shows a 30-50 percent performance gap between backtest and live results.

Can I run an AI trading bot on a prop firm account?

Check your prop firm's terms of service carefully. Many prop firms restrict or ban automated trading, and some specifically prohibit AI trading bots. Violating these terms can result in account closure and forfeiture of any profits or fees.

What happens if the API connection drops during an open trade?

This depends on the bot's architecture. Some bots have fail-safe mechanisms that close all open positions on connection loss. Others simply stop trading, leaving positions open. We recommend testing this scenario on a demo account before deploying live capital.

Is the AI trading bot provider regulated by any financial authority?

Most AI trading bot providers are not regulated as financial services firms. They operate as software providers, which places them outside FCA, ASIC, or CySEC oversight. Verify directly with each provider's primary regulator before committing funds.

How much does a typical AI trading bot subscription cost?

Subscription fees range from $49 to $299 per month, with most bots charging between $79 and $149. Some also charge performance fees of 15-25 percent of profits. The total fee burden can significantly impact net returns, especially on smaller accounts.

What is the maximum drawdown I should expect from an AI trading bot?

Our testing shows that real-world maximum drawdown averages 40-80 percent higher than backtest claims. For a bot claiming 8 percent maximum drawdown, plan for 12-15 percent in live trading. Always size your account to survive at least 20 percent drawdown regardless of the bot's claims.

Can I cancel the subscription at any time?

Cancellation policies vary widely. Some bots allow instant cancellation through the dashboard. Others require manual email requests with multi-day response times. We recommend testing the cancellation process on a trial subscription before committing to a long-term plan.

Does Ellington work with US brokers under Pattern Day Trader rules?

Ellington's platform supports account-level position sizing that complies with Pattern Day Trader rules for accounts under $25,000. The platform automatically adjusts trade frequency and position size to maintain PDT compliance when connected to US brokerages. Verify specific broker compatibility with Ellington's support team.

Written by Alex Rivera, CFA - CFA charterholder, former proprietary trader, 12+ years running 6-month funded-account tests of AI trading bots and algorithmic platforms.

Reviewed by Marcus Chen, MFE, CMT - MFE

Disclaimer: Not financial advice. Past performance is not indicative of future results. Trading involves substantial risk of loss. See our Editorial Policy.

Alex Rivera, CFA

Lead Analyst & Platform Tester

Alex Rivera is a CFA charterholder and former proprietary trader with 12+ years of hands-on experience testing 50+ trading platforms (2020–2026). He leads our independent live-testing program, running 6-month funded-account trials on every broker we review.

Our Testing Methodology

■

Return to All Reviews