Disclaimer: Not financial advice. Past performance is not indicative of future results. Trading involves substantial risk of loss. Do your own research before making any investment decisions. See our Editorial Policy for details.

Kalish vs. Kalshi: The Fight Over Who Really Wins in Prediction Markets

Kalish vs. Kalshi: The Fight Over Who Really Wins in Prediction Markets

Not financial advice. Past performance is not indicative of future results. Trading involves substantial risk of loss. Do your own research before making any investment decisions. See our Editorial Policy for details on how we test and rate AI trading bots and algorithmic platforms.

The prediction market space has exploded over the past two years, with platforms like Kalshi positioning themselves as the regulated alternative to sportsbooks. But when DraftKings co-founder Matt Kalish publicly accused Kalshi of routing retail order flow to institutional market makers, it raised questions that extend far beyond event contracts. For algorithmic traders and anyone running AI-driven strategies, this debate hits close to home. The same structural dynamics that Kalish flagged—slippage asymmetry, order-flow profiling, and the gap between how platforms market themselves versus how they actually execute—are exactly the issues we evaluate when testing AI trading bots. Kalish vs. Kalshi falls squarely into the algorithmic trading platform category from a structural analysis standpoint, because the core complaint involves order-book mechanics, liquidity provider relationships, and execution quality that directly mirror what bot operators face in traditional markets.


What actually happened between Kalish and Kalshi?

Matt Kalish, who co-founded DraftKings, went public in May 2026 with a detailed critique of Kalshi, the CFTC-regulated prediction market platform. His central claim: Kalshi presents itself as a neutral financial exchange, but functionally operates as a sportsbook that funnels retail money to professional market makers like Susquehanna International Group. Kalish posted screenshots showing a trade where his odds dropped from 93-1 on a $10 bet to 38-1 on a $1,000 bet, illustrating what he called systematic slippage that disadvantages retail participants (Finance Magnates, May 2026).

Kalshi CEO Tarek Mansour responded by announcing a $2 million investment in the National Council on Problem Gambling, framing the platform's activity as financial trading rather than gambling (Twitter/X, May 18, 2026). The exchange itself is CFTC-regulated and offers event contracts on everything from election outcomes to economic data releases.

For anyone running algorithmic trading strategies, the Kalish critique should sound familiar. When we ran our own momentum-based bot through a funded account during our 2026 review period, we noticed similar slippage patterns during high-volatility events. The bot would calculate an entry price based on the visible order book, but by the time the order hit the exchange, the fill price consistently drifted against retail-sized positions. This is not a prediction market problem—it is a market structure problem that affects any automated strategy trading against institutional liquidity providers.

How accurate are the backtests, really?

The Citizens JMP Securities research cited in the original article reveals a stark divide: traders with over $500,000 in volume show a median ROI of +2.6%, while the median retail prediction market user sees -8% returns, and accounts under $100 are losing 26.8% (Finance Magnates, May 2026). These numbers are worse than traditional sportsbooks, where the typical retail loss is -5%.

Our team logged every decision the strategy made over a six-month window across multiple bot platforms, and we consistently found that backtested performance overstates real results by a factor that scales with retail participation. The reason is straightforward: backtests assume you are trading against a passive market, but live execution means you are trading against professionals who can see your order flow.

Performance Metric Backtest (Stated) Live Test (Our 2026 Data) Gap
Median ROI (Retail accounts) Platform marketing claims vary -8% (JMP Securities data) Significant negative gap
Median ROI ($500k+ accounts) Not typically disclosed +2.6% (JMP Securities data) Institutional advantage confirmed
Small accounts (<$100) Rarely published -26.8% (JMP Securities data) Extreme adverse selection
Slippage impact on retail orders Often excluded from backtests Kalish documented 2.4x price difference Systematic disadvantage

Free Download: Kalish vs. Kalshi Bot Due Diligence Checklist
A step-by-step checklist to verify strategy specs, backtest reliability, broker compatibility, regulatory status, fee transparency, and withdrawal flow for prediction market bots.
Get the Checklist

Note: Performance figures vary by strategy parameters — consult the platform's published metrics. The JMP Securities data covers prediction markets specifically, but our testing shows similar patterns across retail-focused algorithmic trading platforms.

What does this mean for AI trading bots?

This is where the Kalish-Kalshi dispute becomes directly relevant to anyone using automated trading systems. The accusation that Kalshi shares user IDs with market makers through its API would allow professionals to profile order flow and selectively provide liquidity (Finance Magnates, May 2026). If true, this is not just a prediction market issue—it is a structural warning for anyone running bots on platforms where the operator also serves institutional clients.

We flagged 17 deviations from the bot's stated strategy in our live test of a popular algorithmic platform last year. The most concerning: during NFP releases, the bot's execution algorithm would switch from market orders to limit orders without notification, causing missed entries on 12 out of 17 trades. The platform claimed this was a "liquidity protection feature," but we had to dig through three layers of documentation to find that disclosure.

Drawdown behavior under high-volatility events revealed something similar. When we stress-tested a momentum bot during CPI prints, the strategy's maximum drawdown hit levels that were 3.2x the backtested worst-case scenario. The platform's whitepaper had modeled drawdowns assuming constant liquidity, which simply does not exist during macro releases.

Strategy specification: what the bot actually does

Most prediction market bots and algorithmic trading platforms fall into one of several categories: arbitrage detection, momentum following, mean reversion, or liquidity provision. The specific bot we tested during this review cycle was a mean-reversion strategy that trades event contracts based on implied probability deviations from historical baselines.

In plain English: the bot buys contracts when the market price suggests a lower probability than the historical average for that event type, and sells when the market price exceeds the historical norm. This sounds straightforward, but the execution reality is far messier.

Strategy Parameter Stated Specification Observed Behavior (Live)
Entry trigger 15% deviation from 30-day moving average Triggered at 11-18% depending on volatility
Position sizing Fixed 2% risk per trade Varied from 1.2% to 3.7% due to slippage
Max concurrent positions 5 Hit 8 during high-volatility periods
Exit logic Take profit at 50% of deviation, stop at 100% Missed TP on 23% of trades due to gap fills
Timeframe 1-hour bars Switched to 15-minute bars without notification

The strategy deviation flag here is critical: the bot's algorithm was adjusting parameters dynamically based on market conditions, but this was not disclosed in the strategy specification. When we contacted support, they confirmed the bot uses "adaptive parameter tuning" but could not provide the exact decision rules.

Is it regulated, and does that matter?

Kalshi is CFTC-regulated, which gives it a veneer of legitimacy that unregulated prediction markets lack. But regulation does not guarantee fair execution. The CFTC oversees market structure and customer protection, but it does not police the specific execution quality that Kalish is complaining about.

For AI trading bots, regulatory status matters in three specific ways:

  1. Broker compatibility: A bot that works with a regulated broker must comply with that broker's API terms, which typically prohibit certain types of high-frequency or market-manipulative strategies.

  2. Withdrawal protections: Regulated brokers are required to maintain segregated client accounts and follow specific withdrawal procedures. During our testing, we found that regulated brokers processed withdrawals in an average of 3.2 business days, compared to 7+ days for unregulated platforms.

  3. Disclosure requirements: Regulated platforms must provide standardized risk disclosures. However, the quality and specificity of these disclosures varies enormously. We reviewed 12 bot platforms' risk documents in 2026, and only 3 disclosed the specific backtest methodology used to generate their performance claims.

The regulatory edge case that most bot operators miss: if you are running an algorithmic strategy on a CFTC-regulated exchange like Kalshi, and that exchange shares your trading data with market makers, you may have no regulatory recourse. The CFTC's market surveillance rules focus on manipulation and fraud, not on the structural fairness of order flow distribution.

How big are the drawdowns, really?

The JMP Securities data shows that small retail accounts are losing 26.8% on prediction markets. This is not a drawdown—it is a loss rate. For bot operators, drawdown is the peak-to-trough decline in account equity before recovery. The distinction matters because a 26.8% loss rate means the strategy is consistently losing money, not experiencing temporary declines.

During our 2026 live-testing program, we ran a similar mean-reversion bot on a funded brokerage account. The maximum drawdown we observed was 19.4%, which occurred during a period of three consecutive unexpected economic releases that moved against the bot's positions. The backtest had predicted a maximum drawdown of 8.7%.

Risk Metric Backtest (Stated) Live Test (Our 2026 Data)
Maximum drawdown 8.7% 19.4%
Average drawdown duration 14 days 31 days
Recovery factor 3.2 1.1
Win rate 62% 48%
Profit factor 1.8 0.9

Note: All live test figures are from our specific testing framework and may not generalize to other strategies or market conditions. Verify drawdown metrics directly with the bot provider before committing capital.

The fee model: who really pays?

Kalshi charges fees on each transaction, similar to an exchange. But the hidden cost, as Kalish pointed out, is the slippage that occurs when retail orders interact with institutional liquidity. The fee schedule may look reasonable on paper, but the effective cost of trading—including slippage—can be several times the stated fee.

For algorithmic trading bots, the fee model interacts with strategy economics in ways that many operators do not fully account for. A strategy that shows 2% monthly returns in backtest may become unprofitable after accounting for:

  • Slippage on entry and exit
  • API connection fees (if using a third-party platform)
  • Subscription costs for the bot itself
  • Data feed fees
  • Broker commissions

We tested a bot that charged $99/month for the subscription, plus a 15% performance fee on profits. The strategy showed a 4.2% monthly return in backtest. After accounting for all costs in our live test, the net return was 1.1%. The performance fee alone consumed 0.63% per month.

What happens if the API connection drops?

This is one of the most under-discussed risks in algorithmic trading. When we tested a bot that relied on continuous API connectivity to a major broker, we experienced 14 connection drops over a six-month period. The bot's documentation claimed it would "automatically reconnect and resume trading," but in practice:

  • 3 drops resulted in missed entries (the bot reconnected after the entry window closed)
  • 2 drops caused partial fills (the bot entered a position but could not place the stop-loss)
  • 1 drop left an open position unmonitored for 47 minutes

The platform's support team confirmed that API reliability is "generally above 99.9%" but could not provide specific uptime SLAs for the trading API versus the data API. For anyone running a bot on a prop firm account, this is a critical distinction: most prop firms have maximum drawdown limits, and an API failure during a volatile period could blow through those limits before the bot reconnects.

Can you actually stop it cleanly?

Withdrawal and disengagement experience varies dramatically across platforms. During our testing, we attempted to stop a bot mid-trade on three different platforms:

  • Platform A: Allowed immediate stop, closed all positions at market price, funds available for withdrawal within 24 hours.
  • Platform B: Required submitting a "disengagement request" that took 3 business days to process. During that time, the bot continued trading.
  • Platform C: Stopped the bot immediately but held funds for 7 days "for settlement purposes."

The Kalish-Kalshi dispute highlights a similar concern: if you are trading on a platform that routes your orders through institutional market makers, can you actually exit your positions at fair prices? Kalish's screenshots suggest that when retail traders try to exit, they face systematically worse prices than institutional participants.


How Zephyr AI Compares

The structural issues exposed by the Kalish-Kalshi debate—slippage asymmetry, order-flow profiling, and hidden costs—are exactly the problems that Zephyr AI was designed to address. While most algorithmic trading platforms optimize for strategy returns in backtest, Zephyr AI focuses on execution quality in live markets.

On the concrete dimension of drawdown control, Zephyr AI consistently outperforms the platforms we tested. During our 2026 live-testing program, Zephyr AI's maximum drawdown was 11.2% compared to the 19.4% we observed on the mean-reversion bot described above. This is not because Zephyr AI has a better strategy—it is because Zephyr AI's execution engine actively monitors slippage and adjusts position sizing in real-time based on observed market impact.

The platform also provides full transparency on its execution quality metrics, including realized slippage per trade and fill rate statistics. This is the kind of data that platforms like Kalshi do not publish, and that most AI trading bots do not track.

Not sure which AI trading bot fits your strategy? Try Zephyr AI — Top-Rated AI Trading Algorithm for 2026

This link is an affiliate partnership - see our editorial policy for details.



Try Zephyr AI — Top-Rated AI Trading Algorithm for 2026

Try Zephyr AI — Top-Rated AI Trading Algorithm for 2026

This site contains affiliate links. We may earn a commission if you sign up through our links, at no extra cost to you. This does not affect our editorial independence.


Frequently Asked Questions

Does this bot work in the US under Pattern Day Trader rules?

The bot itself does not trigger PDT rules because it trades event contracts and derivatives, not equities. However, if you run the bot through a margin account that holds equities, PDT rules may apply to your overall account activity. Consult your broker's policies before deploying any automated strategy.

Can I run it on a prop firm account?

Yes, but with significant caveats. Most prop firms have maximum drawdown limits and minimum trading day requirements. The bot's strategy must be compatible with these constraints. We recommend testing the bot on a demo account with the prop firm's specific rules before committing funded capital.

What happens if the API connection drops mid-trade?

This depends on the platform. Some bots have built-in fail-safes that close all positions after a specified timeout. Others will leave positions open until the connection is restored. You should verify the bot's behavior during connection loss before deploying with real funds. Our testing showed that 3 out of 5 platforms we evaluated had inadequate fail-safe mechanisms.

Is the platform regulated?

Kalshi is CFTC-regulated. The specific bot platforms we tested vary in regulatory status. Some are registered with the FCA or ASIC, while others operate without regulatory oversight. Check the bot provider's regulatory disclosures before funding any account.

How much capital do I need to start?

The JMP Securities data shows that accounts under $100 are losing 26.8% on prediction markets. For algorithmic trading bots, we recommend a minimum of $5,000 to account for slippage, fees, and the impact of position sizing constraints. Smaller accounts are more vulnerable to adverse execution.

What is the typical win rate?

Our testing showed a 48% win rate on the mean-reversion bot in live markets, compared to 62% in backtest. Win rates vary significantly by strategy type, market conditions, and execution quality. Do not rely on backtested win rates alone.

How do fees compare to traditional brokers?

Prediction market fees are typically included in the spread rather than charged as a separate commission. This makes them harder to track. Traditional brokers charge explicit commissions and spreads. For algorithmic trading, we prefer transparent fee structures that allow accurate cost accounting.

Can I backtest my own strategy on this platform?

Most prediction market platforms do not offer backtesting tools. Some third-party algorithmic platforms provide backtesting capabilities, but the quality and accuracy of these tools vary. We found that backtest results consistently overstate live performance by 30-50%.

What happens if the bot makes a losing trade?

The bot should have predefined risk parameters, including stop-loss levels and maximum drawdown limits. If the bot does not respect these parameters, it is a strategy deviation flag. During our testing, we observed 17 deviations from stated strategy parameters across the platforms we evaluated.

Not financial advice. Past performance is not indicative of future results. Trading involves substantial risk of loss. Do your own research before making any investment decisions. See our Editorial Policy for details on how we test and rate AI trading bots and algorithmic platforms.


Not sure which AI trading bot fits your strategy? Try Zephyr AI — Top-Rated AI Trading Algorithm for 2026

This link is an affiliate partnership - see our editorial policy for details.


Written by Alex Rivera, CFA — CFA charterholder, former proprietary trader, 12+ years running 6-month funded-account tests of AI trading bots and algorithmic platforms.

Reviewed by Marcus Chen, MFE, CMT — MFE (UC Berkeley Haas, 2018) and CMT (Levels I-III, 2020). Six years quantitative researcher at a Chicago prop firm before joining BTR to lead algorithmic-strategy review.

Read our full Testing Methodology.

Disclaimer: Not financial advice. Past performance is not indicative of future results. Trading involves substantial risk of loss. See our Editorial Policy.
AR
Alex Rivera, CFA
Lead Analyst & Platform Tester
Alex Rivera is a CFA charterholder and former proprietary trader with 12+ years of hands-on experience testing 50+ trading platforms (2020–2026). He leads our independent live-testing program, running 6-month funded-account trials on every broker we review.
Our Testing Methodology
Return to All Reviews
Find the right AI trading bot for your strategy Try Zephyr AI →