Overview

Cursor launched Composer 2, their "frontier-level" AI coding model, but was discovered to be built on top of China's open-source Kimmy K2.5 model without proper attribution. While Cursor's failure to credit the base model sparked controversy, the company did add substantial improvements through reinforcement learning and their own innovations.

Key Takeaways

  • Open source licenses with revenue thresholds require large companies to prominently disclose base models - creating accountability for high-revenue firms using community work
  • Building on open source foundations is legitimate when three-quarters of the compute goes toward genuine improvements like reinforcement learning and novel techniques
  • Self-summarization during long coding tasks allows AI models to compress 100,000+ tokens into essential context, enabling work beyond normal limits
  • Attribution matters for ecosystem health - proper crediting encourages continued open source development and builds community trust
  • Geopolitical sensitivities around Chinese AI models create disclosure dilemmas for US companies serving enterprise customers with security concerns

Topics Covered