Futures
Access hundreds of perpetual contracts
TradFi
Gold
One platform for global traditional assets
Options
Hot
Trade European-style vanilla options
Unified Account
Maximize your capital efficiency
Demo Trading
Introduction to Futures Trading
Learn the basics of futures trading
Futures Events
Join events to earn rewards
Demo Trading
Use virtual funds to practice risk-free trading
Launch
CandyDrop
Collect candies to earn airdrops
Launchpool
Quick staking, earn potential new tokens
HODLer Airdrop
Hold GT and get massive airdrops for free
Launchpad
Be early to the next big token project
Alpha Points
Trade on-chain assets and earn airdrops
Futures Points
Earn futures points and claim airdrop rewards
DeepSeek announced a new model MODEL1: A technical leap in one year
DeepSec has reached new heights in its technical advancements with a recent historic announcement. One year after the successful launch of DeepSec-R1 in early January, the company is preparing to introduce a new model, MODEL1. This news has emerged as a major development among industry experts and the tech community.
Technical Changes Revealed on GitHub
DeepSec indicated significant updates to its code by updating on GitHub. Among the changes, 28 mentions of “MODEL1” were found across 114 files, highlighting extensive efforts in developing the new model. These modifications in the Flash MLA code are particularly noteworthy and point toward new technical directions.
MODEL1 vs. V32: New Architecture
The current V32 version, known as DeepSec v3.2, will differ from the new structure of MODEL1. The key differences are especially prominent in three areas: improvements in KV Cache architecture, changes in quantization methods, and new techniques in FP8D encoding. All these modifications are designed to make the system more efficient.
Memory Savings and New Computing Achievements
A major advantage of MODEL1 is its improved memory usage during computation. Unique strategies have been employed to save memory across various processing stages. These changes will enhance the performance of DeepSec’s new model and reduce resource requirements, marking a significant breakthrough in the industry.