DeepSeek, China's AI model: News & Discussion

So what will happen if I ask this AI for a critical analysis of the Ruling of CCP in China ?
The result is that artificial intelligence can easily come to a conclusion: China has eight political parties. India has more than 2,500 political parties, including 20-24 ruling parties, 30-40 opposition parties with seats, 26 major opposition alliances, and over 2,000 broader opposition parties. Analyzing China is easy, but when analyzing India, my computer froze.

According to the latest data from WDMMA, the total score of the Indian Air Force is 69.4, surpassing the Chinese Air Force's 63.8, successfully ranking it third on the global air force leaderboard. You can take a holiday to celebrate.

This proves that China is not as strong as India, which is your Indian pride.
 
The result is that artificial intelligence can easily come to a conclusion: China has eight political parties. India has more than 2,500 political parties, including 20-24 ruling parties, 30-40 opposition parties with seats, 26 major opposition alliances, and over 2,000 broader opposition parties. Analyzing China is easy, but when analyzing India, my computer froze.

According to the latest data from WDMMA, the total score of the Indian Air Force is 69.4, surpassing the Chinese Air Force's 63.8, successfully ranking it third on the global air force leaderboard. You can take a holiday to celebrate.

This proves that China is not as strong as India, which is your Indian pride.
Eight political parties !! Wow !
 

Alibaba’s Qwen family hits 700 million downloads to lead global open-source AI adoption​

Alibaba Cloud’s Qwen family is the world’s most widely used open-source AI system as of January, Hugging Face data shows

Published: 3:01pm, 12 Jan 2026

Alibaba Cloud’s flagship Qwen family of artificial intelligence models surpassed 700 million downloads as of January on the developer platform Hugging Face, which made it the world’s most widely used open-source AI system.

According to research published on Friday by consultancy AIBase, citing data from Hugging Face, estimated Qwen downloads as of December alone exceeded the combined total of the next eight most popular models on the platform globally, including those from Meta Platforms and OpenAI, as well as Chinese firms Zhipu AI, Moonshot AI and MiniMax.

Alibaba Cloud is the AI and cloud computing services unit of Alibaba Group Holding, owner of the South China Morning Post.

The volume of Qwen downloads on Hugging Face reinforced its position as the world’s leading open-source ecosystem.

“Tens of thousands of real-world applications around the globe have been built based on Qwen, marking a historic peak for [Chinese] open-source large models within the international developer community,” the AIBase report said.

According to research published in October, Alibaba’s Cloud’s Qwen2.5-1.5B-Instruct, a lightweight variant, was the most downloaded AI model on Hugging Face.
 

Deepseek research touts memory breakthrough, decoupling compute power and RAM pools to bypass GPU & HBM constraints — Engram conditional memory module commits static knowledge to system RAM​


published 11 hours ago

Deepseek paper seeks to reduce reliance on GPU compute for basic information retrieval

DeepSeek has released a new technical paper, which details a new method for how new AI models might rely on a queryable database of information committed to system memory. Named "Engram", the conditional memory-based technique achieves demonstrably higher performance in long-context queries by committing sequences of data to static memory. This eases the reliance on reasoning for AI models, allowing the GPUs to only handle more complex tasks, increasing performance, and reducing the reliance on high-bandwidth memory (HBM).

The paper details how N-grams, statistical sequences of words, are integrated into the model's neural networks, allowing them to be placed into a queryable memory bank. Engram allows models to remember facts, rather than having to reason them out, which is more computationally expensive.

Released on the company's GitHub page, Engram hopes to address how the company might be able to curb the reliance on more complex memory types and instead commit a knowledge library to a more common system memory standard, such as CXL.

Reducing the reliance on HBM​

The ongoing reliance on high-bandwidth memory for AI accelerators is something that even Chinese silicon, such as Huawei's Ascend series, cannot escape. Each stack of HBM uses more memory dies, and with demand skyrocketing, easing any AI model's reliance on the GPU's direct high-bandwidth memory would be significant, especially considering the ongoing memory supply squeeze.

Engram would enable static memory to be held separately from an LLM's compute power, allowing the GPU's rapid HBM to dedicate itself to reasoning, therefore enabling more performant Engram-based AI models, compared to a standard Mixture of Experts (MoE) model.

As detailed in the paper, an Engram-based model scaled to nearly 27 billion parameters can beat out a standard MoE model in long-context training and eliminates computational waste generated by having to reason out facts, by allowing them to be externally stored.

A standard MoE model might have to reconstruct these pieces of data every time it's referenced in a query, which is called conditional computation. The model will then call on its expert parameters to assemble and reason the data every time, even when it only focuses the query on certain parts or experts, named sparse computation.

Deepseek Paper shjowing Engram in training and inference

How Engram embeds itself into training and inference workloads (Image credit: Deepseek)

The Engram paper adds that placing conditional memory would allow the model to merely ask: "Do I already have this data?", rather than having to access the parts of the model that deal with reasoning.

"This process essentially amounts to an expensive runtime reconstruction of a static lookup table, wasting valuable sequential depth on trivial operations that could otherwise be allocated to higher-level reasoning," the paper reads.

How Engram is different to KVCache​

Engram takes static patterns and lists its knowledge index into a parsable piece of conditional memory with a store of information, relieving the AI model from the burden of having to reason through context repeatedly. While Nvidia's KVCache, announced at CES 2026, offloads context data to NVMe memory with BlueField-4, this acts as more of a short-term solution, allowing the model to remember things that you have recently said or added within context, and is, for all intents and purposes, disposable after you move on to the next query or conversation.

KVCache, while persistent within the history of your conversations or queries, does not draw on an existing base of pre-calculated data, and is not persistent in the same way that Engram-based LLMs could be, if the paper is to be believed. To put it simply, KVCache can be likened to storing your handwritten notes, whereas Engram is a record of the whole encyclopedia.

Hashing and gating​

This is enabled through tokenizer compression, which compresses equivalent tokens (such as the same word with different forms of capitalization) as the same, canonical concept. This allowed Deepseek to reduce the vocabulary size for the conditional memory module by 23%, and allows for rapid parsing of information in context.

As there is an impossibly large number of phrases or combinations of words within a certain context, they employ a methodology named Hashing, which allows the model to apply a number to a series of words. Engram adds to this, with what it calls Multi-Head Hashing, where you can put several hashes onto multiple numbers, for that single phrase to avoid erroneously adding the wrong context. For example, Universal might be a single entry, distinct from Universal Studios, with Multi-Head Hashing employed to ensure no mistakes or database errors.

This is then passed on to Engram's context-aware gating, which then confirms that the term matches the context of the sentence it's being used in, before being deployed into an output.

The perfect allocation ratio​

Deepseek Engram Allocation ratio

(Image credit: Deepseek)

To examine how Engram-based LLMs might work in large-scale deployments, Deepseek detailed how it might achieve the best allocation between embeddings of Engram and MoE parameters within an AI model.

The outcome was a U-curve, which proved that memory and compute (or reasoning) can be considered mathematically distinct forms of intelligence within AI models. This resulted in a sweetspot for MoE and Engram embeddings.

"Remarkably, the Engram model achieves comparable performance to the pure MoE baseline (𝜌 = 100%) even when the MoE allocation is reduced to just 𝜌 ≈ 40% (i.e., a total of 46 experts for the 5.7B model and 43 experts for the 9.9B model). Furthermore, the pure MoE baseline proves suboptimal: reallocating roughly 20%–25% of the sparse parameter budget to Engram yields the best performance."

Deepseek itself remarks on how both Engram-dominated and MoE-dominated models falter, whereas a ratio that yields 20-25% of the overall parameter budget of the model to Engram achieves the best results.

What if Engram's memory was infinite?​

Deepseek ran another experiment in parallel, which it names the "Infinite Memory Regime." This effectively keeps the computational budget fixed, so the model doesn't get more expensive to run, and attaches a near infinite number of conditional memory parameters to be deployed using Engram.

What they found was that since Engram is distinct from the overall compute budget (since it's effectively a long-term storage bank, which taps into the overall model), Deepseek discovered that performance scales linearly with memory size. Meaning that if a model continued to add to its conditional memory banks, its performance would only continue to improve, without having to increase the overall compute budget.

This could have significant implications for the wider AI industry if performance and results are not singularly bound by compute, but to long-term "Engram" memory banks. If the performance benefits are indeed as good as the paper outlines, the memory squeeze would no longer be singularly based on the deployment of HBM, but all forms of memory that could be deployed within data centers, either through CXL or other methods of interconnection.

The results speak for themselves​

Deepseek deployed an Engram-27B parameter model and a standard 27B MoE model in parallel to determine the performance benefits of computational memory within AI models, and the results were exemplary. Within knowledge-intensive tasks, Engram was 3.4 to 4 points better than its MoE equivalent, and it was even better at reasoning, with a 3.7 to 5 point uplift when compared to its MoE "reasoning-only" sibling. Similar results were also achieved in coding and mathematics-based tests.

However, the big win for Engram was in long-context tasks, increasing accuracy within the NIAH (Needle in a Haystack) benchmark to 97%, which is a leap from the MoE model's score of 84.2%. This is a large difference in reliability between the models, and could point toward AI's long-context and coherence issues eventually becoming a thing of the past, if Engram were to be deployed in a commercial AI model, especially if the demands for long-context AI queries increase.

Will Deepseek V4 be based on Engram?​

Engram has significant implications for the AI industry, especially as the paper details how this specific methodology is no longer bound by HBM, but instead longer-term storage. System DRAM can now be utilized to significantly improve the quality of Engram-based LLM outputs, meaning that the much more expensive HBM will only be used for computationally heavy queries.

Of course, if Engram were to take off, it may worsen the ongoing DRAM supply crisis, as AI hyperscalers adopting the methodology would then flock to system DRAM, instead of solely focusing on putting all of their memory ICs in production into HBM for GPUs.

"We envision conditional memory functions as an indispensable modeling primitive for next-generation sparse models," Deepseek said, hinting at a possible V4 deploying Engram in a new AI model. With the company rumored to announce a new AI model within the next few weeks, don't be surprised if it implements Engram within it.

While the results are impressive on paper, Engram's impact has yet to be determined in real-world deployment. But, if everything the paper says holds in a real-world context, the company could be onto a new 'Deepseek moment.'


DeepSeek’s conditional memory fixes silent LLM waste: GPU cycles lost to static lookups​



DeepSeek stays mum on next AI model release as technical papers show frontier innovation​

 

China’s Zhipu Unveils New AI Model Trained on Huawei’s Chips

January 14, 2026 at 12:12 PM GMT+8

Chinese AI startup Knowledge Atlas Technology JSC Ltd. released a new multimodal model that it says is the country’s first to be fully trained using domestic chips, in a sign of progress toward meeting Beijing’s goal of reducing reliance on American technology.

The company, also known as Zhipu, said Wednesday in a statement that its open-source image generation model, GLM-Image, is the first state-of-the-art multimodal model to complete training using Huawei Technologies Co.’s Ascend chips.

Zhipu’s model was based on Huawei’s Ascend Atlas 800T A2 server and MindSpore framework, according to the statement. The server carries Huawei’s Kunpeng-branded processors and Ascend-branded AI chips.

“This proves the feasibility of training high-performance multimodal generative models on a domestically developed full-stack computing platform,” Zhipu said.

Zhipu’s move aligns with Beijing’s policy to cut dependence on foreign expertise in light of the yearslong campaign by the US and its allies to limit China’s access to advanced technologies from the West. Hours before the Beijing firm touted its partnership with Huawei on Wednesday, the US moved closer to allowing Nvidia Corp. to sell its H200 artificial intelligence chips to China by issuing revised criteria for winning Washington’s approval to ship the processors to Chinese buyers.

Last Thursday, Zhipu became the first of China’s major AI startups to go public. Since then, its shares have jumped more than 80% as investors pile in on enthusiasm about China’s AI industry.


Chinese ‘AI tiger’ Zhipu unveils new model trained entirely on Huawei chips​

 

The newly unveiled STEP3-VL-10B model from StepFun AI is redefining the capabilities of sub-10 billion parameter models. Despite its relatively small size, the model delivers performance on par with much larger counterparts like GLM-4.6V-106B and Qwen3-VL-235B.

It achieved impressive benchmark scores, including 94.43% on AIME2025 for mathematical reasoning and 80.11% on MMMU for expert multimodal understanding. STEP3-VL-10B incorporates a novel technique known as Parallel Coordinated Reasoning (PaCoRe) to enhance test-time computation. The model's development involved a rigorous post-training process with over 1,000 iterations of Reinforcement Learning.

It was trained on 1.2 trillion multimodal tokens, emphasizing critical areas such as K-12 education, OCR, GUI, and mathematical reasoning.

1768911278038.png



Key Points:
  • Small Parameters Outperforming Larger Models: Step3-VL-10B challenges and surpasses 200B-scale models with 10B parameters, achieving an optimal leverage ratio of performance and scale.
  • Deep Logic and Perception: Introducing the PaCoRe mechanism and large-scale reinforcement learning, it reaches world-class levels in competition-level mathematics, complex GUI perception, and 3D spatial reasoning.
  • Edge Intelligence Deployment: Supports high-performance multimodal capabilities on low-computing-power devices, providing a strong foundation for "active understanding and interaction" in smartphones and industrial embedded devices.
 

Alibaba’s Qwen (Tongyi Qianwen) open-source large language model family has reached major milestones, according to the latest data from Hugging Face, the world’s largest AI open-source platform.

The number of derivative models built on Qwen has exceeded 200,000, making it the first open-source foundation model to reach this scale globally. In parallel, cumulative downloads of Qwen models have surpassed 1 billion, with an average of approximately 1.1 million downloads per day.

These figures place Qwen ahead of Meta’s Llama series, firmly establishing it as the most downloaded open-source large model worldwide. The milestone highlights Qwen’s growing influence and adoption across the global developer and research communities.
 

Qwen3-Max Thinking beats Gemini 3 Pro and GPT-5.2 on Humanity's Last Exam (with search)​

January 26, 2026

Chinese AI and tech firms continue to impress with their development of cutting-edge, state-of-the-art AI language models.

Today, the one drawing eyeballs is Alibaba Cloud's Qwen Team of AI researchers and its unveiling of a new proprietary language reasoning model, Qwen3-Max-Thinking.

You may recall, as VentureBeat covered last year, that Qwen has made a name for itself in the fast-moving global AI marketplace by shipping a variety of powerful, open source models in various modalities, from text to image to spoken audio. The company even earned an endorsement from U.S. tech lodgings giant Airbnb, whose CEO and co-founder Brian Chesky said the company was relying on Qwen's free, open source models as a more affordable alternative to U.S. offerings like those of OpenAI.

Now, with the proprietary Qwen3-Max-Thinking, the Qwen Team is aiming to match and, in some cases, outpace the reasoning capabilities of GPT-5.2 and Gemini 3 Pro through architectural efficiency and agentic autonomy.

The release comes at a critical juncture. Western labs have largely defined the "reasoning" category (often dubbed "System 2" logic), but Qwen’s latest benchmarks suggest the gap has closed.

In addition, the company's relatively affordable API pricing strategy aggressively targets enterprise adoption. However, as it is a Chinese model, some U.S. firms with strict national security requirements and considerations may be wary of adopting it.

The Architecture: "Test-Time Scaling" Redefined

The core innovation driving Qwen3-Max-Thinking is a departure from standard inference methods. While most models generate tokens linearly, Qwen3 utilizes a "heavy mode" driven by a technique known as "Test-time scaling."

In simple terms, this technique allows the model to trade compute for intelligence. But unlike naive "best-of-N" sampling—where a model might generate 100 answers and pick the best one — Qwen3-Max-Thinking employs an experience-cumulative, multi-round strategy.

This approach mimics human problem-solving. When the model encounters a complex query, it doesn't just guess; it engages in iterative self-reflection. It uses a proprietary "take-experience" mechanism to distill insights from previous reasoning steps. This allows the model to:
  1. Identify Dead Ends: Recognize when a line of reasoning is failing without needing to fully traverse it.
  2. Focus Compute: Redirect processing power toward "unresolved uncertainties" rather than re-deriving known conclusions.
The efficiency gains are tangible. By avoiding redundant reasoning, the model integrates richer historical context into the same window. The Qwen team reports that this method drove massive performance jumps without exploding token costs:
  • GPQA (PhD-level science): Scores improved from 90.3 to 92.8.
  • LiveCodeBench v6: Performance jumped from 88.0 to 91.4.

Beyond Pure Thought: Adaptive Tooling

While "thinking" models are powerful, they have historically been siloed — great at math, but poor at browsing the web or running code. Qwen3-Max-Thinking bridges this gap by effectively integrating "thinking and non-thinking modes".

The model features adaptive tool-use capabilities, meaning it autonomously selects the right tool for the job without manual user prompting. It can seamlessly toggle between:

  • Web Search & Extraction: For real-time factual queries.
  • Memory: To store and recall user-specific context.
  • Code Interpreter: To write and execute Python snippets for computational tasks.
In "Thinking Mode," the model supports these tools simultaneously. This capability is critical for enterprise applications where a model might need to verify a fact (Search), calculate a projection (Code Interpreter), and then reason about the strategic implication (Thinking) all in one turn.

Empirically, the team notes that this combination "effectively mitigates hallucinations," as the model can ground its reasoning in verifiable external data rather than relying solely on its training weights.

Benchmark Analysis: The Data Story

Qwen is not shy about direct comparisons.

On HMMT Feb 25, a rigorous reasoning benchmark, Qwen3-Max-Thinking scored 98.0, edging out Gemini 3 Pro (97.5) and significantly leading DeepSeek V3.2 (92.5).

However, the most significant signal for developers is arguably Agentic Search. On "Humanity's Last Exam" (HLE) — the benchmark that measures performance on 3,000 "Google-proof" graduate-level questions across math, science, computer science, humanities and engineering — Qwen3-Max-Thinking, equipped with web search tools, scored 49.8, beating both Gemini 3 Pro (45.8) and GPT-5.2-Thinking (45.5) .

image

Qwen3-Max key benchmarks. Credit: Alibaba Cloud Qwen Team on X

This suggests that Qwen3-Max-Thinking’s architecture is uniquely suited for complex, multi-step agentic workflows where external data retrieval is necessary.

In coding tasks, the model also shines. On Arena-Hard v2, it posted a score of 90.2, leaving competitors like Claude-Opus-4.5 (76.7) far behind.

 

Kimi K2.5: Visual Agentic Intelligence

Today, we are introducing Kimi K2.5, the most powerful open-source model to date.

Kimi K2.5 builds on Kimi K2 with continued pretraining over approximately 15T mixed visual and text tokens. Built as a native multimodal model, K2.5 delivers state-of-the-art coding and vision capabilities and a self-directed agent swarm paradigm.

For complex tasks, Kimi K2.5 can self-direct an agent swarm with up to 100 sub-agents, executing parallel workflows across up to 1,500 tool calls. Compared with a single-agent setup, this reduces execution time by up to 4.5x. The agent swarm is automatically created and orchestrated by Kimi K2.5 without any predefined subagents or workflow.

Kimi K2.5 is available via Kimi.com, the Kimi App, the API, and Kimi Code. Kimi.com & Kimi App now supports 4 modes: K2.5 Instant, K2.5 Thinking, K2.5 Agent, and K2.5 Agent Swarm (Beta). Agent Swarm is currently in beta on Kimi.com, with free credits available for high-tier paid users.

1769498921377.png


 

DeepSeek taps Alibaba open-source AI technology to boost OCR performance

The Chinese AI start-up says its latest OCR model delivers stronger performance after adopting an Alibaba-developed open-source model

Published: 10:00pm, 27 Jan 2026Updated: 11:57pm, 27 Jan 2026

Chinese artificial intelligence start-up DeepSeek on Tuesday unveiled an upgraded version of its optical character recognition (OCR) model, incorporating an Alibaba Cloud-developed open-source system to boost performance.

The new model, DeepSeek-OCR 2, replaced a key component of its original architecture with Alibaba Cloud’s lightweight Qwen2-0.5b model, according to a research paper released by the company.

The update, which comes just over three months after DeepSeek launched the first version of its OCR system, underscores the growing role of China’s open-source ecosystem in advancing domestic AI development.

Alibaba Cloud is the artificial intelligence and cloud computing arm of Alibaba Group Holding, which owns the Post.

In the original model, DeepSeek relied on Contrastive Language Image Pre-training (CLIP), a neural network framework developed by Microsoft-backed OpenAI in 2021 that links images with text descriptions.

In OCR applications, CLIP helps systems identify and interpret text embedded in images.

DeepSeek said that replacing CLIP with Alibaba’s Qwen2-0.5b enabled its OCR model to process documents in a way that mimicked how humans read, following “flexible yet semantically coherent scanning patterns driven by inherent logical structures”, according to the research.

Benchmark tests showed the updated model delivered a 3.73 per cent performance improvement over its predecessor, which the company described as a meaningful gain on an already high accuracy base, it said.

DeepSeek has open-sourced DeepSeek-OCR 2 on Hugging Face, a widely used open-source AI developer platform.

The collaboration highlights how Chinese AI developers are increasingly drawing on one another’s open-source innovations to accelerate progress.

Last year, Beijing-based start-up Moonshot AI launched its Kimi K2 system, which borrowed elements from DeepSeek’s V3 architecture while introducing significant redesigns, according to a company researcher.

That launch reverberated through the global tech community, with some experts calling it another “DeepSeek moment” – a reference to the surprise impact of DeepSeek’s V3 and R1 model releases in early 2025.

DeepSeek’s latest OCR update follows fresh academic scrutiny of its original approach. Researchers from China and Japan recently challenged the initial DeepSeek-OCR research, arguing that the model showed inconsistent performance under certain conditions.

Their study found that the original system’s accuracy in visual question-answering tasks could fall to about 20 per cent when exposed to additional text intended to influence its reasoning, compared with roughly 90 per cent accuracy for standard AI models.

DeepSeek said in the Tuesday research it would continue refining its OCR architecture for broader applications, while pushing “towards a more comprehensive vision of multimodal intelligence”.
 
Alibaba-developed AI processor on par with Nvidia’s H20 chip, CCTV report shows
The broadcast offers fresh evidence that Chinese developers are designing advanced chips that could replace imports

Published: 9:00pm, 17 Sep 2025

Alibaba Group Holding’s semiconductor design unit, T-Head, has developed an artificial intelligence chip with capabilities that are on par with Nvidia’s H20 graphics processing unit (GPU), according to a report by state broadcaster China Central Television (CCTV).

The report, which aired on Tuesday, showed T-Head’s PPU, an application-specific integrated circuit, being compared with Nvidia’s H20 and A800 GPUs in a performance benchmark during Premier Li Qiang’s visit to a data centre operated by China Unicom in northwestern Qinghai province.

Li was briefed by China Unicom on the use of mainland-developed chips in the telecommunications network operator’s infrastructure.

This marked the first time that Alibaba’s proficiency in semiconductor design was highlighted in a state broadcast, which offered fresh evidence that Chinese developers are designing advanced chips that could replace imports like Nvidia’s GPUs. Alibaba owns the South China Morning Post.

Alibaba’s Hong Kong-listed shares closed 5.28 per cent higher at HK$161.60 on Wednesday, as the CCTV report seized the market’s attention.

A China Central Television report on Tuesday shows Alibaba Group Holding’s PPU, an AI chip developed by the company’s T-Head semiconductor design unit, being compared to Nvidia’s H20 and A800 processors in a performance benchmark during Premier Li Qiang’s visit to China Unicom’s data centre in northwestern Qinghai province. Photo: CCTV

A China Central Television report on Tuesday shows Alibaba Group Holding’s PPU, an AI chip developed by the company’s T-Head semiconductor design unit, being compared to Nvidia’s H20 and A800 processors in a performance benchmark during Premier Li Qiang’s visit to China Unicom’s data centre in northwestern Qinghai province. Photo: CCTV

The footage aired by CCTV showed a chart that compared a number of locally designed AI accelerators with Nvidia’s two GPUs, which were tailored for China to comply with US tech export restrictions.

T-Head’s PPU card, which had 96 gigabytes of high-bandwidth memory per unit, matched Nvidia’s H20 and surpassed Huawei TechnologiesAscend 910B, according to the CCTV footage. Alibaba’s PPU card also featured chip-to-chip bandwidth of 700 gigabytes per second, high-speed Peripheral Component Interconnect Express standard, which connects hardware components within a computer; and a 400-watt power consumption, which was lower than what the H20 needed.

Another chart shown on the broadcast listed China Unicom’s contracts with four domestic chip providers, which totalled 22,832 cards that provided 3,579 petaflops, a measure of computing speed. A petaflop is equal to 1,000 trillion calculations per second.

T-Head’s PPU cards accounted for 16,384 units, which provided 1,945 petaflops.

The other suppliers to China Unicom were MetaX, a GPU start-up that collaborates with the Chinese Academy of Sciences; Biren Technology, a Shanghai-based AI chip designer; and Zhonghao Xinying Technology, founded by former Google tensor processing unit engineer Yanggong Yifan.

Other chip suppliers mentioned in the CCTV report as potential partners included Tecorigin, a company based in Wuxi in eastern Jiangsu province, as well as Moore Threads Technology and Tencent Holdings-backed Enflame.

China’s growing number of mainland AI chip suppliers gave credence to Alibaba CEO Eddie Wu Yongming’s assurance during the firm’s earnings call last month that the company had prepared “backup plans” to secure AI chip supplies amid US restrictions and heightened geopolitical tensions.

China’s demand for high-performance computing has grown rapidly alongside the expansion of AI development projects. This has prompted local governments and telecoms network operators to invest in large-scale data centres for AI-related projects.

Li visited China Unicom’s 2.77 billion yuan (US$389 million) Sanjiangyuan Green Energy Intelligent Computing Centre, which broke ground in August 2024. It covers an area of 5.3 hectares, where the facility will be built in four phases and provide a total capacity of more than 20,000 petaflops when completed, according to a report by the local Qinghai Daily.


Alibaba AI chip push hits 100,000 mark, beating local rival Cambricon: sources
The milestone underscores Alibaba’s growing footprint in AI hardware, as Chinese firms accelerate efforts to build home-grown processors

Published: 7:24pm, 30 Jan 2026

Alibaba Group Holding has delivered more than 100,000 units of its most advanced artificial intelligence chip, the Zhenwu 810E, whose performance is said to be comparable to Nvidia’s H20, according to people familiar with the matter – a milestone that highlights the company’s growing role in China’s race to build domestic alternatives to US processors.

The shipments, handled by Alibaba’s semiconductor arm T-Head, have already surpassed those of domestic rival Cambricon Technologies, the sources said, declining to be named because the information is not public. Alibaba owns the South China Morning Post.

The Zhenwu 810E is a parallel processing unit – an application-specific integrated circuit designed for both AI training and inference – and represents Alibaba’s most advanced in-house AI chip to date.

Alibaba and Cambricon did not immediately respond to requests for comment on Friday.

The move underscores how Alibaba is carving out a larger share of China’s fast-growing AI chip market, as domestic players push to reduce reliance on Nvidia amid strong demand for AI computing power and ongoing uncertainty around US export controls.

The shipment update comes days after Bloomberg reported that Alibaba was preparing a potential listing of T-Head, as investor interest in China’s semiconductor sector gathers pace.
 
Last edited:

First in History! Alibaba Qwen Builds an Official Large Model, Milan Winter Olympics Enter the AI Era​

Published in Latest AI News
Feb 5, 2026

On February 5, 2026, as the Milan Winter Olympics approached, International Olympic Committee (IOC) President Kirsty Coventry officially announced at the International Broadcast Center that the IOC had developed the first official large model in Olympic history based on Qwen Large Model. Coventry stated that thanks to the technical support of Qwen, this Winter Olympics will be the "smartest" in history.

1770272850587.png
Image | National Olympic Committee AI Assistant

Two-way Implementation: Reshaping Event Management and Viewing Experience​

This official large model has been launched simultaneously on both professional event management and public service sides:

  • Event Management Side: The "National Olympic Committee AI Assistant" for staff from various countries has been launched, relying on Qwen's powerful multilingual understanding capabilities to read millions of words of official manuals. Delegation members can simply ask questions in their native language to accurately resolve complex issues ranging from qualification review to logistics scheduling.
  • Public Side: The "Olympic AI Assistant" was officially launched on the official website (Olympics.com), providing real-time answers to competition rules and Olympic history for global audiences, bringing the public closer to winter sports.

Visuals and Broadcasting: Hard-Hitting Applications of AIGC and 3D Reconstruction​

In the content production process, an automated media description system based on Qwen-VL has been put into operation, capable of identifying key events such as goals and fouls in real time. At the same time, the Milan Winter Olympics organizing committee used the Aliyun Wanxiang Large Model to efficiently create multimedia promotional materials, achieving large-scale application of AIGC in the Winter Olympics.

To address technical challenges such as monotonous textures in snowy scenes and many visual blind spots, Alibaba Cloud's multi-model fusion algorithm successfully achieved high-precision scene reconstruction. This technology has covered 10 core venues, including alpine skiing and ice hockey. Global audiences will clearly capture athletes' rapid flips in the air through **"bullet time" and "time slice"** effects.

Green Foundation: Cross-Regional Collaboration and Sustainable Development​

As the Winter Olympics with the widest distribution of venues, Alibaba Cloud not only supported the transportation management system in the Alps region but also supported the Milan Winter Olympics' sustainable development goals through digital means, including the **"Energy Saving Treasure"** and a newly added "Energy Issue Tracking System".

Coventry emphasized that since the Tokyo, Beijing, and Paris Olympics, Alibaba has always been a "transformative partner." She summarized: "The legacy of the Milan Winter Olympics will be intelligence. This AI capability will reshape the future of the Olympic movement."
 

February 12, 2026

ByteDance Unleashes Seedance 2.0 Shaking Hollywood

The viral launch of Seedance 2.0 by ByteDance promises faster, cheaper video production but sparks fierce debate over copyright, ethics, and the future of filmmaking.

Key Points

  • ByteDance launched Seedance 2.0 on February 10, 2026, as a generative AI video model capable of producing short cinematic clips with advanced director-level controls.
  • Seedance 2.0 is being adopted for advertising, ecommerce, and film production, promising faster, cheaper video creation and challenging traditional studio workflows.
  • The model raises ethical concerns over data transparency and copyright infringement due to unclear training data sources and resemblance to existing content.
The global race to dominate artificial intelligence has just taken a cinematic turn, and all eyes are on China’s ByteDance as it unveils Seedance 2.0—a generative AI video model that’s already sparking both awe and anxiety across the tech and creative industries. In just days since its limited beta release, Seedance 2.0 has gone viral, drawing comparisons to previous Chinese AI breakthroughs and igniting fierce debates over the future of filmmaking, advertising, and intellectual property.

Seedance 2.0 was officially launched by ByteDance, the parent company of TikTok, on February 10, 2026, as a limited beta on the Jimeng AI platform, according to India Today. This new model is not just another text-to-video tool; it’s a sophisticated engine built on ByteDance’s Seedream 5.0 architecture, capable of generating short cinematic clips—typically between 4 and 15 seconds—using a combination of text prompts, images, video references, and even audio inputs. Unlike its predecessors, Seedance 2.0 boasts director-level controls, offering users unprecedented command over motion, lighting, framing, and character consistency. The result? Videos that look and feel strikingly close to real studio productions.

Clips generated with Seedance 2.0 have flooded X (formerly Twitter) and other social platforms, with users marveling at the tool’s ability to produce everything from Kanye West-style music videos with dramatic lighting to outlandish action sequences like "Will Smith fighting a giant spaghetti monster" or "Godzilla versus a tiny cat." The buzz is so intense that many are calling Seedance 2.0 a "Hollywood killer," suggesting that the traditional film industry may need to brace itself for a new era of AI-driven content creation. One user summed up the sentiment by posting, "Tom Cruise v Brad Pitt created using Seedance 2.0. The character consistency, accuracy and details are too real. Hollywood is cooked."

But what’s really driving the excitement isn’t just the wow factor of the visuals. According to The China Project, Seedance 2.0 is being positioned not as a toy, but as a powerful production tool for advertising, ecommerce, and film. The promise is clear: faster, cheaper video production at a quality level that could upend existing workflows. Marketers, in particular, are eyeing the potential to test more creative variations and refresh campaigns more often—an advantage that could reshape the economics of digital advertising. As the volume of high-end video content rises, competition for viewers' attention and ad dollars is expected to intensify, putting pressure on traditional studios and agencies that rely on slower, more expensive production cycles.

Seedance 2.0 is already being integrated into various AI platforms, including Dreamina—a software suite similar to CapCut—as well as other video-generator tools, according to No Film School. ByteDance’s move is seen as a direct escalation in the AI arms race, with industry watchers framing it as China’s next breakout moment after the DeepSeek models made waves in 2025. The shift from text-based chatbots to generative media stacks—encompassing models, editing tools, and distribution platforms—signals a broader transformation in how digital content is conceived, produced, and consumed. As these tools spread globally, breakthroughs in one market can rapidly reset expectations everywhere.

Yet, for all the technological marvel, Seedance 2.0 is raising a host of practical and ethical questions that are impossible to ignore. The first and most glaring issue is data transparency. ByteDance, as No Film School points out, does not have a strong track record of clarifying what data is used to train its models. This opacity has fueled concerns that user data from apps like TikTok—or content shared online more broadly—could be feeding the AI’s learning process without adequate consent or oversight.

Copyright, too, looms large in the debate. Early Seedance 2.0 videos have been observed closely resembling well-known icons, characters, and intellectual property, often without any apparent partnerships or licensing agreements. As Justin Moore, a partner at a major U.S. venture capital firm, commented on X, "We are getting absolutely mogged by China on video models because they don’t care about copyright. And Seedance 2 isn’t even broadly available in the US yet. It’s going to dominate all your feeds with the real release later this month. Things to think about." The risk here is clear: as AI-generated content becomes indistinguishable from real footage, the potential for copyright infringement, unauthorized likenesses, and deepfakes grows exponentially.

The anxiety isn’t confined to legal circles. Filmmakers and creators are grappling with what some describe as a "perpetual existential crisis," unsure whether to embrace these technologies or fear for their livelihoods. While many in the creative community are open to new tools that can enhance their craft, the lack of clear answers about data usage, consent, and fair compensation is fueling skepticism and, at times, outright resistance. As No Film School notes, "New AI video models, or AI slop models, whatever you want to call them, like Seedance 2.0, are not providing answers and stability, and generally only making the debates surrounding the ethics and usability of AI louder and more severe."

Despite the storm of controversy, the momentum behind Seedance 2.0 is undeniable. Its demos showcase sharp visuals, surprisingly coherent narratives, and a level of polish that has outpaced rival models like Kling, Veo, and Sora. Tech analysts argue that if the quality holds up at scale, the competitive edge will shift decisively toward those who can iterate creative content the fastest—potentially redrawing the map for global media production.

The broader implications are profound. As the AI race pivots from words to worlds, the ability to generate high-quality video on demand could democratize filmmaking and advertising, making it accessible to a wider range of creators and brands. At the same time, it threatens to disrupt established industries, challenge regulatory frameworks, and force society to confront difficult questions about authenticity, ownership, and the value of human creativity.

For now, Seedance 2.0 remains in limited beta, but its impact is already being felt worldwide. The flood of viral clips, the heated debates on social media, and the scramble by competitors to catch up all point to a future where AI-generated video is not just a novelty, but a central force shaping the stories we see and the way we see them.

As the dust settles, one thing is clear: the release of Seedance 2.0 marks a new chapter in the AI revolution—one where the boundaries between human and machine-made art are being rewritten in real time, and the outcome is anyone’s guess.
 

February 12, 2026

ByteDance Unleashes Seedance 2.0 Shaking Hollywood

The viral launch of Seedance 2.0 by ByteDance promises faster, cheaper video production but sparks fierce debate over copyright, ethics, and the future of filmmaking.

Key Points

  • ByteDance launched Seedance 2.0 on February 10, 2026, as a generative AI video model capable of producing short cinematic clips with advanced director-level controls.
  • Seedance 2.0 is being adopted for advertising, ecommerce, and film production, promising faster, cheaper video creation and challenging traditional studio workflows.
  • The model raises ethical concerns over data transparency and copyright infringement due to unclear training data sources and resemblance to existing content.
The global race to dominate artificial intelligence has just taken a cinematic turn, and all eyes are on China’s ByteDance as it unveils Seedance 2.0—a generative AI video model that’s already sparking both awe and anxiety across the tech and creative industries. In just days since its limited beta release, Seedance 2.0 has gone viral, drawing comparisons to previous Chinese AI breakthroughs and igniting fierce debates over the future of filmmaking, advertising, and intellectual property.

Seedance 2.0 was officially launched by ByteDance, the parent company of TikTok, on February 10, 2026, as a limited beta on the Jimeng AI platform, according to India Today. This new model is not just another text-to-video tool; it’s a sophisticated engine built on ByteDance’s Seedream 5.0 architecture, capable of generating short cinematic clips—typically between 4 and 15 seconds—using a combination of text prompts, images, video references, and even audio inputs. Unlike its predecessors, Seedance 2.0 boasts director-level controls, offering users unprecedented command over motion, lighting, framing, and character consistency. The result? Videos that look and feel strikingly close to real studio productions.

Clips generated with Seedance 2.0 have flooded X (formerly Twitter) and other social platforms, with users marveling at the tool’s ability to produce everything from Kanye West-style music videos with dramatic lighting to outlandish action sequences like "Will Smith fighting a giant spaghetti monster" or "Godzilla versus a tiny cat." The buzz is so intense that many are calling Seedance 2.0 a "Hollywood killer," suggesting that the traditional film industry may need to brace itself for a new era of AI-driven content creation. One user summed up the sentiment by posting, "Tom Cruise v Brad Pitt created using Seedance 2.0. The character consistency, accuracy and details are too real. Hollywood is cooked."

But what’s really driving the excitement isn’t just the wow factor of the visuals. According to The China Project, Seedance 2.0 is being positioned not as a toy, but as a powerful production tool for advertising, ecommerce, and film. The promise is clear: faster, cheaper video production at a quality level that could upend existing workflows. Marketers, in particular, are eyeing the potential to test more creative variations and refresh campaigns more often—an advantage that could reshape the economics of digital advertising. As the volume of high-end video content rises, competition for viewers' attention and ad dollars is expected to intensify, putting pressure on traditional studios and agencies that rely on slower, more expensive production cycles.

Seedance 2.0 is already being integrated into various AI platforms, including Dreamina—a software suite similar to CapCut—as well as other video-generator tools, according to No Film School. ByteDance’s move is seen as a direct escalation in the AI arms race, with industry watchers framing it as China’s next breakout moment after the DeepSeek models made waves in 2025. The shift from text-based chatbots to generative media stacks—encompassing models, editing tools, and distribution platforms—signals a broader transformation in how digital content is conceived, produced, and consumed. As these tools spread globally, breakthroughs in one market can rapidly reset expectations everywhere.

Yet, for all the technological marvel, Seedance 2.0 is raising a host of practical and ethical questions that are impossible to ignore. The first and most glaring issue is data transparency. ByteDance, as No Film School points out, does not have a strong track record of clarifying what data is used to train its models. This opacity has fueled concerns that user data from apps like TikTok—or content shared online more broadly—could be feeding the AI’s learning process without adequate consent or oversight.

Copyright, too, looms large in the debate. Early Seedance 2.0 videos have been observed closely resembling well-known icons, characters, and intellectual property, often without any apparent partnerships or licensing agreements. As Justin Moore, a partner at a major U.S. venture capital firm, commented on X, "We are getting absolutely mogged by China on video models because they don’t care about copyright. And Seedance 2 isn’t even broadly available in the US yet. It’s going to dominate all your feeds with the real release later this month. Things to think about." The risk here is clear: as AI-generated content becomes indistinguishable from real footage, the potential for copyright infringement, unauthorized likenesses, and deepfakes grows exponentially.

The anxiety isn’t confined to legal circles. Filmmakers and creators are grappling with what some describe as a "perpetual existential crisis," unsure whether to embrace these technologies or fear for their livelihoods. While many in the creative community are open to new tools that can enhance their craft, the lack of clear answers about data usage, consent, and fair compensation is fueling skepticism and, at times, outright resistance. As No Film School notes, "New AI video models, or AI slop models, whatever you want to call them, like Seedance 2.0, are not providing answers and stability, and generally only making the debates surrounding the ethics and usability of AI louder and more severe."

Despite the storm of controversy, the momentum behind Seedance 2.0 is undeniable. Its demos showcase sharp visuals, surprisingly coherent narratives, and a level of polish that has outpaced rival models like Kling, Veo, and Sora. Tech analysts argue that if the quality holds up at scale, the competitive edge will shift decisively toward those who can iterate creative content the fastest—potentially redrawing the map for global media production.

The broader implications are profound. As the AI race pivots from words to worlds, the ability to generate high-quality video on demand could democratize filmmaking and advertising, making it accessible to a wider range of creators and brands. At the same time, it threatens to disrupt established industries, challenge regulatory frameworks, and force society to confront difficult questions about authenticity, ownership, and the value of human creativity.

For now, Seedance 2.0 remains in limited beta, but its impact is already being felt worldwide. The flood of viral clips, the heated debates on social media, and the scramble by competitors to catch up all point to a future where AI-generated video is not just a novelty, but a central force shaping the stories we see and the way we see them.

As the dust settles, one thing is clear: the release of Seedance 2.0 marks a new chapter in the AI revolution—one where the boundaries between human and machine-made art are being rewritten in real time, and the outcome is anyone’s guess.
seedance is just insane, another deepseek moment.
 

Chinese AI Models Surpass US Peers by Global Usage for Third Straight Week

DATE: 14 hours ago

(Yicai) March 23 -- Chinese large language models are dominating global usage of artificial intelligence, staying ahead of US competitors for the third week in a row, according to rankings released by OpenRouter.

Among the top nine models by global usage volume last week, five were Chinese, OpenRouter’s leaderboard showed today. The top four were Xiaomi Group’s MiMo-V2-Pro, Step 3.5 Flash from Shanghai Jieyue Xingchen Intelligent Technology, M2.5 from MiniMax, and DeepSeek’s V3.2. Z.ai’s GLM5 Turbo took the sixth spot.

Total usage was 7.359 trillion tokens, up 57 percent from 4.69T in the previous week.

The US models in the top nine were Claude Sonnet 4.6 and Claude Opus 4.6 from Anthropic, along with Gemini 3 Flash Preview and Gemini 2.5 Flash from Google, with total usage of 3.536T, up 7.3 percent from 3.294T in the prior week.

China’s open-source models saw the highest global download volume last year. They have proved popular partly through being free to use, but also because they allow developers worldwide to fine-tune them.

This has significantly lowered the barriers to using AI globally and made the technology truly a public good, Joseph Tsai, chairman of Chinese tech giant Alibaba Group Holding, said at the China Development Forum’s annual conference.

There was further evidence of the global influence of Chinese models last week, after US-based AI coding firm Cursor was found to have based its latest release on Kimi 2.5, a model released earlier this year by Chinese startup Moonshot AI.

Users soon spotted the similarities, with one posting online that Cursor’s Composer 2 is just Kimi 2.5 with reinforcement learning. This drew the attention of Tesla boss Elon Musk who commented on the post, saying "Yeah, it's Kimi 2.5."

Cursor swiftly admitted the connection. "It was a miss to not mention the Kimi base in our blog from the start. We'll fix that for the next model," founder Adam Sanger said in a post on X.


China's open-source dominance threatens US AI lead, US advisory body warns​

March 23, 20268:02 PM GMT+8Updated 6 hours ago

BEIJING, March 23 (Reuters) - The dominance of China's open-source artificial intelligence is creating a "self-reinforcing competitive advantage", allowing it to challenge U.S. rivals despite restricted access to advanced AI chips, a U.S. congressional ‌advisory body said on Monday.

Some estimates suggest that around 80% of U.S. AI startups now use Chinese open-source AI models.
 

Users who are viewing this thread

Latest Posts

Back
Top