📑 Table of Contents

Huawei Ascend 910C Trains 1.6T Model

📅 · 📁 Industry · 👁 1 views · ⏱️ 9 min read
💡 Huawei and partners train a 1.6 trillion parameter model on Ascend 910C, marking a major milestone for Chinese AI compute independence.

Chinese tech giant Huawei has achieved a significant breakthrough in domestic artificial intelligence infrastructure. A consortium led by Shenzhen Hetao College successfully completed the full-parameter post-training of a 1.6 trillion parameter model using Huawei's Ascend 910C chips.

This achievement demonstrates that Chinese hardware can now support the training of ultra-large language models (LLMs) without relying on restricted Western technology. The project involved collaboration with Harbin Institute of Technology (Shenzhen), the Shenzhen Big Data Research Institute, and Huawei itself.

Key Facts: The Ascend 910C Breakthrough

  • Model Scale: The consortium trained a massive 1.6 trillion parameter model, showcasing extreme scale capabilities.
  • Hardware Used: The training ran exclusively on Ascend 910C AI accelerators, Huawei's latest flagship chip.
  • Training Type: This was a full-parameter post-training session, not just fine-tuning or inference.
  • Collaboration Partners: The effort included Shenzhen Hetao College, HIT Shenzhen, and the Shenzhen Big Data Research Institute.
  • Strategic Impact: This proves domestic supply chains can handle next-gen AI workloads despite export controls.
  • Performance Metric: The system maintained high stability and efficiency during the extended training period.

Breaking the Compute Bottleneck

The successful training of a 1.6 trillion parameter model represents a critical threshold in AI development. Most global competitors rely on NVIDIA's H100 or B200 GPUs for such tasks. Huawei's ability to execute this workload on the Ascend 910C signals a maturing ecosystem.

Previously, Chinese firms struggled with software compatibility and hardware scalability when building large models. The Ascend 910C addresses these gaps by offering competitive floating-point performance. It rivals earlier generations of Western chips in raw computational power.

This milestone is not just about hardware specs. It validates the entire software stack, including CANN (Compute Architecture for Neural Networks). Developers often cite software fragmentation as a major barrier. This success suggests that Huawei's proprietary tools are becoming robust enough for enterprise-grade applications.

The collaboration between academic institutions and industry giants also highlights a shift in strategy. By pooling resources, these entities can share the immense costs of training large models. This model of cooperation may become standard for future research in regions facing technological sanctions.

Strategic Independence from US Chips

The geopolitical context of this announcement cannot be overstated. US export controls have severely limited China's access to advanced semiconductors. Companies like Alibaba and Tencent previously faced hurdles in scaling their AI operations due to chip shortages.

The Ascend 910C offers a viable alternative. While it may not yet match the absolute peak performance of NVIDIA's latest Blackwell architecture, it provides sufficient power for most commercial and research applications. This reduces reliance on imported technology and mitigates supply chain risks.

For Western observers, this development underscores the resilience of China's tech sector. Sanctions intended to slow progress may have inadvertently accelerated domestic innovation. Huawei has invested heavily in R&D to create a self-sufficient AI ecosystem.

This shift impacts global market dynamics. International companies operating in China must now consider local hardware options. The availability of capable domestic chips encourages local developers to build solutions optimized for Ascend architecture rather than CUDA.

Implications for Global AI Development

The rise of competitive non-Western AI hardware changes the landscape for developers worldwide. It introduces more options for cloud providers and enterprises seeking cost-effective alternatives. Competition often drives down prices and improves service quality.

However, fragmentation remains a concern. The AI community has largely standardized around NVIDIA's CUDA platform. As Huawei promotes its own ecosystem, developers face a choice: stick with the dominant standard or adapt to new tools.

This divergence could lead to a bifurcated AI market. One segment will continue to use Western hardware and software stacks. Another will emerge based on Chinese technologies, potentially creating incompatibilities in model portability.

Despite these challenges, increased competition benefits the overall industry. It forces all players to innovate faster. For researchers, having multiple high-performance platforms allows for broader experimentation and optimization techniques.

What This Means for Businesses

Enterprises in Asia and emerging markets should take note of this development. Access to powerful AI compute is no longer monopolized by Western providers. Local data centers equipped with Ascend chips can offer competitive pricing and latency advantages.

Businesses should evaluate their current AI infrastructure strategies. If you operate in regions affected by export controls, exploring Huawei's ecosystem is prudent. Early adoption can provide a strategic advantage as the software libraries mature.

Developers need to prepare for potential skill shifts. Learning CANN and MindSpore, Huawei's deep learning framework, may become valuable. These skills will be essential for optimizing models on Ascend hardware.

Furthermore, investors should watch for partnerships between Huawei and other tech firms. Collaborations similar to the one with Shenzhen Hetao College will likely expand. These alliances strengthen the domestic supply chain and reduce vulnerability to external shocks.

Looking Ahead: Future Milestones

The completion of this 1.6 trillion parameter training is just the beginning. Future efforts will focus on optimizing inference speeds and reducing energy consumption. Efficiency is crucial for making large-scale AI deployment economically viable.

We can expect to see more models trained on Ascend hardware in the coming months. As the software ecosystem matures, performance benchmarks will improve. This will narrow the gap with leading Western solutions.

Regulatory bodies in Europe and the US will monitor this trend closely. They may adjust policies in response to China's growing AI capabilities. This could lead to further restrictions or, conversely, new opportunities for dialogue and cooperation.

Ultimately, the AI race is becoming multipolar. No single region or company will dominate indefinitely. Diversity in hardware and software ecosystems fosters innovation and resilience across the global tech landscape.

Gogo's Take

  • 🔥 Why This Matters: This breaks the myth that Western chips are indispensable for top-tier AI. It empowers Chinese firms to compete globally without fearing sudden supply cuts, potentially lowering AI costs in emerging markets through localized competition.
  • ⚠️ Limitations & Risks: The software ecosystem (CANN/MindSpore) still lags behind NVIDIA's CUDA in maturity and developer community size. Porting existing models may require significant engineering effort, and long-term reliability at extreme scales needs more validation.
  • 💡 Actionable Advice: Developers should start experimenting with Huawei's MindSpore framework now to understand the differences from PyTorch/TensorFlow. Enterprises in Asia should pilot Ascend-based deployments to diversify their cloud infrastructure and avoid vendor lock-in with Western providers.