Skymizer Unveils HTX301 AI Inference Accelerator
Taiwan-based Skymizer launches HTX301 decode accelerator that packs 384GB memory on a single PCIe card to run 700B-param…
9 articles about 'AI Inference'
Taiwan-based Skymizer launches HTX301 decode accelerator that packs 384GB memory on a single PCIe card to run 700B-param…
Pet-tech startup Tomofun deploys vision-language models on AWS Inferentia2 to slash inference costs while maintaining ac…
Cerebras Systems' third-generation wafer-scale chip delivers unprecedented AI inference speeds, outpacing GPU-based solu…
Cloudflare launches a new edge-native AI inference platform designed to bring low-latency LLM deployment to its global n…
Amazon AWS unveils its next-gen Trainium 3 AI chips, promising up to 4x better performance and significantly lower infer…
Meta's multi-billion dollar deal for tens of millions of AWS Graviton CPU cores may be the most underrated signal in AI …
Cloudflare expands Workers AI with serverless GPU inference, enabling developers to run AI models at the network edge wi…
NVIDIA's next-gen Blackwell Ultra architecture triples inference performance for large AI models, reshaping data center …
Amazon Web Services has announced the launch of G7e instances powered by NVIDIA RTX PRO 6000 Blackwell Server Edition GP…